ClinicalT5-base open-source language model: free to deploy, with support for medical natural language processing tasks

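ClinicalT5-base

Developed by luqh

ClinicalT5 is a generative language model based on the T5 architecture. It is pre-trained specifically on clinical text and is suited to natural language processing tasks in the medical domain.

Large language model

Transformers

#ClinicalTextGeneration #MedicalDomainSpecific #T5ArchitectureOptimized

Downloads: 8,202

Released: 2/7/2023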

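Model overview

ClinicalT5 is a generative language model optimized for clinical text. It can handle text generation and understanding tasks in the medical domain, such as clinical note generation and medical question answering.

Model features

Clinical-domain optimization

Because it is pre-trained specifically on clinical text, the model is better able to understand and generate the terminology and phrasing used in the medical domain.

Generative model

Built on the T5 architecture, the model can handle a range of text generation tasks, such as question answering and summarization.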
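
T5 treats every task as mapping one string to another, so question answering and summarization differ only in how the input and target strings are built. The sketch below illustrates that formatting. The `question:`/`context:` and `summarize:` prefixes and the `Text2TextExample` class are assumptions chosen for illustration, not conventions documented for ClinicalT5, and the model would likely need fine-tuning on pairs like these before it gives useful outputs.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Text2TextExample:
    """One input/output pair in T5's text-to-text format (hypothetical helper)."""

    source: str  # text fed to the encoder
    target: str  # text the decoder is expected to produce


def make_qa_example(question: str, context: str, answer: str) -> Text2TextExample:
    # Assumed prefix style; adjust to whatever format is used for fine-tuning.
    return Text2TextExample(
        source=f"question: {question} context: {context}",
        target=answer,
    )


def make_summary_example(note: str, summary: str) -> Text2TextExample:
    # Assumed prefix style; the prefix is illustrative only.
    return Text2TextExample(source=f"summarize: {note}", target=summary)
```

Both helpers return the same structure, which is the point of the text-to-text framing: one model and one training loop can serve several tasks.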

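Multi-task processing

The model can be applied to a variety of clinical natural language processing tasks, such as text classification and named entity recognition.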
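
With a generative model, classification is usually handled by having the model emit the label as text and then mapping that text back to a known label set. A minimal sketch of the mapping step is shown here; `parse_label` and the example label names are hypothetical, and the sketch assumes the model has been fine-tuned to output one label string per input.

```python
from typing import Optional, Sequence


def parse_label(generated: str, labels: Sequence[str]) -> Optional[str]:
    """Map generated text to one of the known labels, or None if nothing matches."""
    cleaned = generated.strip().lower()
    for label in labels:
        # Case-insensitive exact match after trimming whitespace.
        if cleaned == label.lower():
            return label
    return None
```

Returning `None` for unmatched text makes it explicit when the model has generated something outside the label set, so the caller can decide how to handle it.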

Model capabilities

Clinical text generation

Medical question answering

Clinical text summarization

Medical text classification

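Use cases

Clinical document processing

Clinical note generation

Generates a complete clinical note from a physician's dictation or brief records.

Improves the efficiency of clinical documentation

Medical question answering system

Answers medical knowledge questions from patients and healthcare professionals.

Provides accurate medical information

Medical research

Medical literature summarization

Automatically generates summaries of medical research papers.

Speeds up searching and reading of the medical literature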

🚀 transformers

The transformers library provides convenient tools for working with pre-trained models. This example shows how to use the ClinicalT5-base model for clinical text processing.

🚀 Quick start

The following code shows how to load the ClinicalT5-base model with the transformers library.

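from transformers import AutoTokenizer, T5ForConditionalGeneration

# Load the tokenizer and the model weights from the Hugging Face Hub.
tokenizer = AutoTokenizer.from_pretrained("luqh/ClinicalT5-base")
# from_flax=True loads the Flax checkpoint into the PyTorch model class (flax must be installed).
model = T5ForConditionalGeneration.from_pretrained("luqh/ClinicalT5-base", from_flax=True)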
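
The loading code above stops before any inference. T5-style models are pre-trained to fill in masked spans marked by sentinel tokens, so a simple first check is to mask part of a sentence and let the model generate the missing text. The sketch below assumes that the ClinicalT5 tokenizer keeps the standard T5 sentinel tokens (`<extra_id_0>`, `<extra_id_1>`, ...); `mask_spans` and `fill_masked` are hypothetical helper names, and the quality of the output is not guaranteed.

```python
from typing import List


def mask_spans(text: str, spans: List[str]) -> str:
    """Replace the first occurrence of each span with a T5 sentinel token.

    Spans should be listed in the order they appear in the text so that
    the sentinel ids increase from left to right.
    """
    next_id = 0
    for span in spans:
        if span in text:
            text = text.replace(span, f"<extra_id_{next_id}>", 1)
            next_id += 1
    return text


def fill_masked(prompt: str, max_new_tokens: int = 20) -> str:
    """Generate text for the masked spans in `prompt` (downloads the model)."""
    # Imported here so that mask_spans can be used without the heavy dependency.
    from transformers import AutoTokenizer, T5ForConditionalGeneration

    tokenizer = AutoTokenizer.from_pretrained("luqh/ClinicalT5-base")
    model = T5ForConditionalGeneration.from_pretrained(
        "luqh/ClinicalT5-base", from_flax=True
    )
    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    # Keep special tokens so the sentinel markers remain visible in the output.
    return tokenizer.decode(output_ids[0], skip_special_tokens=False)
```

For example, `fill_masked(mask_spans("The patient was admitted with chest pain.", ["chest pain"]))` would download the checkpoint and return the model's guess for the masked span, delimited by sentinel tokens.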

📚 Documentation

If you find this resource useful, please cite our work: ClinicalT5: A Generative Language Model for Clinical Text

@inproceedings{lu-etal-2022-clinicalt5,
    title = "{C}linical{T}5: A Generative Language Model for Clinical Text",
    author = "Lu, Qiuhao  and
      Dou, Dejing  and
      Nguyen, Thien",
    booktitle = "Findings of the Association for Computational Linguistics: EMNLP 2022",
    month = dec,
    year = "2022",
    address = "Abu Dhabi, United Arab Emirates",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2022.findings-emnlp.398",
    pages = "5436--5443",
    abstract = "In the past few years, large pre-trained language models (PLMs) have been widely adopted in different areas and have made fundamental improvements over a variety of downstream tasks in natural language processing (NLP). Meanwhile, domain-specific variants of PLMs are being proposed to address the needs of domains that demonstrate a specific pattern of writing and vocabulary, e.g., BioBERT for the biomedical domain and ClinicalBERT for the clinical domain. Recently, generative language models like BART and T5 are gaining popularity with their competitive performance on text generation as well as on tasks cast as generative problems. However, in the clinical domain, such domain-specific generative variants are still underexplored. To address this need, our work introduces a T5-based text-to-text transformer model pre-trained on clinical text, i.e., ClinicalT5. We evaluate the proposed model both intrinsically and extrinsically over a diverse set of tasks across multiple datasets, and show that ClinicalT5 dramatically outperforms T5 in the domain-specific tasks and compares favorably with its close baselines.",
}