Medical Summarization
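Developed by Falconsai
A specialized variant of the T5 Transformer architecture, fine-tuned for medical text summarization. It generates concise, coherent summaries of medical documents, research papers, clinical notes, and other healthcare-related text.
Downloads: 2,215
Released: 10/23/2023
Model Overview
Pre-trained on a broad range of medical literature, the model captures specialized medical terminology, extracts key information, and produces meaningful summaries of medical text.
Model Features
Medical domain adaptation
Pre-training on a broad range of medical literature lets the model capture specialized medical terminology and generate high-quality summaries of medical text.
High-performance summarization
Reports a ROUGE score (F1) of 0.95 on medical text summarization.
Diverse training data
The training dataset combines diverse medical documents, clinical studies, and human-written summaries, so that the model can handle a wide range of medical text.
Model Capabilities
Medical text summarization
Key information extraction
Medical terminology recognition
Use Cases
Medical research
Research paper summarization
Generates concise summaries of long medical research papers so that researchers can grasp the core content quickly.
Produces condensed summaries that retain the core technical descriptions and key medical findings.
Clinical note summarization
Extracts key information from complex clinical notes and produces brief summaries for healthcare professionals.
Helps healthcare professionals get key patient information quickly, improving efficiency.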
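🚀 T5 Large for Medical Text Summarization
T5 Large for Medical Text Summarization is a specialized variant of the T5 Transformer model, fine-tuned for medical text summarization. It is designed to generate concise and coherent summaries of medical documents, research papers, clinical notes, and other healthcare-related text.
The T5 Large model ("t5-large") was pre-trained on a broad range of medical literature, which enables it to capture intricate medical terminology, extract key information, and generate meaningful summaries. Fine-tuning was carried out with close attention to hyperparameter settings, including batch size and learning rate, to optimize performance on medical text summarization.
During fine-tuning, a batch size of 8 was chosen for efficiency, and a learning rate of 2e-5 was chosen to balance convergence speed against model optimization. These settings are intended to let the model produce high-quality, informative, and coherent medical summaries.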
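As a rough sketch, the hyperparameters named above can be collected in a small configuration object. Only the base checkpoint name, the batch size of 8, and the learning rate of 2e-5 come from the description. The class name `FineTuneConfig`, the helper `steps_per_epoch`, and the dataset size of 1,000 are hypothetical, since the size of the fine-tuning dataset is not stated.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class FineTuneConfig:
    """Hypothetical container for the hyperparameters named in the text."""
    base_model: str = "t5-large"   # checkpoint named in the description
    batch_size: int = 8            # stated batch size
    learning_rate: float = 2e-5    # stated learning rate


def steps_per_epoch(num_examples: int, batch_size: int) -> int:
    """Number of batches needed to see every example once."""
    return math.ceil(num_examples / batch_size)


config = FineTuneConfig()
# 1,000 is an assumed dataset size, used only to illustrate the arithmetic.
print(steps_per_epoch(1000, config.batch_size))  # 125
```

The step count assumes one optimizer step per batch, with no gradient accumulation.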
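The fine-tuning dataset consists of diverse medical documents, clinical studies, and healthcare research, paired with human-generated summaries. This diversity helps the model summarize medical information accurately and concisely.
The goal of training this model is to give medical professionals, researchers, and healthcare institutions a tool that automatically generates high-quality summaries of medical content, so that key information can be reached faster.
✨ Key Features
- Specialized fine-tuning: T5 Large was carefully fine-tuned for medical text summarization, so it handles the specialized terminology and complex content of the medical domain better.
- Rich data: Diverse medical documents, clinical studies, and healthcare research were used as fine-tuning data, which improves the model's ability to adapt to different kinds of medical text.
- Tuned parameters: Batch size and learning rate were set deliberately during fine-tuning to optimize model performance.
📦 Installation
The source documentation does not give installation steps, so this section is omitted.
💻 Usage Examples
Basic usage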
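from transformers import pipeline

# Build a summarization pipeline. The model ID below is the placeholder from the
# original example; replace it with the ID of the checkpoint you want to load.
summarizer = pipeline("summarization", model="your/medical_text_summarization_model")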
MEDICAL_DOCUMENT = """
duplications of the alimentary tract are well - known but rare congenital malformations that can occur anywhere in the gastrointestinal ( gi ) tract from the tongue to the anus . while midgut duplications are the most common , foregut duplications such as oesophagus , stomach , and parts 1 and 2 of the duodenum account for approximately one - third of cases .
they are most commonly seen either in the thorax or abdomen or in both as congenital thoracoabdominal duplications .
cystic oesophageal duplication ( ced ) , the most common presentation , is often found in the lower third part ( 60 - 95% ) and on the right side [ 2 , 3 ] . hydatid cyst ( hc ) is still an important health problem throughout the world , particularly in latin america , africa , and mediterranean areas .
turkey , located in the mediterranean area , shares this problem , with an estimated incidence of 20/100 000 .
most commonly reported effected organ is liver , but in children the lungs are the second most frequent site of involvement [ 4 , 5 ] . in both ced and hc , the presentation depends on the site and the size of the cyst .
hydatid cysts are far more common than other cystic intrathoracic lesions , especially in endemic areas , so it is a challenge to differentiate ced from hc in these countries . here ,
we present a 7-year - old girl with intrathoracic cystic mass lesion , who had been treated for hydatid cyst for 9 months , but who turned out to have oesophageal cystic duplication .
a 7-year - old girl was referred to our clinic with coincidentally established cystic intrathoracic lesion during the investigation of aetiology of anaemia .
the child was first admitted with loss of vision in another hospital ten months previously .
the patient 's complaints had been attributed to pseudotumour cerebri due to severe iron deficiency anaemia ( haemoglobin : 3 g / dl ) .
chest radiography and computed tomography ( ct ) images resulted in a diagnosis of cystic intrathoracic lesion ( fig .
the cystic mass was accepted as a type 1 hydatid cyst according to world health organization ( who ) classification .
after 9 months of medication , no regression was detected in ct images , so the patient was referred to our department .
an ondirect haemagglutination test result was again negative . during surgery , after left thoracotomy incision , a semi - mobile cystic lesion , which was almost seven centimetres in diameter , with smooth contour , was found above the diaphragm , below the lung , outside the pleura ( fig .
the entire fluid in the cyst was aspirated ; it was brown and bloody ( fig .
2 ) . the diagnosis of cystic oesophageal duplication was considered , and so an attachment point was searched for .
it was below the hiatus , on the lower third left side of the oesophagus , and it also was excised completely through the hiatus .
pathologic analysis of the specimen showed oesophageal mucosa with an underlying proper smooth muscle layer .
computed tomography image of the cystic intrathoracic lesion cystic lesion with brownish fluid in the cyst
compressible organs facilitate the growth of the cyst , and this has been proposed as a reason for the apparent prevalence of lung involvement in children . diagnosis is often incidental and can be made with serological tests and imaging [ 5 , 7 ] .
laboratory investigations include the casoni and weinberg skin tests , indirect haemagglutination test , elisa , and the presence of eosinophilia , but can be falsely negative because children may have a poor serological response to eg .
false - positive reactions are related to the antigenic commonality among cestodes and conversely seronegativity can not exclude hydatidosis .
false - negative results are observed when cysts are calcified , even if fertile [ 4 , 8 ] . in our patient iha levels were negative twice .
due to the relatively non - specific clinical signs , diagnosis can only be made confidently using appropriate imaging .
plain radiographs , ultrasonography ( us ) , or ct scans are sufficient for diagnosis , but magnetic resonance imaging ( mri ) is also very useful [ 5 , 9 ] .
computed tomography demonstrates cyst wall calcification , infection , peritoneal seeding , bone involvement fluid density of intact cysts , and the characteristic internal structure of both uncomplicated and ruptured cysts [ 5 , 9 ] .
the conventional treatment of hydatid cysts in all organs is surgical . in children , small hydatid cysts of the lungs
respond favourably to medical treatment with oral administration of certain antihelminthic drugs such as albendazole in certain selected patients .
the response to therapy differs according to age , cyst size , cyst structure ( presence of daughter cysts inside the mother cysts and thickness of the pericystic capsule allowing penetration of the drugs ) , and localization of the cyst . in children , small cysts with thin pericystic capsule localised in the brain and lungs respond favourably [ 6 , 11 ] .
respiratory symptoms are seen predominantly in cases before two years of age . in our patient , who has vision loss , the asymptomatic duplication cyst was found incidentally .
the lesion occupied the left hemithorax although the most common localisation reported in the literature is the lower and right oesophagus .
the presentation depends on the site and the size of the malformations , varying from dysphagia and respiratory distress to a lump and perforation or bleeding into the intestine , but cysts are mostly diagnosed incidentally .
if a cystic mass is suspected in the chest , the best technique for evaluation is ct .
magnetic resonance imaging can be used to detail the intimate nature of the cyst with the spinal canal .
duplications should have all three typical signs : first of all , they should be attached to at least one point of the alimentary tract ; second and third are that they should have a well - developed smooth muscle coat , and the epithelial lining of duplication should represent some portions of alimentary tract , respectively [ 2 , 10 , 12 ] . in summary , the cystic appearance of both can cause a misdiagnosis very easily due to the rarity of cystic oesophageal duplications as well as the higher incidence of hydatid cyst , especially in endemic areas .
"""
print(summarizer(MEDICAL_DOCUMENT, max_length=2000, min_length=1500, do_sample=False))
# Example output:
# [{'summary_text': 'duplications of the alimentary tract are well - known but rare congenital malformations that can occur anywhere in the gastrointestinal ( gi ) tract from the tongue to the anus . in children , small hydatid cysts with thin pericystic capsule localised in the brain and lungs respond favourably to medical treatment with oral administration of certain antihelminthic drugs such as albendazole , and the epithelial lining of duplication should represent some parts of the oesophageal lesion ( hc ) , the most common presentation is . a 7-year - old girl was referred to our clinic with coincidentally established cystic intrathoracic lesion with brownish fluid in the cyst was found in the lower third part ( 60 - 95% ) and on the right side .'}]
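The example above passes a full article to the pipeline in a single call. T5 checkpoints are commonly configured with an input limit of about 512 tokens, so for long documents it may help to split the text and summarize each part. The following is a minimal sketch under that assumption. The helpers `split_into_chunks` and `summarize_long`, the 350-word budget (a crude stand-in for a token count), and the stand-in summarizer `first_five_words` are all hypothetical and not part of the model card.

```python
from typing import Callable, List


def split_into_chunks(text: str, max_words: int = 350) -> List[str]:
    """Split text into consecutive chunks of at most max_words whitespace-separated words."""
    if max_words <= 0:
        raise ValueError("max_words must be positive")
    words = text.split()
    return [" ".join(words[i:i + max_words]) for i in range(0, len(words), max_words)]


def summarize_long(text: str, summarize_fn: Callable[[str], str], max_words: int = 350) -> str:
    """Summarize each chunk independently and join the partial summaries in order."""
    return " ".join(summarize_fn(chunk) for chunk in split_into_chunks(text, max_words))


# Stand-in summarizer for demonstration only: keeps the first five words of a chunk.
def first_five_words(chunk: str) -> str:
    return " ".join(chunk.split()[:5])


demo = " ".join(f"w{i}" for i in range(12))
print(split_into_chunks(demo, max_words=5))
print(summarize_long(demo, first_five_words, max_words=5))
```

With the pipeline from the example, `summarize_fn` could be something like `lambda chunk: summarizer(chunk, do_sample=False)[0]["summary_text"]`, which reads the `summary_text` field shown in the example output. Length limits such as `max_length` and `min_length` count tokens and would then apply to each chunk separately.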
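📚 Detailed Documentation
Intended Use
- Medical text summarization: The primary purpose of this model is to generate concise and coherent summaries of medical documents, research papers, clinical notes, and other healthcare-related text. It is intended to help medical professionals, researchers, and healthcare institutions summarize complex medical information.
Limitations
- Specialized fine-tuning: Although the model performs well on medical text summarization, its performance may vary when it is applied to other natural language processing tasks. Users who want to use it for a different task should look at the fine-tuned versions available on the model hub for best results.
Training Data
The training data comprises diverse medical documents, clinical studies, and healthcare research, together with the corresponding human-generated summaries. Fine-tuning aims to make the model effective at generating high-quality summaries of medical text.
Training Statistics
| Metric | Value |
|---|---|
| Evaluation loss | 0.012345678901234567 |
| Evaluation ROUGE score (F1) | 0.95 |
| Evaluation runtime | 2.3456 |
| Evaluation samples per second | 1234.56 |
| Evaluation steps per second | 45.678 |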
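The table reports a ROUGE F1 score without saying which ROUGE variant was used. To show what such a score measures, here is a minimal sketch of ROUGE-1 F1, the unigram-overlap variant, which is assumed purely for illustration. Lowercasing and whitespace tokenization are simplifications, the function name `rouge1_f1` and the two example sentences are hypothetical, and this is not the evaluation code used for the model.

```python
from collections import Counter


def rouge1_f1(candidate: str, reference: str) -> float:
    """Unigram-overlap F1 between a candidate summary and a reference summary."""
    cand_counts = Counter(candidate.lower().split())
    ref_counts = Counter(reference.lower().split())
    # Counter intersection keeps the minimum count per word (clipped matches).
    overlap = sum((cand_counts & ref_counts).values())
    if overlap == 0:
        return 0.0
    precision = overlap / sum(cand_counts.values())
    recall = overlap / sum(ref_counts.values())
    return 2 * precision * recall / (precision + recall)


reference = "the cyst was an oesophageal duplication"
candidate = "the cyst was a hydatid cyst"
print(rouge1_f1(candidate, reference))  # 0.5
```

In this example three of the six candidate words match the reference, and the repeated word "cyst" is counted only once, so precision and recall are both 0.5. Word overlap can therefore be high even when a summary gets a key finding wrong.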
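Responsible Use
When applying this model to real-world medical applications, especially those involving sensitive patient data, it must be used responsibly and ethically, in compliance with content guidelines, privacy regulations, and ethical considerations.
References
- Hugging Face Model Hub
- T5 paper
Disclaimer
The model's performance may be affected by the quality and representativeness of its fine-tuning data. Users are advised to evaluate whether the model is suitable for their specific medical application and dataset.
📄 License
This project is licensed under the Apache 2.0 license.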