ViHealthBERT Base Word
Developed by demdecuong
ViHealthBERT is a pre-trained language model for Vietnamese health text mining that provides a strong baseline in the healthcare domain.
Downloads 633
Release Time: 4/20/2022
Model Overview
A pre-trained language model designed specifically for Vietnamese medical and health texts. It supports tasks such as named entity recognition, abbreviation disambiguation, and text summarization.
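A minimal loading sketch, assuming the checkpoint is published on the Hugging Face Hub under the id demdecuong/vihealthbert-base-word (inferred from the developer and model name above, not stated on this page) and that the transformers and torch packages are installed. The helper names load_vihealthbert and embed are hypothetical.

```python
# Assumed Hub id, inferred from the developer and model name on this page.
MODEL_ID = "demdecuong/vihealthbert-base-word"


def load_vihealthbert(model_id: str = MODEL_ID):
    """Return (tokenizer, model) for the given checkpoint id."""
    # Imported lazily so the module can be inspected without the heavy dependency.
    from transformers import AutoModel, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModel.from_pretrained(model_id)
    return tokenizer, model


def embed(text: str, tokenizer, model):
    """Return one contextual vector per sub-word token of `text`."""
    import torch

    inputs = tokenizer(text, return_tensors="pt")
    with torch.no_grad():
        outputs = model(**inputs)
    # Shape: (sequence_length, hidden_size) for the single input sentence.
    return outputs.last_hidden_state[0]
```

Calling load_vihealthbert() downloads the weights on first use. The resulting encoder would normally be fine-tuned with a task-specific head rather than used as is.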
Model Features
Medical domain optimization
Pre-trained specifically for Vietnamese medical and health texts, with strong performance on in-domain tasks
Dual tokenizer support
Provides both word-level and syllable-level tokenizer versions to adapt to different application scenarios
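The difference between the two input styles can be illustrated with a toy segmenter. In Vietnamese, one word often spans several space-separated syllables, and word-level models conventionally expect those syllables joined by underscores. The sketch below uses greedy longest matching against a tiny hand-written lexicon. Both the segment_words helper and the lexicon are hypothetical, and a real pipeline would likely use a dedicated Vietnamese word segmenter instead.

```python
def segment_words(text: str, lexicon: set, max_len: int = 4) -> str:
    """Join the syllables of known multi-syllable words with underscores.

    Greedy longest match: at each position, try the longest candidate first
    and fall back to a single syllable when nothing in the lexicon matches.
    """
    syllables = text.split()
    words = []
    i = 0
    while i < len(syllables):
        for n in range(min(max_len, len(syllables) - i), 0, -1):
            candidate = " ".join(syllables[i:i + n])
            if n == 1 or candidate.lower() in lexicon:
                words.append("_".join(syllables[i:i + n]))
                i += n
                break
    return " ".join(words)


# Toy lexicon of multi-syllable words (illustrative only).
LEXICON = {"bệnh nhân", "hà nội"}

syllable_input = "bệnh nhân bị sốt cao"            # syllable-level style
word_input = segment_words(syllable_input, LEXICON)  # word-level style
```

Here the syllable-level variant would consume syllable_input directly, while the word-level variant would consume word_input.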
Accompanying datasets
Released together with a medical abbreviation dataset (acrDrAid) and a frequently asked questions (FAQ) summarization dataset
Model Capabilities
Vietnamese medical text understanding
Named entity recognition
Abbreviation disambiguation
Text summarization
Use Cases
Medical information processing
COVID-19 entity recognition
Identifying COVID-19 related entities from Vietnamese medical texts
Reported state-of-the-art performance on the COVID-19 and ViMQ datasets
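Entity recognition with an encoder of this kind is usually framed as token classification, where each token receives a BIO tag that is then grouped into entity spans. The sketch below shows only that grouping step. The bio_to_spans helper and the SYMPTOM and LOCATION labels are hypothetical and are not taken from the datasets named above.

```python
def bio_to_spans(tokens, tags):
    """Group BIO-tagged tokens into (label, text) entity spans.

    An I- tag that does not continue the currently open entity is treated
    like O, so malformed sequences are dropped rather than guessed at.
    """
    spans = []
    label, parts = None, []
    for token, tag in zip(tokens, tags):
        if tag.startswith("B-"):
            if label is not None:
                spans.append((label, " ".join(parts)))
            label, parts = tag[2:], [token]
        elif tag.startswith("I-") and label == tag[2:]:
            parts.append(token)
        else:
            if label is not None:
                spans.append((label, " ".join(parts)))
            label, parts = None, []
    if label is not None:
        spans.append((label, " ".join(parts)))
    return spans


# Hypothetical model output for a word-segmented sentence.
tokens = ["bệnh_nhân", "bị", "sốt", "cao", "tại", "Hà_Nội"]
tags = ["O", "O", "B-SYMPTOM", "I-SYMPTOM", "O", "B-LOCATION"]
entities = bio_to_spans(tokens, tags)
```

In practice the tags would come from the arg-max over a fine-tuned classification head, aligned back from sub-word pieces to whole tokens.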
Medical abbreviation resolution
Resolving professional abbreviations in Vietnamese medical documents
Reported strong performance on the acrDrAid dataset
Medical text summarization
FAQ summarization
Generating concise summaries of frequently asked medical questions in Vietnamese