ViHealthBERT Base Word

Developed by demdecuong
ViHealthBERT is a pre-trained language model for Vietnamese health text mining that provides a strong baseline for healthcare-domain tasks.
Downloads 633
Release Time: 4/20/2022

Model Overview

A pre-trained language model designed specifically for Vietnamese medical and health texts. It supports tasks such as named entity recognition, abbreviation disambiguation, and text summarization.

Model Features

Medical domain optimization
Pre-trained specifically for Vietnamese medical and health texts, with strong results on related tasks
Dual tokenizer support
Available in both word-level and syllable-level tokenizer versions to suit different application scenarios
Accompanying datasets
Released together with a medical abbreviation dataset (acrDrAid) and a frequently asked questions (FAQ) summarization dataset
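
The word-level and syllable-level versions listed above can be selected with a small helper. This is a minimal sketch, not the authors' documented usage. It assumes the checkpoints are published on the Hugging Face Hub under the names `demdecuong/vihealthbert-base-word` and `demdecuong/vihealthbert-base-syllable` and that they load through the standard `transformers` auto classes. The helper names `checkpoint_name` and `load_vihealthbert` are hypothetical.

```python
from typing import Tuple

# Assumption: Hub repositories follow this naming pattern.
HUB_NAMESPACE = "demdecuong"
VARIANTS: Tuple[str, ...] = ("word", "syllable")


def checkpoint_name(variant: str = "word") -> str:
    """Return the assumed Hub identifier for a tokenizer variant."""
    if variant not in VARIANTS:
        raise ValueError(f"variant must be one of {VARIANTS}, got {variant!r}")
    return f"{HUB_NAMESPACE}/vihealthbert-base-{variant}"


def load_vihealthbert(variant: str = "word"):
    """Download and return (tokenizer, model) for the chosen variant.

    Requires the `transformers` package and network access; the import is
    deferred so that checkpoint_name() works without either.
    """
    from transformers import AutoModel, AutoTokenizer

    name = checkpoint_name(variant)
    tokenizer = AutoTokenizer.from_pretrained(name)
    model = AutoModel.from_pretrained(name)
    return tokenizer, model


# Example (downloads weights on first use):
# tokenizer, model = load_vihealthbert("syllable")
# batch = tokenizer("Bệnh nhân bị sốt cao", return_tensors="pt")
# hidden_states = model(**batch).last_hidden_state
```

The word-level version may expect input that has already been segmented into words, so check the model's own documentation before feeding it raw text.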

Model Capabilities

Vietnamese medical text understanding
Named entity recognition
Abbreviation disambiguation
Text summarization

Use Cases

Medical information processing
COVID-19 entity recognition
Identifying COVID-19 related entities from Vietnamese medical texts
Achieved state-of-the-art (SOTA) performance on the COVID-19 and ViMQ datasets
Medical abbreviation resolution
Resolving professional abbreviations in Vietnamese medical documents
Excellent performance on the acrDrAid dataset
Medical text summarization
FAQ summarization
Generating concise summaries of frequently asked medical questions in Vietnamese
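
For the entity recognition use case above, a fine-tuned token classifier typically emits one tag per token, and those tags still have to be merged into entity spans. The sketch below shows one common way to do that, assuming a BIO tagging scheme. The source does not state which scheme the datasets use, and the function name `group_entities`, the sample tokens, and the labels `SYMPTOM` and `LOCATION` are hypothetical.

```python
from typing import List, Optional, Tuple


def group_entities(tokens: List[str], tags: List[str]) -> List[Tuple[str, str]]:
    """Merge per-token BIO tags into (entity_text, label) pairs.

    An "I-" tag that does not continue an open span with the same label is
    treated like "O" and dropped.
    """
    spans: List[Tuple[str, str]] = []
    current: List[str] = []
    label: Optional[str] = None

    def flush() -> None:
        # Emit the open span, if any, and reset the accumulator.
        nonlocal current, label
        if current and label is not None:
            spans.append((" ".join(current), label))
        current, label = [], None

    for token, tag in zip(tokens, tags):
        if tag.startswith("B-"):
            flush()
            current, label = [token], tag[2:]
        elif tag.startswith("I-") and label == tag[2:]:
            current.append(token)
        else:
            flush()
    flush()
    return spans


# Hypothetical model output for a short Vietnamese sentence.
tokens = ["Bệnh_nhân", "sốt", "cao", "tại", "Hà_Nội"]
tags = ["O", "B-SYMPTOM", "I-SYMPTOM", "O", "B-LOCATION"]
entities = group_entities(tokens, tags)
```

Here `entities` holds two spans, `("sốt cao", "SYMPTOM")` and `("Hà_Nội", "LOCATION")`.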