SinBERT Small
Developed by NLPC-UOM
SinBERT is a RoBERTa-based model pretrained on a large Sinhala monolingual corpus (sin-cc-15M), suitable for Sinhala text processing tasks.
Downloads 126
Release date: 3/2/2022
Model Overview
This model is pretrained specifically on Sinhala text and can serve as a starting point for a range of Sinhala natural language processing tasks.
Model Features
Sinhala-specific pretraining
Pretrained using a large Sinhala monolingual corpus (sin-cc-15M), optimized for Sinhala language characteristics
Based on RoBERTa architecture
Adopts the RoBERTa architecture, a BERT variant trained with a masked language modeling objective
Academic research support
Related research was published at the LREC 2022 conference
Model Capabilities
Sinhala text understanding
Sinhala text classification
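As a rough illustration of the text understanding capability, the sketch below queries the model for masked-token predictions through the Hugging Face transformers library. The Hub identifier NLPC-UOM/SinBERT-small is an assumption (this page does not state it), and top_predictions and fill_mask are hypothetical helper names introduced only for this example. It assumes the input contains a single [MASK] placeholder.

```python
MODEL_ID = "NLPC-UOM/SinBERT-small"  # assumed Hub identifier, not stated on this page


def top_predictions(results, k=3):
    """Return the k highest-scoring (token, score) pairs from fill-mask output."""
    ranked = sorted(results, key=lambda r: r["score"], reverse=True)
    return [(r["token_str"].strip(), round(r["score"], 4)) for r in ranked[:k]]


def fill_mask(text, model_id=MODEL_ID, k=3):
    """Predict the single masked token in `text` (hypothetical helper)."""
    # Imported lazily so top_predictions works without transformers installed.
    from transformers import pipeline

    unmasker = pipeline("fill-mask", model=model_id)
    # Swap the generic placeholder for whatever mask token the tokenizer defines.
    masked = text.replace("[MASK]", unmasker.tokenizer.mask_token)
    return top_predictions(unmasker(masked, top_k=k), k)
```

Calling fill_mask with a Sinhala sentence that contains one [MASK] placeholder should return up to k candidate tokens with their scores; the first call downloads the model weights.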
Use Cases
Academic research
Sinhala text analysis
Used for Sinhala linguistic research and text analysis
Commercial applications
Sinhala content classification
Can be used for automatic classification of Sinhala news and social media content
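A content classifier of this kind could be sketched as follows, using the Hugging Face transformers library. This is not a documented workflow for this model: the Hub identifier, the label names, and the helpers logits_to_label and classify are all assumptions made for illustration. The classification head added here is newly initialized, so its predictions are meaningless until the model has been fine-tuned on labeled Sinhala data.

```python
import math

MODEL_ID = "NLPC-UOM/SinBERT-small"  # assumed Hub identifier, not stated on this page
LABELS = ["news", "social_media"]    # hypothetical label set for illustration


def logits_to_label(logits, labels):
    """Apply a softmax to raw logits and return (best_label, probability)."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    probs = [e / total for e in exps]
    best = max(range(len(probs)), key=probs.__getitem__)
    return labels[best], probs[best]


def classify(text, model_id=MODEL_ID, labels=LABELS):
    """Run one text through a sequence-classification head (hypothetical helper)."""
    # Imported lazily so logits_to_label works without torch or transformers.
    import torch
    from transformers import AutoModelForSequenceClassification, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    # The head is randomly initialized here; fine-tune before trusting outputs.
    model = AutoModelForSequenceClassification.from_pretrained(
        model_id, num_labels=len(labels)
    )
    model.eval()
    inputs = tokenizer(text, return_tensors="pt", truncation=True)
    with torch.no_grad():
        logits = model(**inputs).logits[0].tolist()
    return logits_to_label(logits, labels)
```

In practice, classify would be called on a checkpoint saved after fine-tuning, by passing its path as model_id, rather than on the pretrained weights directly.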