S

Sinbert Small

Developed by NLPC-UOM
SinBERT is a model pretrained on a large Sinhala monolingual corpus (sin-cc-15M) based on the RoBERTa architecture, suitable for Sinhala text processing tasks.
Downloads 126
Release Time : 3/2/2022

Model Overview

This model is specifically optimized for Sinhala text processing and can be used for various Sinhala natural language processing tasks.

Model Features

Sinhala-specific pretraining
Pretrained using a large Sinhala monolingual corpus (sin-cc-15M), optimized for Sinhala language characteristics
Based on RoBERTa architecture
Adopts the RoBERTa architecture, inheriting its excellent text processing capabilities
Academic research support
Related research was published at the LREC 2022 conference

Model Capabilities

Sinhala text understanding
Sinhala text classification

Use Cases

Academic research
Sinhala text analysis
Used for Sinhala linguistic research and text analysis
Commercial applications
Sinhala content classification
Can be used for automatic classification of Sinhala news and social media content
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase