S

Sundanese Roberta Base

Developed by w11wo
A Sundanese masked language model based on the RoBERTa architecture, trained on multiple datasets.
Downloads 32
Release Time : 3/2/2022

Model Overview

This is a Sundanese masked language model based on the RoBERTa architecture, primarily used for text understanding and generation tasks in Sundanese.

Model Features

Multi-dataset Training
Trained on four datasets—OSCAR, mC4, CC100, and Wikipedia—ensuring broad coverage of Sundanese usage.
High Accuracy
Achieves a validation accuracy of 63.98%, demonstrating strong performance in Sundanese tasks.
Optimized for Sundanese
Specifically designed and trained for Sundanese, offering better language understanding compared to multilingual models.

Model Capabilities

Sundanese Text Understanding
Masked Language Prediction
Text Feature Extraction

Use Cases

Education
Sundanese Learning Aid
Helps students understand and learn Sundanese grammar and vocabulary.
Natural Language Processing
Sundanese Text Analysis
Used for tasks like classification and sentiment analysis of Sundanese texts.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase