R

Roberta Hindi

Developed by flax-community
RoBERTa model pre-trained on massive Hindi data, supporting masked language modeling tasks
Downloads 212
Release Time : 3/2/2022

Model Overview

This is a RoBERTa model pre-trained on Hindi data using masked language modeling (MLM) objective, suitable for natural language processing tasks like text infilling.

Model Features

Large-scale Hindi pre-training
Joint pre-training on major Hindi datasets including mc4, oscar and indic-nlp
Dynamic masking strategy
Adopts 15% dynamic masking ratio: 80% replaced with <mask>, 10% random replacement, 10% unchanged
Multi-dataset integration
Integrates multiple high-quality Hindi datasets including news, reviews and Wikipedia data

Model Capabilities

Hindi text infilling
Hindi text understanding
Hindi language model inference

Use Cases

Text processing
Text auto-completion
Automatically completes missing parts in Hindi sentences
As shown in examples, can accurately predict words like 'рд╕рдлрд░' (journey), 'рдкрд▓' (moment)
Sentiment analysis
Product review analysis
Analyzes sentiment orientation of Hindi product reviews
Achieves 75.53% accuracy on IITP product review dataset
Featured Recommended AI Models
┬й 2025AIbase