Muril Base Cased

Developed by Google
MuRIL is a BERT model pre-trained on 17 Indian languages and their transliterated counterparts, optimized for Indian-language contexts
Downloads 12.72k
Release Time: 3/2/2022

Model Overview

MuRIL is a multilingual model based on the BERT architecture, pre-trained on 17 Indian languages and specifically optimized for transliterated text

Model Features

Multilingual Support
Supports 17 Indian languages and their transliterated counterparts
Transliteration Optimization
Specifically optimized for the transliteration commonly found in Indian-language text
Parallel Data Training
Uses translated and transliterated text pairs for pre-training
Low-resource Language Optimization
Uses an upsampling exponent of 0.3 to improve performance on low-resource languages
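
To make the effect of the 0.3 exponent concrete, here is a minimal sketch. It assumes the exponent-smoothed sampling scheme commonly used for multilingual BERT-style models, where each language's sampling probability is proportional to its corpus share raised to the exponent. The function name, language codes, and corpus sizes are hypothetical and are not taken from MuRIL's training setup.

```python
def smoothed_sampling_probs(sizes: dict[str, int], alpha: float = 0.3) -> dict[str, float]:
    """Return per-language sampling probabilities proportional to share ** alpha."""
    total = sum(sizes.values())
    # Raising each share to alpha < 1 compresses the gap between large and small corpora.
    weights = {lang: (n / total) ** alpha for lang, n in sizes.items()}
    norm = sum(weights.values())
    return {lang: w / norm for lang, w in weights.items()}


# Hypothetical corpus sizes (number of sentences), for illustration only.
sizes = {"hi": 1_000_000, "ta": 100_000, "as": 1_000}

total = sum(sizes.values())
raw = {lang: n / total for lang, n in sizes.items()}
smoothed = smoothed_sampling_probs(sizes, alpha=0.3)

for lang in sizes:
    print(f"{lang}: raw share={raw[lang]:.4f}  smoothed={smoothed[lang]:.4f}")
```

With an exponent below 1, the smallest corpus is sampled far more often than its raw share would suggest, while the ranking of languages by size is preserved. An exponent of 1.0 reproduces the raw shares.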

Model Capabilities

Multilingual Text Understanding
Transliterated Text Processing
Masked Language Modeling
Cross-lingual Transfer Learning
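
As an illustration of the masked language modeling capability, the following sketch queries the model through the Hugging Face `transformers` fill-mask pipeline. It assumes the checkpoint is published on the Hub as `google/muril-base-cased` and that the tokenizer's mask token is `[MASK]`. The helper names `mask_word` and `top_predictions` are hypothetical, and the first call to `top_predictions` downloads the model weights.

```python
MODEL_ID = "google/muril-base-cased"  # assumed Hugging Face Hub identifier


def mask_word(sentence: str, word: str, mask_token: str = "[MASK]") -> str:
    """Replace the first occurrence of `word` in `sentence` with the mask token."""
    if word not in sentence:
        raise ValueError(f"{word!r} does not occur in the sentence")
    return sentence.replace(word, mask_token, 1)


def top_predictions(masked_sentence: str, k: int = 3) -> list[tuple[str, float]]:
    """Return the k most likely fillers for the single masked position."""
    # Imported lazily so the helper above works without transformers installed.
    from transformers import pipeline

    fill = pipeline("fill-mask", model=MODEL_ID)
    results = fill(masked_sentence, top_k=k)
    return [(r["token_str"], r["score"]) for r in results]
```

For example, `top_predictions(mask_word("dilli bharat ki rajdhani hai", "rajdhani"))` asks the model to fill a masked word in a transliterated Hindi sentence. The predictions themselves depend on the checkpoint and are not shown here.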

Use Cases

Natural Language Processing
Named Entity Recognition
Named entity recognition tasks for Indian languages
Achieved an average F1 score of 77.60% on PANX tasks, significantly outperforming mBERT
Part-of-Speech Tagging
Part-of-speech tagging tasks for Indian languages
Achieved an average F1 score of 75.02% on UDPOS tasks, outperforming mBERT
Cross-lingual Natural Language Inference
XNLI tasks for Indian languages
On transliterated text, average accuracy improved from 39.23% (mBERT) to 64.70%