IndicBART

Developed by ai4bharat
IndicBART is a multilingual sequence-to-sequence pre-trained model built on the mBART architecture. It covers 11 Indian languages and English.
Downloads 4,120
Release Time: 3/2/2022

Model Overview

IndicBART is a multilingual sequence-to-sequence pre-trained model specializing in natural language generation tasks for Indian languages and English, such as machine translation, summarization, and question generation.

Model Features

Multilingual Support
Supports 11 Indian languages (Assamese, Bengali, Gujarati, Hindi, Marathi, Odiya, Punjabi, Kannada, Malayalam, Tamil, and Telugu) plus English.
Efficient Computation
The model is significantly smaller than mBART and mT5-base, so fine-tuning and decoding cost less compute.
Large-scale Pretraining
Trained on a large Indian language corpus (452 million sentences and 9 billion tokens), including Indian English content.
Unified Script
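All languages except English are represented in Devanagari script, regardless of their native script, to facilitate transfer learning among related languages. Text in another Indic script must be converted to Devanagari before it is passed to the model.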
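The script unification described above can be illustrated with a minimal sketch. It assumes the simplest possible approach: the major Indic scripts occupy parallel 128-code-point Unicode blocks, so a character can be moved to Devanagari by shifting its code point. The function name `to_devanagari` and the table `SCRIPT_BLOCK_STARTS` are hypothetical names chosen for this illustration, and the mapping ignores script-specific exceptions. It is not the model's actual preprocessing; the public model card recommends the Indic NLP Library for this conversion.

```python
# Start of each script's Unicode block. Keys are illustrative language codes.
SCRIPT_BLOCK_STARTS = {
    "bn": 0x0980,  # Bengali script (also used for Assamese)
    "pa": 0x0A00,  # Gurmukhi
    "gu": 0x0A80,  # Gujarati
    "or": 0x0B00,  # Odia
    "ta": 0x0B80,  # Tamil
    "te": 0x0C00,  # Telugu
    "kn": 0x0C80,  # Kannada
    "ml": 0x0D00,  # Malayalam
}
DEVANAGARI_START = 0x0900
BLOCK_SIZE = 0x80  # each of these Unicode blocks spans 128 code points


def to_devanagari(text: str, lang: str) -> str:
    """Approximate script conversion by code-point offset (assumption:
    parallel block layout; script-specific characters are not handled)."""
    start = SCRIPT_BLOCK_STARTS[lang]
    converted = []
    for ch in text:
        offset = ord(ch) - start
        if 0 <= offset < BLOCK_SIZE:
            # Character belongs to the source script: shift it to Devanagari.
            converted.append(chr(DEVANAGARI_START + offset))
        else:
            # Spaces, digits, Latin letters, punctuation: keep unchanged.
            converted.append(ch)
    return "".join(converted)
```

For example, `to_devanagari("ગુજરાત", "gu")` shifts each Gujarati character into the Devanagari block. A production pipeline would need a proper transliterator to handle characters that have no direct counterpart.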

Model Capabilities

Text Generation
Machine Translation
Summarization
Question Generation

Use Cases

Natural Language Processing
Machine Translation
Translate English to Indian languages or Indian languages to English.
Summarization
Generate summaries of Indian language texts.
Question Generation
Generate relevant questions based on Indian language texts.
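As a rough sketch of how these use cases might look in code, the example below formats text with language tags and calls the Hugging Face `transformers` generation API. Several things here are assumptions rather than facts from this page: the tag format (`sentence </s> <2xx>` for input, `<2yy>` as the decoder start token) and the tokenizer options follow the public IndicBART model card as best recalled, and the helper names `format_source`, `format_target`, and `generate_text` are hypothetical. The pretrained checkpoint would normally need fine-tuning on the target task before its output is useful, so `model_name` would typically point to a fine-tuned checkpoint.

```python
# Language codes assumed from the public model card; verify before use.
LANG_CODES = {"as", "bn", "gu", "hi", "kn", "ml", "mr", "or", "pa", "ta", "te", "en"}


def _check(lang: str) -> None:
    if lang not in LANG_CODES:
        raise ValueError(f"unsupported language code: {lang}")


def format_source(text: str, src_lang: str) -> str:
    """Encoder input: sentence, end-of-sentence marker, source language tag."""
    _check(src_lang)
    return f"{text.strip()} </s> <2{src_lang}>"


def format_target(text: str, tgt_lang: str) -> str:
    """Decoder target used during fine-tuning: language tag, sentence, marker."""
    _check(tgt_lang)
    return f"<2{tgt_lang}> {text.strip()} </s>"


def generate_text(text: str, src_lang: str, tgt_lang: str,
                  model_name: str = "ai4bharat/IndicBART") -> str:
    """Generate output in tgt_lang. Downloads the model on first call."""
    # Imported lazily so the formatting helpers work without transformers.
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    _check(tgt_lang)
    tokenizer = AutoTokenizer.from_pretrained(
        model_name, do_lower_case=False, use_fast=False, keep_accents=True
    )
    model = AutoModelForSeq2SeqLM.from_pretrained(model_name)
    inputs = tokenizer(
        format_source(text, src_lang),
        add_special_tokens=False,  # the tags above are already in the string
        return_tensors="pt",
    )
    output_ids = model.generate(
        inputs.input_ids,
        num_beams=4,
        max_length=64,
        early_stopping=True,
        pad_token_id=tokenizer.convert_tokens_to_ids("<pad>"),
        eos_token_id=tokenizer.convert_tokens_to_ids("</s>"),
        decoder_start_token_id=tokenizer.convert_tokens_to_ids(f"<2{tgt_lang}>"),
    )
    return tokenizer.decode(
        output_ids[0], skip_special_tokens=True, clean_up_tokenization_spaces=False
    )
```

Non-English input should already be in Devanagari before it reaches `format_source`. The same input and output formatting would apply to summarization or question generation, with source and target tags set to the same language.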