
Wav2Vec2 Pretrained CLSRIL-23 10k

Developed by Harveenchadha
A self-supervised audio pretraining model that learns cross-lingual speech representations from the raw audio of 23 Indian languages
Downloads 32
Release Time: 3/2/2022

Model Overview

CLSRIL-23 (Cross Lingual Speech Representations for Indic Languages) is a speech representation model based on the wav2vec 2.0 architecture. It is pretrained with a contrastive learning objective to learn speech feature representations shared across 23 Indian languages, making it particularly well suited to speech processing in India's multilingual environment.
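A minimal sketch of using the checkpoint as a feature extractor with the Hugging Face transformers library is shown below. The model ID Harveenchadha/wav2vec2-pretrained-clsril-23-10k, the 16 kHz input rate, and the presence of a preprocessor config in the repository are assumptions based on common wav2vec 2.0 conventions; verify them against the actual model card.

```python
import torch
from transformers import Wav2Vec2FeatureExtractor, Wav2Vec2Model

# Assumed Hub ID, inferred from the page title; verify before use.
MODEL_ID = "Harveenchadha/wav2vec2-pretrained-clsril-23-10k"

feature_extractor = Wav2Vec2FeatureExtractor.from_pretrained(MODEL_ID)
model = Wav2Vec2Model.from_pretrained(MODEL_ID)
model.eval()

# Placeholder: one second of silent mono audio at 16 kHz.
# Replace with a real waveform loaded via torchaudio or librosa.
waveform = torch.zeros(16000)

inputs = feature_extractor(
    waveform.numpy(), sampling_rate=16000, return_tensors="pt"
)

with torch.no_grad():
    outputs = model(**inputs)

# Contextual speech representations: (batch, frames, hidden_size).
print(outputs.last_hidden_state.shape)
```

The last_hidden_state tensor can be pooled or passed to a downstream model, which is the speech feature extraction capability listed below.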

Model Features

Multilingual support
Supports speech representation learning for 23 Indian languages, covering the major Indo-Aryan and Dravidian language families
Self-supervised learning
Utilizes self-supervised learning methods to learn effective speech representations without requiring large amounts of labeled data
Shared quantized representation
Jointly learns shared latent quantized representations across all languages, facilitating cross-lingual transfer (see the sketch after this list)
Large-scale training data
Total training data exceeds 9000 hours, with Hindi having the largest volume (4563.7 hours)
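The contrastive objective behind the shared quantized representation feature can be illustrated with the generic Wav2Vec2ForPreTraining class from transformers: the convolutional encoder's latent frames are discretized by a quantizer shared across all 23 languages, and the transformer's contextual outputs are trained to match the quantized targets. The sketch below is a rough illustration under the same assumed model ID, not the authors' training code; if the uploaded checkpoint does not include quantizer weights, those modules would be freshly initialized.

```python
import torch
from transformers import Wav2Vec2FeatureExtractor, Wav2Vec2ForPreTraining

MODEL_ID = "Harveenchadha/wav2vec2-pretrained-clsril-23-10k"  # assumed Hub ID

feature_extractor = Wav2Vec2FeatureExtractor.from_pretrained(MODEL_ID)
model = Wav2Vec2ForPreTraining.from_pretrained(MODEL_ID)
model.eval()

waveform = torch.zeros(16000)  # placeholder: 1 s of 16 kHz audio
inputs = feature_extractor(
    waveform.numpy(), sampling_rate=16000, return_tensors="pt"
)

with torch.no_grad():
    outputs = model(inputs.input_values)

# Contextual outputs and their quantized targets, both projected into
# the shared space used by the contrastive loss during pretraining.
similarity = torch.cosine_similarity(
    outputs.projected_states, outputs.projected_quantized_states, dim=-1
)
print(similarity.shape)  # (batch, frames)
```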

Model Capabilities

Cross-lingual speech representation learning
Speech feature extraction
Multilingual speech processing

Use Cases

Speech recognition
Multilingual automatic speech recognition
Building speech recognition systems for India's multilingual environment (see the fine-tuning sketch below)
Speech technology development
Speech feature extraction
Serving as a pre-trained feature extractor for downstream speech tasks
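For the ASR use case above, a common pattern is to load the pretrained encoder into a CTC model and fine-tune it on labeled speech in the target language. The sketch below is a hypothetical outline: the model ID, vocabulary size, and padding token are assumptions, and a real system needs a language-specific character vocabulary, a matching tokenizer, and a proper training loop.

```python
import torch
from transformers import Wav2Vec2ForCTC

MODEL_ID = "Harveenchadha/wav2vec2-pretrained-clsril-23-10k"  # assumed Hub ID
VOCAB_SIZE = 64  # hypothetical size of the target language's character set

# Attach a randomly initialized CTC head to the pretrained encoder.
model = Wav2Vec2ForCTC.from_pretrained(
    MODEL_ID,
    vocab_size=VOCAB_SIZE,
    ctc_loss_reduction="mean",
    pad_token_id=0,  # must match the tokenizer's padding token
)

# Common practice: keep the convolutional feature encoder frozen.
model.freeze_feature_encoder()

# One dummy training step: raw 16 kHz audio in, character-label IDs out.
input_values = torch.randn(1, 16000)
labels = torch.randint(low=1, high=VOCAB_SIZE, size=(1, 12))

outputs = model(input_values, labels=labels)
outputs.loss.backward()  # a real fine-tuning run wraps this in a training loop
print(float(outputs.loss))
```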