wav2vec2-large-xlsr-53 Open-source Speech Recognition Model - Free Support for Tamil and Nepali Recognition

Wav2vec2 Large Xlsr 53 Demo Colab

Developed by Mahalakshmi

This is an automatic speech recognition model based on the wav2vec2 architecture, specifically optimized for the Tamil language and supporting Nepali speech recognition tasks.

Speech Recognition

Transformers

OtherOpen Source License:Apache-2.0 #Multilingual speech recognition #Low WER rate #XLSR-53 architecture

Downloads 17

Release Time : 3/2/2022

Model Overview

This model is primarily used for automatic speech recognition (ASR) tasks, capable of converting Tamil and Nepali speech into text.

Model Features

Multilingual Support

Supports speech recognition for Tamil and Nepali languages.

High Performance

Achieves a test WER of 25.02 on the openslr dataset, demonstrating excellent performance.

Based on wav2vec2 Architecture

Utilizes the advanced wav2vec2-large-xlsr-53 architecture, providing robust speech recognition capabilities.

Model Capabilities

Speech recognition

Multilingual processing

Use Cases

Speech-to-Text

Tamil Speech Transcription

Converts Tamil speech into text, suitable for applications like voice assistants and subtitle generation.

Test WER of 25.02

Nepali Speech Transcription

Converts Nepali speech into text, suitable for applications like voice assistants and subtitle generation.

Property	Details
Model Type	wav2vec2-large-xlsr-53-tamil
Training Data	openslr

Task	Dataset	Metric	Value
Automatic Speech Recognition	openslr (args: ne)	Test WER	25.02

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

Wav2vec2 Large Xlsr 53 Demo Colab

Model Overview

Model Features

Model Capabilities

Use Cases

🚀 xlsr-large-53-tamil

📄 License

📚 Documentation

Model Information

Evaluation Results