W

Wav2vec2 Large Xlsr Assamese

Developed by manandey
This is an automatic speech recognition model for Assamese, fine-tuned from Facebook's wav2vec2-large-xlsr-53 model using common voice datasets.
Downloads 83
Release Time : 3/2/2022

Model Overview

This model is used for automatic speech recognition tasks in Assamese, capable of converting Assamese speech into text.

Model Features

Multilingual pre-trained model fine-tuning
Fine-tuned specifically for Assamese based on Facebook's multilingual pre-trained model wav2vec2-large-xlsr-53
16kHz sampling rate support
Specifically handles audio inputs with 16kHz sampling rate
No language model required
Can be used directly without additional language models

Model Capabilities

Assamese speech recognition
Speech-to-text

Use Cases

Speech transcription
Assamese speech transcription
Convert Assamese speech content into text
Word error rate 74.25%
Voice assistants
Assamese voice command recognition
Used for command recognition in Assamese voice assistant systems
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase