W

Wav2vec2 Large Xls R 300m As

Developed by anuragshas
An automatic speech recognition (ASR) model fine-tuned on the Common Voice 7 Assamese (AS) dataset based on Facebook's wav2vec2-xls-r-300m model
Downloads 19
Release Time : 3/2/2022

Model Overview

This model is an automatic speech recognition system for Assamese, capable of converting Assamese speech into text

Model Features

Multilingual support
Based on Facebook's multilingual wav2vec2-xls-r model, supporting multiple languages including Assamese
Efficient training
Optimized training process using techniques like gradient accumulation for efficient training with limited resources
Robustness
Trained on the Common Voice dataset, the model exhibits a certain degree of robustness to speech variations

Model Capabilities

Assamese speech recognition
Speech-to-text
Supports 16kHz sample rate audio processing

Use Cases

Speech transcription
Assamese speech transcription
Convert Assamese speech content into text
Word error rate 56.995% (with language model)
Voice assistant
Assamese voice command recognition
Used to understand Assamese voice commands
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase