W

Wav2vec2 Large Xls R 300m As V9

Developed by DrishtiSharma
An automatic speech recognition model fine-tuned on the Assamese (Common Voice 8.0) dataset based on facebook/wav2vec2-xls-r-300m
Downloads 20
Release Time : 3/2/2022

Model Overview

This is an automatic speech recognition (ASR) model for Assamese, fine-tuned from a large-scale pre-trained wav2vec2 model, suitable for speech-to-text tasks.

Model Features

Assamese optimization
Specially fine-tuned for Assamese, with good recognition performance in this language
Large-scale pre-training foundation
Based on the facebook/wav2vec2-xls-r-300m pre-trained model, with powerful speech feature extraction capabilities
Multi-scenario adaptation
Trained on the Common Voice dataset, capable of adapting to various speech scenarios

Model Capabilities

Assamese speech recognition
Speech-to-text
Automatic speech transcription

Use Cases

Speech transcription
Assamese speech transcription
Convert Assamese speech content into text
61.64% WER on Common Voice 8.0 test set
Voice assistant
Assamese voice interaction
Supports Assamese voice command recognition
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase