
Wav2vec2 Large Xls R 300m Assamese Cv8

Developed by infinitejoy
This is an automatic speech recognition (ASR) model for Assamese, fine-tuned from the facebook/wav2vec2-xls-r-300m model.
Downloads 18
Release Time: 3/2/2022

Model Overview

This model is a fine-tuned version of facebook/wav2vec2-xls-r-300m, trained on the MOZILLA-FOUNDATION/COMMON_VOICE_8_0 - AS dataset for Assamese speech recognition.

Model Features

Assamese-specific: Speech recognition model specifically optimized for Assamese.
Based on XLS-R architecture: Uses Facebook's XLS-R-300M large-scale pre-trained model as the foundation.
Fine-tuned on Common Voice dataset: Fine-tuned using the Assamese dataset from Mozilla Common Voice 8.0.

Model Capabilities

Assamese speech recognition
Speech-to-text
Conversational speech processing
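
The speech-to-text capability listed above could be exercised roughly as in the sketch below, which uses the `pipeline` helper from the Hugging Face `transformers` library. The repository ID is an assumption inferred from the page title and the developer name, not something this page states, and `transcribe` is a hypothetical helper name. Decoding an audio file from a path also assumes that `ffmpeg` is installed.

```python
# Assumed Hugging Face repository ID, inferred from the page title and the
# developer name; verify it before relying on it.
MODEL_ID = "infinitejoy/wav2vec2-large-xls-r-300m-assamese-cv8"


def transcribe(audio_path, model_id=MODEL_ID):
    """Return the text recognized in the audio file at audio_path.

    Hypothetical helper: it wraps the generic transformers ASR pipeline
    and is not an API shipped with the model.
    """
    # Imported lazily so that this module loads even when transformers
    # is not installed.
    from transformers import pipeline

    # Downloads the model weights on first use.
    asr = pipeline("automatic-speech-recognition", model=model_id)

    # For a file path, the pipeline decodes the audio with ffmpeg and
    # returns a dict whose "text" entry holds the transcription.
    return asr(audio_path)["text"]
```

A call such as `transcribe("sample.wav")` would then return the recognized Assamese text as a string, where `sample.wav` is a placeholder file name.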

Use Cases

Speech transcription
Assamese speech transcription
Convert Assamese speech content into text
Achieves a WER of 65.966 and a CER of 22.188 on the test set
Voice assistant
Assamese voice interaction
Supports Assamese voice command recognition
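
For readers unfamiliar with the WER and CER figures quoted under speech transcription, the sketch below shows one common way such scores are computed: the edit distance between the reference and the recognized text, divided by the reference length, over words for WER and over characters for CER. It assumes the figures are percentages and that CER counts spaces as characters. The page states neither, and this is not necessarily the evaluation script used for this model.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (strings or lists)."""
    # prev[j] holds the distance between the processed prefix of ref
    # and the first j items of hyp.
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # deletion
                curr[j - 1] + 1,     # insertion
                prev[j - 1] + cost,  # substitution or match
            ))
        prev = curr
    return prev[-1]


def wer(reference, hypothesis):
    """Word error rate in percent (assumes a non-empty reference)."""
    ref_words = reference.split()
    errors = edit_distance(ref_words, hypothesis.split())
    return 100.0 * errors / len(ref_words)


def cer(reference, hypothesis):
    """Character error rate in percent, counting spaces as characters."""
    return 100.0 * edit_distance(reference, hypothesis) / len(reference)
```

Under these assumptions, a WER of 65.966 would mean roughly two word-level errors for every three reference words, while the lower CER suggests many of the misrecognized words differ from the reference by only a few characters.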