W

Wav2vec2 Large XLSR 53 Assamese

Developed by infinitejoy
Assamese automatic speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53, trained using the Common Voice dataset
Downloads 260
Release Time : 3/2/2022

Model Overview

This is an automatic speech recognition (ASR) model for Assamese, fine-tuned based on Facebook's Wav2Vec2-Large-XLSR-53 architecture, specifically designed to convert Assamese speech into text.

Model Features

Assamese-specific
Speech recognition model specifically optimized for Assamese
Based on XLSR-53
Fine-tuned using the powerful wav2vec2-large-xlsr-53 architecture
Common Voice dataset
Trained using the publicly available Common Voice dataset

Model Capabilities

Assamese speech recognition
16kHz audio processing

Use Cases

Speech-to-text
Assamese speech transcription
Convert Assamese speech content into text
Achieves a WER of 69.63% on the Common Voice test set
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase