Wav2vec2 Large Xlsr Gu

Developed by gchhablani
A Gujarati automatic speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53, achieving a 23.55% word error rate (WER) on the OpenSLR Gujarati test set
Downloads 3,582
Release Time: 3/2/2022

Model Overview

This model performs Gujarati Automatic Speech Recognition (ASR). It is fine-tuned from the XLSR Wav2Vec2 architecture and expects speech input sampled at 16kHz.
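A minimal sketch of how a transcription call might look with the Hugging Face transformers and torchaudio libraries. The Hub id `gchhablani/wav2vec2-large-xlsr-gu` is an assumption inferred from the developer and model name on this page, and the function name `transcribe` and the audio path are hypothetical. Decoding is a plain greedy argmax over the CTC output, with no language model.

```python
# Assumed Hub id, inferred from the developer and model name; verify before use.
MODEL_ID = "gchhablani/wav2vec2-large-xlsr-gu"
TARGET_SAMPLE_RATE = 16_000  # the model expects 16kHz input


def transcribe(path: str, model_id: str = MODEL_ID) -> str:
    """Transcribe one audio file with greedy CTC decoding (no language model)."""
    # Heavy dependencies are imported lazily so the module loads without them.
    import torch
    import torchaudio
    from transformers import Wav2Vec2ForCTC, Wav2Vec2Processor

    processor = Wav2Vec2Processor.from_pretrained(model_id)
    model = Wav2Vec2ForCTC.from_pretrained(model_id)
    model.eval()

    # torchaudio.load returns a (channels, samples) tensor and the file's rate.
    waveform, sample_rate = torchaudio.load(path)
    waveform = waveform.mean(dim=0)  # mix down to mono

    # Resample only when the file is not already at 16kHz.
    if sample_rate != TARGET_SAMPLE_RATE:
        waveform = torchaudio.functional.resample(
            waveform, sample_rate, TARGET_SAMPLE_RATE
        )

    inputs = processor(
        waveform.numpy(),
        sampling_rate=TARGET_SAMPLE_RATE,
        return_tensors="pt",
        padding=True,
    )
    with torch.no_grad():
        logits = model(
            inputs.input_values, attention_mask=inputs.get("attention_mask")
        ).logits

    # Greedy decoding: pick the most likely token at every frame.
    predicted_ids = torch.argmax(logits, dim=-1)
    return processor.batch_decode(predicted_ids)[0]
```

Calling `transcribe("sample.wav")` (a hypothetical file) downloads the checkpoint on first use and returns the predicted Gujarati text as a string.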

Model Features

High Accuracy Speech Recognition
Achieves a 23.55% Word Error Rate (WER) on the OpenSLR Gujarati test set
No Language Model Required
Can be used directly without additional language model support
Multi-Sampling Rate Support
Audio recorded at other sampling rates can be handled by resampling it to 16kHz before inference
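For readers unfamiliar with the WER figure quoted above, the following is a minimal sketch of how a word error rate is commonly computed: the word-level edit distance (substitutions, deletions, insertions) between reference and hypothesis, divided by the number of reference words. The function names and the English sentences are illustrative placeholders, not data from the OpenSLR test set, and this is not necessarily the exact scoring script used for the reported number.

```python
from typing import Iterable, Tuple


def word_edit_distance(reference: str, hypothesis: str) -> int:
    """Minimum number of word substitutions, deletions and insertions."""
    ref = reference.split()
    hyp = hypothesis.split()
    # prev[j] holds the distance between the processed reference prefix and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion of a reference word
                curr[j - 1] + 1,     # insertion of a hypothesis word
                prev[j - 1] + cost,  # match or substitution
            )
        prev = curr
    return prev[-1]


def corpus_wer(pairs: Iterable[Tuple[str, str]]) -> float:
    """Total word errors divided by total reference words over all pairs."""
    total_errors = 0
    total_words = 0
    for reference, hypothesis in pairs:
        total_errors += word_edit_distance(reference, hypothesis)
        total_words += len(reference.split())
    if total_words == 0:
        raise ValueError("references must contain at least one word")
    return total_errors / total_words


# Illustrative placeholder sentences (not from the dataset).
pairs = [
    ("the cat sat on the mat", "the cat sit on mat"),  # 1 substitution, 1 deletion
    ("hello world", "hello world"),                    # exact match
]
print(corpus_wer(pairs))  # 0.25
```

Because insertions count as errors, WER can exceed 100%, so "100% minus WER" is only a rough word-accuracy figure.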

Model Capabilities

Gujarati Speech Recognition
Audio to Text Conversion
Speech Content Analysis

Use Cases

Speech Transcription
Gujarati Speech Transcription
Convert Gujarati speech content to text
WER of 23.55% on the OpenSLR Gujarati test set (roughly 76.45% word accuracy)
Voice Assistants
Gujarati Voice Command Recognition
For developing Gujarati voice assistants and control systems