W

Wav2vec2 Large Xlsr Pa IN

Developed by danurahul
A speech recognition model fine-tuned on the Punjabi Common Voice dataset based on facebook/wav2vec2-large-xlsr-53
Downloads 26
Release Time : 3/2/2022

Model Overview

This is an optimized automatic speech recognition (ASR) model for Punjabi, fine-tuned on Facebook's wav2vec2-large-xlsr-53 architecture, supporting 16kHz sampled speech input.

Model Features

Punjabi Optimization
Specially fine-tuned for Punjabi, improving recognition accuracy for this language
Based on XLSR Architecture
Utilizes Facebook's wav2vec2-large-xlsr-53 pre-trained model as the base, featuring powerful speech feature extraction capabilities
16kHz Sampling Rate Support
Supports 16kHz sampled speech input, suitable for common speech application scenarios

Model Capabilities

Speech recognition
Punjabi speech-to-text
Automatic speech transcription

Use Cases

Speech Transcription
Punjabi Speech to Text
Convert Punjabi speech content into text format
WER of 54.86 on the Common Voice test set
Voice Assistants
Punjabi Voice Command Recognition
Used to build voice assistant systems supporting Punjabi
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase