W

Wav2vec2 Xlsr 53 Pa In

Developed by anuragshas
A Punjabi automatic speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53, supporting 16kHz sample rate input.
Downloads 19
Release Time : 3/2/2022

Model Overview

This model is an automatic speech recognition system optimized for Punjabi (India), fine-tuned on the Common Voice dataset, suitable for speech-to-text tasks.

Model Features

High Compatibility
Supports standard 16kHz sample rate input, compatible with common voice devices.
No Language Model Required
Ready to use out-of-the-box without additional language model integration.
Data Augmentation
Fine-tuned on the large-scale Punjabi dataset from Common Voice.

Model Capabilities

Speech recognition
Punjabi speech-to-text
Real-time speech processing

Use Cases

Speech Transcription
Voice Memo Transcription
Automatically convert Punjabi voice memos into text
Test WER 58.05%
Voice Assistant
Provide voice interaction support for Punjabi users
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase