W

Wav2vec2 Large Xlsr Persian V3

Developed by m3hrdadfi
An automatic speech recognition (ASR) model fine-tuned on the Persian Common Voice dataset based on Facebook's wav2vec2-large-xlsr-53 model
Downloads 1,888
Release Time : 3/2/2022

Model Overview

This model is specifically designed for Persian (Farsi) speech recognition tasks, achieving high transcription accuracy through large-scale pre-training with XLSR architecture and fine-tuning on Persian data.

Model Features

Low Word Error Rate
Achieves a WER (Word Error Rate) of 10.36% on Persian test sets
Large-scale Pre-training
Based on the cross-lingual pre-trained model facebook/wav2vec2-large-xlsr-53
Specialized Data Fine-tuning
Fine-tuned using the Persian version of the Common Voice dataset

Model Capabilities

Persian Speech Recognition
16kHz Audio Processing
Long Speech Transcription

Use Cases

Speech Transcription
Persian Speech Transcription
Convert Persian speech content into text
Approximately 90% accuracy (WER 10.36%)
Voice Assistants
Persian Voice Command Recognition
Provides core recognition capabilities for Persian voice assistants
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase