W

Wav2vec2 Large Xlsr Persian

Developed by m3hrdadfi
A fine-tuned automatic speech recognition model for Persian (Farsi) based on facebook/wav2vec2-large-xlsr-53, supporting 16kHz sampled audio input.
Downloads 562
Release Time : 3/2/2022

Model Overview

This is an automatic speech recognition model optimized specifically for Persian, fine-tuned on the Common Voice Persian dataset using the XLSR architecture.

Model Features

Persian optimization
Fine-tuned specifically for Persian speech characteristics to improve recognition accuracy.
No language model required
Can be used directly without additional language model support.
16kHz sampling rate support
Supports standard 16kHz sampled audio input.

Model Capabilities

Persian speech recognition
Audio-to-text conversion
Automatic speech transcription

Use Cases

Speech transcription
Persian speech-to-text
Convert Persian speech content into text format
Word Error Rate 32.20%
Voice assistants
Persian voice command recognition
Used for command recognition in Persian voice assistant systems
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase