
Wav2vec2 Large Xlsr Persian V2

Developed by m3hrdadfi
An automatic speech recognition model fine-tuned on Persian (Farsi) using the Common Voice dataset, based on facebook/wav2vec2-large-xlsr-53
Downloads 47
Release Time: 3/2/2022

Model Overview

This is a Persian automatic speech recognition (ASR) model, fine-tuned from Facebook's wav2vec2-large-xlsr-53 checkpoint. It expects speech input sampled at 16 kHz.
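Because the model expects 16 kHz input, it can help to check a recording's sample rate before transcription. The following is a minimal sketch using only Python's standard `wave` module. It assumes uncompressed PCM WAV files, and the helper names `wav_sample_rate` and `needs_resampling` are illustrative, not part of the model's own tooling.

```python
import wave

# Sample rate stated on the model card (16 kHz).
EXPECTED_RATE = 16_000


def wav_sample_rate(path: str) -> int:
    """Return the sample rate in Hz stored in a PCM WAV file header."""
    with wave.open(path, "rb") as wav_file:
        return wav_file.getframerate()


def needs_resampling(path: str, expected_rate: int = EXPECTED_RATE) -> bool:
    """Return True if the file's sample rate differs from the expected rate."""
    return wav_sample_rate(path) != expected_rate
```

If `needs_resampling` returns True, the audio should be resampled to 16 kHz with an audio library of your choice before it is passed to the model.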

Model Features

Persian Optimization
Specially fine-tuned for Persian, including Persian character processing and normalization
Based on Common Voice Dataset
Trained and validated using the Persian Common Voice dataset
No Language Model Required
Transcribes speech directly, with no additional language model needed for decoding
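Decoding without a language model typically means greedy decoding: take the most likely token for each audio frame, collapse repeats, and drop the blank symbol. The sketch below illustrates this under the assumption that the model uses a CTC output layer, as wav2vec2 ASR fine-tunes commonly do. The toy vocabulary, the blank id of 0, and the "|" word delimiter are hypothetical values chosen for the example, not read from this model's configuration.

```python
from itertools import groupby


def ctc_greedy_decode(frame_ids, id_to_token, blank_id=0, word_delimiter="|"):
    """Greedy CTC decoding from per-frame argmax token ids.

    blank_id and word_delimiter are assumed defaults for illustration.
    """
    # Step 1: collapse runs of identical consecutive ids into one id.
    collapsed = [token_id for token_id, _ in groupby(frame_ids)]
    # Step 2: drop blank ids, which only separate repeated characters.
    tokens = [id_to_token[i] for i in collapsed if i != blank_id]
    # Step 3: join characters and turn word delimiters into spaces.
    return "".join(tokens).replace(word_delimiter, " ").strip()


# Hypothetical toy vocabulary (not the model's real vocabulary).
toy_vocab = {0: "<pad>", 1: "|", 2: "a", 3: "b"}
print(ctc_greedy_decode([2, 2, 0, 2, 3, 3, 1, 1, 3, 0], toy_vocab))  # prints "aab b"
```

The blank between the two runs of id 2 is what lets the decoder keep a doubled "a" instead of merging it into one.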

Model Capabilities

Persian Speech Recognition
16kHz Speech Processing

Use Cases

Speech-to-Text
Persian Speech Transcription
Convert Persian speech to text
Reported word error rate (WER) on the test set: 31.92%
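WER is the word-level edit distance (substitutions, deletions, and insertions) between a reference transcript and the model's output, divided by the number of reference words. A WER of 31.92% therefore corresponds to roughly 32 word errors per 100 reference words. The following is a minimal sketch of the standard single-utterance calculation; the function name `word_error_rate` is illustrative, and the evaluation script behind the reported figure may apply its own text normalization.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference word count."""
    ref_words = reference.split()
    hyp_words = hypothesis.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")

    # previous_row[j] = edit distance between the reference words seen
    # so far and the first j hypothesis words.
    previous_row = list(range(len(hyp_words) + 1))
    for i, ref_word in enumerate(ref_words, start=1):
        current_row = [i]
        for j, hyp_word in enumerate(hyp_words, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            current_row.append(min(
                previous_row[j] + 1,                      # deletion
                current_row[j - 1] + 1,                   # insertion
                previous_row[j - 1] + substitution_cost,  # match or substitution
            ))
        previous_row = current_row
    return previous_row[-1] / len(ref_words)


# One substitution and one deletion against four reference words.
print(word_error_rate("a b c d", "a x c"))  # prints 0.5
```

For a whole test set, the usual convention is to sum the edit distances over all utterances and divide by the total number of reference words, rather than averaging per-utterance scores.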