W

Wav2vec2 Base Common Voice 40p Persian Colab

Developed by zoha
This model is a fine-tuned speech recognition model based on facebook/wav2vec2-base for Persian language datasets, primarily used for Persian speech-to-text tasks.
Downloads 120
Release Time : 6/26/2022

Model Overview

This is an optimized Automatic Speech Recognition (ASR) model for Persian, based on the wav2vec2 architecture, fine-tuned for 40 epochs on the Common Voice Persian dataset.

Model Features

Persian Optimization
Specially optimized for the phonetic characteristics of Persian speech
Based on wav2vec2
Uses Facebook's open-source wav2vec2-base architecture as the base model
Medium-scale Training
Trained for 40 epochs, achieving a word error rate of 0.6024 on the validation set

Model Capabilities

Persian speech recognition
Speech-to-text

Use Cases

Speech Transcription
Persian Speech Transcription
Convert Persian speech content into text
Word error rate 0.6024
Voice Assistants
Persian Voice Command Recognition
Used for Persian voice assistant command recognition systems
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase