S

Sharif Wav2vec2

Developed by SLPL
A fine-tuned version of Sharif Wav2vec2 for Persian language, trained on Common Voice Persian samples, supporting automatic speech recognition tasks.
Downloads 88
Release Time : 6/25/2022

Model Overview

This model is an automatic speech recognition (ASR) model based on the Wav2vec2 architecture, specifically fine-tuned for Persian. It was trained on 108 hours of Common Voice Persian samples and incorporates a 5-gram language model to improve recognition accuracy.

Model Features

Persian optimization
Specifically fine-tuned for Persian, achieving 6.0% WER on the Common Voice Persian test set.
Language model integration
Incorporates a 5-gram language model trained with kenlm, improving the accuracy of online ASR.
Efficient processing
Supports 16kHz sampling rate audio input, suitable for real-time speech recognition applications.

Model Capabilities

Persian speech recognition
Audio transcription
Speech-to-text

Use Cases

Speech transcription
Persian speech-to-text
Convert Persian speech content into text
Achieves 6% word error rate (WER) on the Common Voice test set
Voice assistant
Persian voice command recognition
Used for voice command recognition in Persian voice assistants or smart home systems
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase