W

Wav2vec2 Xls R 300m Hebrew

Developed by imvladikon
This is a Hebrew automatic speech recognition model fine-tuned based on the facebook/wav2vec2-xls-r-300m model, optimized for performance through two-stage training on small-scale and large-scale datasets.
Downloads 1.2M
Release Time : 3/2/2022

Model Overview

This model is specifically designed for Hebrew automatic speech recognition tasks. It undergoes a two-stage fine-tuning process, training on both small-scale high-quality datasets and large-scale diverse datasets to improve recognition accuracy.

Model Features

Two-stage fine-tuning training
First fine-tuned on a small-scale high-quality dataset, then further trained on a large-scale diverse dataset to enhance model robustness.
Multi-source data training
Training data includes high-quality annotated data, diverse source data, and weakly labeled unannotated data.
Low word error rate
Achieves a word error rate of 17.73% on small-scale test sets and 23.18% on large-scale test sets.

Model Capabilities

Hebrew speech recognition
Audio-to-text conversion
Robust speech processing

Use Cases

Speech transcription
Hebrew meeting minutes
Automatically transcribe Hebrew meeting recordings into text.
Word error rate approximately 23.18%
Hebrew voice assistant
Provide speech recognition capabilities for Hebrew voice assistants.
Speech analysis
Hebrew speech content analysis
Analyze Hebrew speech content and extract key information.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase