20220517 150219

Developed by lilitket
This model is a fine-tuned speech recognition model based on facebook/wav2vec2-xls-r-300m, supporting automatic speech recognition (ASR) tasks.
Downloads 29
Release Time: 5/17/2022

Model Overview

A speech recognition model based on the wav2vec2-xls-r-300m architecture, achieving a word error rate of 0.2344 and a character error rate of 0.0434 on the evaluation set after fine-tuning.
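For readers unfamiliar with the two metrics, the following is a minimal sketch of how word error rate (WER) and character error rate (CER) are conventionally computed: the edit distance between the reference and the model output, divided by the reference length. The function names are illustrative, and the text normalization and tokenization used for this model's reported evaluation are not stated on the page, so this should not be read as the exact scoring procedure behind 0.2344 and 0.0434.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (substitutions, insertions, deletions)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion from the reference
                            curr[j - 1] + 1,      # insertion into the reference
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def word_error_rate(reference, hypothesis):
    """WER: word-level edit distance divided by the number of reference words."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def char_error_rate(reference, hypothesis):
    """CER: character-level edit distance divided by the number of reference characters."""
    if not reference:
        raise ValueError("reference must not be empty")
    return edit_distance(list(reference), list(hypothesis)) / len(reference)
```

Because a single wrong character makes the whole word count as an error, WER is normally higher than CER on the same transcripts, which is consistent with the gap between the two figures reported here.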

Model Features

Low Word Error Rate
Achieves a word error rate (WER) of 0.2344 (23.44%) on the evaluation set
Low Character Error Rate
Achieves a character error rate (CER) of 0.0434 (4.34%) on the evaluation set
Based on Large-Scale Pre-trained Model
Fine-tuned from the facebook/wav2vec2-xls-r-300m model, inheriting its powerful speech feature extraction capabilities

Model Capabilities

Speech-to-Text
Automatic Speech Recognition
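
A minimal sketch of how a fine-tuned wav2vec2 checkpoint like this one is typically run through the Hugging Face transformers ASR pipeline. The repository id "lilitket/20220517-150219" is an assumption inferred from the developer and model name shown above, not something confirmed on this page, and the helper name `transcribe` is hypothetical. Decoding an audio file by path also assumes ffmpeg is installed.

```python
def transcribe(audio_path, model_id="lilitket/20220517-150219"):
    """Return the transcript of one audio file.

    model_id is an ASSUMED Hugging Face repository id; replace it with the
    actual id of the checkpoint you want to use.
    """
    # Imported lazily so that defining this helper does not require transformers.
    from transformers import pipeline

    # Builds the feature extractor, model, and CTC decoder for the checkpoint.
    asr = pipeline("automatic-speech-recognition", model=model_id)

    # For a file path, the pipeline decodes the audio and resamples it to the
    # rate the model expects before running inference.
    return asr(audio_path)["text"]
```

A call such as `transcribe("meeting.wav")` would download the checkpoint on first use; the file name here is only a placeholder.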

Use Cases

Speech Transcription
Automatic Meeting Minutes Transcription
Automatically convert meeting recordings into text transcripts
Word error rate of 23.44% on the evaluation set
Voice Note Conversion
Convert voice notes into editable text
Character error rate of 4.34% on the evaluation set