W

Wav2vec2 Base Vios Commonvoice 1

Developed by tclong
This model is a speech recognition model fine-tuned on the Common Voice dataset based on facebook/wav2vec2-xls-r-300m, supporting automatic speech recognition tasks.
Downloads 21
Release Time : 6/10/2022

Model Overview

This is a speech recognition model based on the wav2vec2 architecture, fine-tuned for converting speech to text.

Model Features

Based on wav2vec2 architecture
Utilizes the advanced wav2vec2 architecture to provide high-quality speech recognition capabilities
Fine-tuning optimization
Fine-tuned on the Common Voice dataset to optimize recognition performance
Low Word Error Rate
Achieved a word error rate (WER) of 0.3621 on the evaluation set

Model Capabilities

Speech Recognition
Audio to Text Conversion

Use Cases

Speech Transcription
Speech-to-Text Service
Convert speech content into text transcripts
Word error rate 0.3621
Assistive Technology
Real-time Caption Generation
Generate real-time captions for video or live streaming content
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase