Wav2vec2 Base 960h Finetuned Common Voice3
A speech recognition model fine-tuned based on facebook/wav2vec2-base-960h, suitable for general speech recognition tasks
Downloads 20
Release Time : 4/28/2022
Model Overview
This model is a fine-tuned version of wav2vec2-base-960h on the Common Voice dataset, primarily used for Automatic Speech Recognition (ASR) tasks.
Model Features
Based on wav2vec2 Architecture
Utilizes the advanced wav2vec2 architecture to provide high-quality speech recognition capabilities
Fine-tuned on Common Voice Dataset
The model was fine-tuned on the Common Voice dataset, improving recognition accuracy
Supports Large-Scale Training
Used a total batch size of 1024 during training to ensure the model fully learns data features
Model Capabilities
Speech Recognition
Audio-to-Text Conversion
Use Cases
Speech Transcription
Meeting Minutes
Automatically convert meeting recordings into text transcripts
Subtitle Generation
Automatically generate subtitles for video content
Voice Assistants
Voice Command Recognition
Recognize and process user voice commands
Featured Recommended AI Models
Š 2025AIbase