Ctrlv Wav2vec2 Tokenizer
A speech recognition model fine-tuned based on facebook/wav2vec2-base, achieving a 31.38% word error rate on the evaluation set
Downloads 25
Release Time : 4/20/2022
Model Overview
This model is a speech recognition model based on the wav2vec2 architecture, suitable for tasks converting speech to text
Model Features
Efficient Fine-tuning
Fine-tuned based on the wav2vec2-base model, achieving good results on a relatively small dataset
Low Word Error Rate
Achieved a 31.38% word error rate on the evaluation set, outperforming the base model
Optimized Training
Used linear learning rate scheduling with 1000 warm-up steps, ensuring stable and efficient training
Model Capabilities
Speech-to-Text
Automatic Speech Recognition
Use Cases
Speech Transcription
Meeting Minutes
Automatically convert meeting recordings into text transcripts
Accuracy approximately 68.62% (based on 31.38% WER)
Voice Notes
Convert voice memos into searchable text
Featured Recommended AI Models
Š 2025AIbase