C

Ctrlv Wav2vec2 Tokenizer

Developed by proseph
A speech recognition model fine-tuned based on facebook/wav2vec2-base, achieving a 31.38% word error rate on the evaluation set
Downloads 25
Release Time : 4/20/2022

Model Overview

This model is a speech recognition model based on the wav2vec2 architecture, suitable for tasks converting speech to text

Model Features

Efficient Fine-tuning
Fine-tuned based on the wav2vec2-base model, achieving good results on a relatively small dataset
Low Word Error Rate
Achieved a 31.38% word error rate on the evaluation set, outperforming the base model
Optimized Training
Used linear learning rate scheduling with 1000 warm-up steps, ensuring stable and efficient training

Model Capabilities

Speech-to-Text
Automatic Speech Recognition

Use Cases

Speech Transcription
Meeting Minutes
Automatically convert meeting recordings into text transcripts
Accuracy approximately 68.62% (based on 31.38% WER)
Voice Notes
Convert voice memos into searchable text
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase