W

Wav2vec2 10

Developed by chrisvinsen
A speech recognition model fine-tuned from facebook/wav2vec2-base, achieving a Word Error Rate (WER) of 1.0 on the evaluation set
Downloads 20
Release Time : 5/23/2022

Model Overview

This model is a speech recognition model based on the wav2vec2 architecture, fine-tuned for the task of converting speech to text

Model Features

Low Word Error Rate
Achieves a Word Error Rate (WER) of 1.0 on the evaluation set
Based on wav2vec2 Architecture
Uses facebook/wav2vec2-base as the base model for fine-tuning
Optimized Training
Trained for 30 epochs using a linear learning rate scheduler and Adam optimizer

Model Capabilities

Speech Recognition
Audio-to-Text

Use Cases

Speech Transcription
Meeting Minutes
Automatically convert meeting recordings into text transcripts
Word Error Rate 1.0
Voice Notes
Convert voice memos into searchable text
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase