W

Wav2vec2 Librispeech Clean 100h Demo Dist

Developed by patrickvonplaten
A speech recognition model fine-tuned on the LIBRISPEECH_ASR-CLEAN dataset based on facebook/wav2vec2-large-lv60
Downloads 15
Release Time : 3/2/2022

Model Overview

This model is a speech recognition model specifically optimized for the LIBRISPEECH_ASR-CLEAN dataset, capable of converting speech to text.

Model Features

Efficient Fine-tuning
Efficiently fine-tuned on the LIBRISPEECH_ASR-CLEAN dataset based on the facebook/wav2vec2-large-lv60 model.
Low Word Error Rate
Achieves a word error rate (WER) of 0.0417 on the evaluation set, demonstrating excellent performance.
Distributed Training
Supports multi-GPU distributed training, improving training efficiency.

Model Capabilities

Speech Recognition
English Speech to Text

Use Cases

Speech Transcription
Meeting Minutes
Automatically convert meeting recordings into text transcripts
High accuracy with a word error rate of only 0.0417
Voice Assistant
Used as the speech recognition module for voice assistants
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase