Librispeech 100h Supervised

Developed by Kuray107
This speech recognition model is facebook/wav2vec2-large-lv60 fine-tuned on the LibriSpeech 100-hour dataset. It achieves a low word error rate on its evaluation set.
Downloads 14
Release Time: 3/2/2022

Model Overview

This is a supervised learning model for English speech recognition, based on the wav2vec2 architecture and fine-tuned on the LibriSpeech 100-hour dataset.

Model Features

Low word error rate
Achieves a word error rate (WER) of 0.0345 (3.45%) on the evaluation set.
Based on wav2vec2 architecture
Uses facebook/wav2vec2-large-lv60 as the base model, which supplies the pretrained speech feature extractor.
Supervised learning fine-tuning
Fine-tuned with supervised learning on the LibriSpeech 100-hour dataset for English speech recognition.
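
The WER figure quoted above is the word-level edit distance between a reference transcript and the model output, divided by the number of reference words. The following is a minimal sketch of that calculation in plain Python. The function name `word_error_rate` and the example sentences are illustrative only and are not taken from the model's evaluation code, which may normalize text differently.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # (before the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1      # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # Hypothetical transcripts, not model output.
    wer = word_error_rate("the cat sat on the mat", "the cat sat on mat")
    print(f"WER = {wer:.4f}")
```

A WER of 0.0345 therefore means roughly 3 to 4 word errors per 100 reference words.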

Model Capabilities

English speech recognition
Audio to text conversion
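
Audio to text conversion with a wav2vec2 checkpoint is commonly done through the Hugging Face `transformers` ASR pipeline. The sketch below assumes the model is published on the Hub under the id `Kuray107/librispeech-100h-supervised`. That id is inferred from the developer and model name on this page and is not confirmed by it. The file name `meeting.wav` is a placeholder.

```python
# Assumed Hub id, inferred from the developer and model name; verify before use.
MODEL_ID = "Kuray107/librispeech-100h-supervised"


def transcribe(audio_path: str, model_id: str = MODEL_ID) -> str:
    """Return the English transcript of one audio file.

    Requires the `transformers` package with a PyTorch backend, plus ffmpeg
    so the pipeline can decode the audio file.
    """
    # Imported lazily so this module can be loaded without transformers installed.
    from transformers import pipeline

    asr = pipeline("automatic-speech-recognition", model=model_id)
    result = asr(audio_path)  # dict with a "text" key for a single input
    return result["text"]


if __name__ == "__main__":
    print(transcribe("meeting.wav"))  # placeholder path
```

Building the pipeline downloads the checkpoint, so for many files it is cheaper to create the pipeline once and reuse it than to call `transcribe` repeatedly.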

Use Cases

Speech transcription
Meeting minutes
Automatically transcribe meeting recordings into text records
Word-level accuracy of about 96.55% on the evaluation set (1 - WER of 0.0345)
Subtitle generation
Automatically generate English subtitles for video content
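
For subtitle generation, the transcribed text has to be paired with start and end times and written in a subtitle format such as SubRip (SRT). The sketch below assumes the timed segments already exist as `(start_seconds, end_seconds, text)` tuples, for example from splitting the audio into chunks before transcription. The segment values and both function names are hypothetical.

```python
from typing import Iterable, Tuple

Segment = Tuple[float, float, str]  # (start_seconds, end_seconds, text)


def format_timestamp(seconds: float) -> str:
    """Format seconds as an SRT timestamp, HH:MM:SS,mmm."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(segments: Iterable[Segment]) -> str:
    """Render timed transcript segments as SRT text."""
    blocks = []
    for index, (start, end, text) in enumerate(segments, start=1):
        blocks.append(
            f"{index}\n"
            f"{format_timestamp(start)} --> {format_timestamp(end)}\n"
            f"{text.strip()}\n"
        )
    # A blank line separates consecutive subtitle entries.
    return "\n".join(blocks)


if __name__ == "__main__":
    # Hypothetical segments, not model output.
    demo = [(0.0, 1.5, "HELLO WORLD"), (1.5, 4.0, "THIS IS A TEST")]
    print(to_srt(demo))
```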