W

Wav2vec2 Large 960h Lv60

Developed by facebook
Wav2Vec2 is a powerful speech recognition model that extracts features from raw audio through self-supervised learning and achieves high-performance speech recognition with limited labeled data.
Downloads 7,011
Release Time : 3/2/2022

Model Overview

This model is pre-trained and fine-tuned on 960 hours of Libri-Light and Librispeech speech data, specifically designed for English automatic speech recognition tasks, supporting 16kHz sample rate audio input.

Model Features

Self-supervised Learning
Learns representations from raw audio, reducing reliance on large amounts of labeled data.
High Performance
Achieves a 2.2% WER on the Librispeech clean test set, demonstrating excellent performance.
Data Efficiency
Requires only a small amount of labeled data for fine-tuning to achieve high performance, suitable for resource-limited scenarios.

Model Capabilities

English speech recognition
16kHz audio processing
High-accuracy transcription

Use Cases

Speech Transcription
Meeting Minutes
Automatically transcribe meeting recordings into text records
Highly accurate transcribed text
Subtitle Generation
Generate English subtitles for video content
Fast and accurate automatic subtitles
Voice Assistants
Voice Command Recognition
Recognize and understand user voice commands
High-precision command recognition
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase