Wav2vec2 Large Lv60 Timit Asr

Developed by elgeish
A speech recognition model based on facebook/wav2vec2-large-lv60 and fine-tuned on the timit_asr dataset
Downloads 13
Release Time: 3/2/2022

Model Overview

This is an automatic speech recognition (ASR) model optimized for English speech.

Model Features

High-precision speech recognition: Achieves a 13.5% word error rate (WER) on the TIMIT dataset.
No language model required: Can be used directly without additional language model support.
16kHz sampling rate support: Optimized for speech input with a 16kHz sampling rate.
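
To illustrate what running without a language model can look like, here is a minimal sketch of greedy CTC decoding. It assumes the model emits one character label per audio frame through a CTC head, as fine-tuned wav2vec2 checkpoints commonly do; the toy vocabulary, the blank index 0, and the "|" word delimiter are illustrative assumptions, not values taken from this model.

```python
from itertools import groupby
from typing import Sequence


def ctc_greedy_decode(frame_ids: Sequence[int], vocab: Sequence[str], blank_id: int = 0) -> str:
    """Collapse consecutive repeated labels, then drop CTC blank labels."""
    collapsed = [label for label, _ in groupby(frame_ids)]
    return "".join(vocab[label] for label in collapsed if label != blank_id)


# Hypothetical toy vocabulary: index 0 is the CTC blank, "|" marks a word boundary.
vocab = ["<pad>", "|", "a", "c", "l"]

# Per-frame argmax labels; the blank between the two "l" labels keeps them distinct.
frames = [2, 2, 1, 3, 3, 2, 4, 0, 4, 4]

text = ctc_greedy_decode(frames, vocab).replace("|", " ")
print(text)  # a call
```

Each frame is decoded independently of any word-level statistics, so no external language model is consulted at any point.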

Model Capabilities

English speech-to-text
Continuous speech recognition
Speaker-independent recognition
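
As a rough sketch of English speech-to-text with this model, the following uses the Hugging Face transformers library. The Hub identifier `elgeish/wav2vec2-large-lv60-timit-asr` is inferred from the developer and model name and is an assumption, as is the use of the `soundfile` package to read audio; the helper name `transcribe` is hypothetical.

```python
# Assumed Hub ID, inferred from the developer and model name; not stated on this page.
MODEL_ID = "elgeish/wav2vec2-large-lv60-timit-asr"
TARGET_SAMPLE_RATE = 16_000  # the model expects 16kHz input


def transcribe(wav_path: str, model_id: str = MODEL_ID) -> str:
    """Transcribe a 16kHz audio file with greedy (language-model-free) decoding."""
    # Heavy dependencies are imported here so the module loads without them.
    import soundfile as sf
    import torch
    from transformers import Wav2Vec2ForCTC, Wav2Vec2Processor

    speech, sample_rate = sf.read(wav_path, dtype="float32")
    if speech.ndim > 1:
        speech = speech.mean(axis=1)  # downmix multi-channel audio to mono
    if sample_rate != TARGET_SAMPLE_RATE:
        raise ValueError(f"expected {TARGET_SAMPLE_RATE} Hz audio, got {sample_rate} Hz")

    processor = Wav2Vec2Processor.from_pretrained(model_id)
    model = Wav2Vec2ForCTC.from_pretrained(model_id)
    model.eval()

    inputs = processor(speech, sampling_rate=TARGET_SAMPLE_RATE, return_tensors="pt")
    with torch.no_grad():
        logits = model(**inputs).logits

    # Pick the most likely label per frame; the processor collapses repeats and blanks.
    predicted_ids = torch.argmax(logits, dim=-1)
    return processor.batch_decode(predicted_ids)[0]
```

Calling `transcribe("recording.wav")` on a hypothetical 16kHz recording would return the decoded text. Audio at other sampling rates is rejected here rather than resampled, to keep the sketch short.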

Use Cases

Speech transcription
Voice note transcription: Automatically convert English voice notes into text. Approximately 86.5% word-level accuracy, consistent with the 13.5% WER reported on TIMIT.
Meeting minutes: Automatically generate text transcripts of meeting recordings.

Voice interface
Voice command recognition: Recognize user voice commands.
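
The accuracy figure quoted for voice note transcription matches 100% minus the 13.5% WER, so it helps to see how WER is computed. The sketch below uses the standard word-level edit distance (substitutions, deletions, insertions) divided by the reference length; the function name `word_error_rate` and the example hypothesis are illustrative, not taken from the model's evaluation.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the processed reference prefix and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion (reference word missing from hypothesis)
                curr[j - 1] + 1,     # insertion (extra word in hypothesis)
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


# One substituted word out of five reference words.
wer = word_error_rate("she had your dark suit", "she had a dark suit")
print(f"WER = {wer:.1%}, word accuracy = {1 - wer:.1%}")
```

Because insertions count as errors, WER can exceed 100%, so "1 minus WER" is only a rough accuracy figure.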