Wav2vec2 2 Bart Large No Adapter

Developed by sanchit-gandhi
This model is an automatic speech recognition (ASR) model trained on the LibriSpeech ASR dataset, capable of converting English speech into text.
Downloads 22
Release Time: 3/14/2022

Model Overview

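This is an English speech-to-text model, described as trained from scratch on LibriSpeech. It reports a word error rate (WER) of 1.0267 on the LibriSpeech evaluation set. WER is normally expressed as a ratio of word errors to reference words, so a value above 1 would mean more errors than reference words.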
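For context on the WER figure, here is a minimal sketch of how word error rate is commonly computed: the word-level edit distance (substitutions, deletions, insertions) divided by the number of reference words. The function name `word_error_rate` and the whitespace tokenization are assumptions for illustration. This is not the evaluation script used for this model.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)
```

Because insertions count as errors while the denominator is fixed by the reference, the ratio can exceed 1.0. For example, a one-word reference against a three-word hypothesis with no matching words yields 3.0.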

Model Features

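Reported Word Error Rate
Reports a word error rate (WER) of 1.0267 on the LibriSpeech evaluation set. Read as a ratio, a value above 1 means more word errors than reference words, so the figure does not by itself indicate high accuracy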
End-to-End Training
The model is trained from scratch without relying on pre-trained weights
Optimized Training Configuration
Uses the Adam optimizer and linear learning rate scheduler, combined with gradient accumulation for efficient training
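The training configuration described above (Adam, a linear learning rate schedule, gradient accumulation) can be sketched in plain Python on a toy problem. Everything concrete here is an assumption for illustration: the function names `linear_lr` and `train`, the hyperparameter values, the optional warmup, and the scalar least-squares objective. The model's actual hyperparameters are not stated in this listing.

```python
import math


def linear_lr(step, total_steps, base_lr, warmup_steps=0):
    """Linear warmup to base_lr (optional), then linear decay to 0."""
    if warmup_steps > 0 and step < warmup_steps:
        return base_lr * step / warmup_steps
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


def train(data, base_lr=0.1, accum_steps=4, betas=(0.9, 0.999), eps=1e-8):
    """Fit a scalar w to minimise (w - x)^2 with Adam, one optimizer
    update per `accum_steps` micro-batches (hypothetical toy setup)."""
    w, m, v = 0.0, 0.0, 0.0
    total_updates = len(data) // accum_steps
    grad_sum, update = 0.0, 0

    for i, x in enumerate(data, start=1):
        # Accumulate the micro-batch gradient, scaled so the sum
        # equals the mean gradient over the accumulation window.
        grad_sum += 2.0 * (w - x) / accum_steps

        if i % accum_steps == 0:
            lr = linear_lr(update, total_updates, base_lr)
            update += 1
            # Adam moment estimates with bias correction.
            m = betas[0] * m + (1 - betas[0]) * grad_sum
            v = betas[1] * v + (1 - betas[1]) * grad_sum ** 2
            m_hat = m / (1 - betas[0] ** update)
            v_hat = v / (1 - betas[1] ** update)
            w -= lr * m_hat / (math.sqrt(v_hat) + eps)
            grad_sum = 0.0  # reset for the next accumulation window

    return w, update
```

With `accum_steps=4`, 400 samples produce 100 optimizer updates, so the effective batch size is four times the micro-batch size while memory use stays at the micro-batch level.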

Model Capabilities

English speech recognition
Speech-to-text
Continuous speech recognition

Use Cases

Speech Transcription
Audiobook Transcription
Automatically transcribe English audiobooks into text
Transcription quality depends on the model's error rate (reported WER: 1.0267)
Meeting Minutes
Automatically record English meeting content and generate text transcripts
Assistive Technology
Real-time Caption Generation
Generate real-time captions for English videos or live streams