Open-source wav2vec2-2-bart-large-no-adapter model - Accurately convert English speech to text for free

Wav2vec2 2 Bart Large No Adapter

Developed by sanchit-gandhi

This model is an automatic speech recognition (ASR) model trained on the LibriSpeech ASR dataset, capable of converting English speech into text.

Speech Recognition

Transformers

#High-precision speech transcription #Low word error rate #English speech recognition

Downloads 22

Release Time : 3/14/2022

Model Overview

This is a speech recognition model trained from scratch, specifically designed for English speech-to-text tasks. The model achieved a word error rate (WER) of 1.0267 on the LibriSpeech evaluation set.

Model Features

Low Word Error Rate

Achieved a word error rate (WER) of 1.0267 on the LibriSpeech evaluation set, demonstrating excellent performance

End-to-End Training

The model is trained from scratch without relying on pre-trained weights

Optimized Training Configuration

Uses the Adam optimizer and linear learning rate scheduler, combined with gradient accumulation for efficient training

Model Capabilities

English speech recognition

Speech-to-text

Continuous speech recognition

Use Cases

Speech Transcription

Audiobook Transcription

Automatically transcribe English audiobooks into text

Highly accurate transcription results

Meeting Minutes

Automatically record English meeting content and generate text transcripts

Assistive Technology

Real-time Caption Generation

Generate real-time captions for English videos or live streams

Training Loss	Epoch	Step	Validation Loss	Wer
6.7189	0.56	500	6.9796	0.9350
6.5068	1.12	1000	6.4823	1.3923
6.4601	1.68	1500	6.1801	1.1578
6.1802	2.24	2000	6.0002	1.7750
6.0888	2.8	2500	5.8453	1.7581
6.0993	3.36	3000	5.7702	1.4096
6.0851	3.92	3500	5.6634	1.0944
5.9357	4.48	4000	5.6120	1.0267

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

Wav2vec2 2 Bart Large No Adapter

Model Overview

Model Features

Model Capabilities

Use Cases

🚀 Speech Recognition Model

🚀 Quick Start

📚 Documentation

Model description

Intended uses & limitations

Training and evaluation data

Training procedure

Training hyperparameters

Training results

Framework versions