W

Wav2vec2 2 Bart Large

Developed by patrickvonplaten
This model is an automatic speech recognition (ASR) model fine-tuned on the librispeech_asr-clean dataset, based on wav2vec2-large-lv60 and bart-large
Downloads 31
Release Time : 3/2/2022

Model Overview

A speech recognition model combining wav2vec2 and bart architectures, optimized for English speech-to-text tasks

Model Features

Hybrid Architecture
Combines wav2vec2's speech feature extraction capability with bart's sequence generation ability
High Accuracy
Achieved a word error rate (WER) of 4.86% on the LibriSpeech evaluation set
Multi-GPU Training
Supports distributed training to accelerate the model training process

Model Capabilities

English Speech Recognition
Audio-to-Text Conversion
Large-scale Speech Data Processing

Use Cases

Speech Transcription
Audiobook Transcription
Convert English audiobook content into text
Highly accurate transcription results
Meeting Minutes
Automatically record English meeting content
Voice Assistant
Voice Command Recognition
Recognize and understand English voice commands
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase