W

Wav2vec2 2 Bart Base

Developed by patrickvonplaten
A speech recognition model fine-tuned on the LibriSpeech ASR clean dataset, based on wav2vec2-base and bart-base
Downloads 493
Release Time : 3/2/2022

Model Overview

This model combines the speech feature extraction capability of wav2vec2 with the sequence-to-sequence transformation ability of BART, focusing on English speech recognition tasks

Model Features

Hybrid Architecture
Combines speech feature extraction from wav2vec2 with sequence transformation capability from BART
Efficient Fine-tuning
Optimized on the LibriSpeech ASR clean dataset
Multi-GPU Training
Supports distributed training to improve efficiency

Model Capabilities

English speech recognition
Audio-to-text conversion
Sequence-to-sequence transformation

Use Cases

Speech Transcription
Meeting Minutes
Convert meeting recordings into text transcripts
Podcast Transcription
Convert podcast audio content into text
Assistive Technology
Real-time Caption Generation
Generate real-time captions for videos or live streams
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase