Wav2vec2-2-Bart-large Open-source Automatic Speech Recognition Model - Free Deployment for Accurate Speech-to-Text Conversion

Wav2vec2 2 Bart Large

Developed by patrickvonplaten

This model is an automatic speech recognition (ASR) model fine-tuned on the librispeech_asr-clean dataset, based on wav2vec2-large-lv60 and bart-large

Speech Recognition

Transformers

#Speech-to-Text #High Accuracy Recognition #Multimodal Fusion

Downloads 31

Release Time : 3/2/2022

Model Overview

A speech recognition model combining wav2vec2 and bart architectures, optimized for English speech-to-text tasks

Model Features

Hybrid Architecture

Combines wav2vec2's speech feature extraction capability with bart's sequence generation ability

High Accuracy

Achieved a word error rate (WER) of 4.86% on the LibriSpeech evaluation set

Multi-GPU Training

Supports distributed training to accelerate the model training process

Model Capabilities

English Speech Recognition

Audio-to-Text Conversion

Large-scale Speech Data Processing

Use Cases

Speech Transcription

Audiobook Transcription

Convert English audiobook content into text

Highly accurate transcription results

Meeting Minutes

Automatically record English meeting content

Voice Assistant

Voice Command Recognition

Recognize and understand English voice commands

Property	Details
Model Name	wav2vec2-2-bart-large
Base Models	facebook/wav2vec2-large-lv60, bart-large
Fine - tuned Dataset	librispeech_asr - clean
Evaluation Loss	0.3204
Evaluation Wer	0.0486

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

Wav2vec2 2 Bart Large

Model Overview

Model Features

Model Capabilities

Use Cases

🚀 wav2vec2-2-bart-large

🚀 Quick Start

✨ Features

🔧 Technical Details

Training hyperparameters

Training results

Framework versions

Model Information

Interactive Widgets