W

Wav2vec2 2 Bert Large No Adapter Frozen Enc

Developed by speech-seq2seq
This model is a speech recognition model trained on the librispeech_asr dataset, achieving a word error rate (WER) of 2.0133 on the evaluation set.
Downloads 25
Release Time : 3/2/2022

Model Overview

This is an automatic speech recognition (ASR) model capable of converting speech to text. The model is trained on the librispeech_asr dataset and is suitable for English speech recognition tasks.

Model Features

Low word error rate
Achieved a word error rate (WER) of 2.0133 on the evaluation set, demonstrating good performance.
Trained on LibriSpeech
Trained using the standard librispeech_asr dataset, providing a reliable training foundation.
Optimized training configuration
Incorporates optimization techniques such as gradient accumulation, learning rate warm-up, and mixed-precision training.

Model Capabilities

English speech recognition
Speech-to-text conversion

Use Cases

Speech transcription
Audio transcription
Convert English speech content into text
Word error rate 2.0133
Assistive tools
Subtitle generation
Automatically generate subtitles for English video content
null
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase