X

Xlsr En Punctuation

Developed by boris
Fine-tuned automatic speech recognition model based on facebook/wav2vec2-large-xlsr-53 on the English Common Voice dataset, supporting punctuation prediction
Downloads 30.28k
Release Time : 3/2/2022

Model Overview

This is a Wav2Vec2 model for English automatic speech recognition (ASR) that can convert speech to text and automatically add punctuation.

Model Features

Multilingual pretraining
Fine-tuned from the XLSR-53 multilingual model with strong cross-lingual representation capabilities
Punctuation prediction
Not only recognizes speech content but also automatically predicts and adds punctuation
High accuracy
Achieves 1.0% word error rate (WER) on the Common Voice English test set

Model Capabilities

English speech recognition
Automatic punctuation prediction
16kHz audio processing

Use Cases

Speech transcription
Automatic meeting minutes generation
Automatically converts meeting recordings into punctuated transcripts
High accuracy reduces manual proofreading workload
Podcast subtitle generation
Automatically generates punctuated subtitle files for English podcasts
Supports output in common subtitle formats like SRT
Assistive technology
Voice input system
Provides high-accuracy voice input solutions for people with disabilities
Improves input efficiency and accuracy
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase