B

Bp Tedx100 Xlsr

Developed by lgris
Brazilian Portuguese Wav2vec 2.0 speech recognition model fine-tuned on TEDx Portuguese dataset
Downloads 23
Release Time : 3/2/2022

Model Overview

This model uses the Wav2vec 2.0 architecture, fine-tuned on the TEDx Portuguese multilingual dataset, specifically designed for automatic speech recognition tasks in Brazilian Portuguese.

Model Features

Multi-dataset training
The model was evaluated on multiple Portuguese speech datasets, including CETUC, Common Voice, etc.
Language model support
Can be combined with a 4-gram language model to further improve recognition accuracy
High performance
Excellent performance on multiple test sets with an average word error rate (WER) of 0.321

Model Capabilities

Brazilian Portuguese speech recognition
audio-to-text conversion
supports multiple audio format processing

Use Cases

Speech transcription
Lecture transcription
Automatically convert TEDx Portuguese lecture content into text
Word error rate 0.222
Business speech transcription
Transcribe business meeting recordings into text
Word error rate 0.169 on LaPS BM dataset
Speech analysis
Speech content analysis
Perform text analysis on Portuguese speech content
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase