B

Bp500 Base100k Voxpopuli

Developed by lgris
Speech recognition model optimized for Brazilian Portuguese, trained with 453 hours of audio from 7 public datasets
Downloads 23
Release Time : 3/2/2022

Model Overview

This model is a Brazilian Portuguese automatic speech recognition (ASR) system based on the Wav2vec 2.0 architecture, fine-tuned using multiple public datasets. It supports both language-model-free and 4-gram language model-enhanced modes.

Model Features

Multi-dataset Training
Combines 7 Brazilian Portuguese datasets (CETUC/Common Voice/MLS, etc.) totaling 453 hours of training data
Language Model Support
Optional 4-gram language model enhancement reduces average WER from 0.155 to 0.157
Cross-domain Adaptability
Stable performance across different scenarios such as read speech (CETUC) and spontaneous speech (TEDx)

Model Capabilities

Brazilian Portuguese speech-to-text conversion
Supports 16kHz sample rate audio processing
Batch speech recognition

Use Cases

Speech Transcription
Educational Content Transcription
Convert Portuguese teaching audio into text transcripts
Achieves WER as low as 0.052 on read speech datasets
Automated Meeting Minutes
Real-time transcription of Brazilian Portuguese meetings
WER around 0.317 on spontaneous speech datasets
Voice Assistants
Brazilian Portuguese Voice Command Recognition
Provides voice interaction support for localized smart devices
Excellent performance on short command datasets
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase