Wav2vec2 Xls R Pt Cv7 From Bp400h
Developed by lgris
This is a Portuguese automatic speech recognition (ASR) model based on the wav2vec2 XLS-R architecture, fine-tuned on the Common Voice 7 dataset, achieving a word error rate (WER) of 12.13% on the test set.
Downloads 94
Release Time: 3/2/2022
Model Overview
This model is specifically designed for Portuguese speech recognition tasks, based on Facebook's wav2vec2 XLS-R architecture and fine-tuned on the Mozilla Common Voice 7.0 dataset.
Model Features
High-performance Portuguese recognition
Achieves a word error rate (WER) of 12.13% and a character error rate (CER) of 3.68% on the Common Voice 7 Portuguese test set.
Built on a pre-trained model
Fine-tuned from the lgris/bp_400h_xlsr2_300M pre-trained model, reusing its learned speech feature extraction.
Multi-scenario evaluation
In addition to the Common Voice dataset, it has also been evaluated on robust speech competition datasets, demonstrating its performance in different scenarios.
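The WER and CER figures quoted above are both edit-distance ratios: the number of word-level (or character-level) substitutions, insertions, and deletions needed to turn the model output into the reference transcript, divided by the reference length. The following is a minimal sketch of that calculation, not the evaluation script used for this model. The function names and the sample sentences are illustrative, and real evaluations usually normalize case and punctuation first, which this sketch does not do.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (lists of words or strings)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # delete a reference item
                curr[j - 1] + 1,     # insert a hypothesis item
                prev[j - 1] + cost,  # substitute, or match at no cost
            ))
        prev = curr
    return prev[-1]


def wer(reference, hypothesis):
    """Word error rate: word-level edit distance / number of reference words."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def cer(reference, hypothesis):
    """Character error rate: character-level edit distance / reference length.

    Spaces are counted as characters here; some tools exclude them.
    """
    if not reference:
        raise ValueError("reference must not be empty")
    return edit_distance(reference, hypothesis) / len(reference)


if __name__ == "__main__":
    # Illustrative sentences, not taken from Common Voice.
    reference = "o gato dorme no sofá"
    hypothesis = "o gato dorme sofá"
    print(f"WER: {wer(reference, hypothesis):.2%}")
    print(f"CER: {cer(reference, hypothesis):.2%}")
```

Because insertions count as errors, WER can exceed 100% on very poor output, so it is a ratio of edits rather than a strict percentage of wrong words.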
Model Capabilities
Portuguese speech recognition
Automatic speech-to-text transcription
Handling various Portuguese accents
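As a rough illustration of the capabilities listed above, the sketch below transcribes one audio file with the Hugging Face transformers ASR pipeline. It rests on several assumptions: the Hub identifier is inferred from the model title and author name on this page and is not stated here, the transformers library with a PyTorch backend is installed, ffmpeg is available for decoding audio files, and the file path passed on the command line is the user's own.

```python
import sys

# Assumed Hub id, inferred from the model title and author; not confirmed by this page.
MODEL_ID = "lgris/wav2vec2-xls-r-pt-cv7-from-bp400h"


def transcribe(audio_path, model_id=MODEL_ID):
    """Return the Portuguese transcript of a single audio file."""
    # Imported lazily so this module can be loaded without transformers installed.
    from transformers import pipeline

    # Downloads the model on first use; later calls reuse the local cache.
    asr = pipeline("automatic-speech-recognition", model=model_id)
    result = asr(audio_path)  # dict containing the decoded text
    return result["text"]


if __name__ == "__main__":
    if len(sys.argv) != 2:
        sys.exit("usage: python transcribe.py <audio-file>")
    print(transcribe(sys.argv[1]))
```

Building the pipeline is the slow step, so an application that transcribes many files would normally create it once and reuse it rather than calling this function repeatedly.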
Use Cases
Speech-to-text
Voice memo transcription
Automatically convert Portuguese voice memos into searchable text
Word-level accuracy of roughly 87.87% on the Common Voice 7 test set (100% minus the 12.13% WER)
Voice assistant
Provide speech recognition capabilities for Portuguese voice assistants
Accessibility technology
Real-time caption generation
Generate real-time captions for Portuguese video content
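For captioning or other long-running audio, one common approach is to cut the signal into short overlapping windows and transcribe each window as it becomes available. The helper below is only a sketch of that windowing step. The function name, the 5-second window, and the 1-second overlap are illustrative choices rather than settings taken from this model, and the 16 kHz sample rate is assumed because wav2vec2 models are generally trained on 16 kHz audio.

```python
def window_bounds(num_samples, sample_rate=16000, window_s=5.0, overlap_s=1.0):
    """Return (start, end) sample indices of overlapping windows covering the audio.

    Consecutive windows share `overlap_s` seconds so that a word cut at one
    window edge appears whole in the neighbouring window.
    """
    window = int(window_s * sample_rate)
    step = window - int(overlap_s * sample_rate)
    if window <= 0 or step <= 0:
        raise ValueError("window must be positive and longer than the overlap")

    bounds = []
    start = 0
    while start < num_samples:
        end = min(start + window, num_samples)
        bounds.append((start, end))
        if end == num_samples:
            break  # the last window reached the end of the audio
        start += step
    return bounds


if __name__ == "__main__":
    # 12 seconds of 16 kHz audio split into 5-second windows with 1 second of overlap.
    for start, end in window_bounds(12 * 16000):
        print(f"{start / 16000:.1f}s to {end / 16000:.1f}s")
```

Each window would then be passed to the recognizer, and the overlapping text merged; the merging strategy is left out of this sketch.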