Wav2vec2 Large Xlsr 53 Portuguese
A large-scale Portuguese automatic speech recognition (ASR) model developed by Facebook based on the Wav2Vec 2.0 architecture, supporting Portuguese speech-to-text tasks.
Downloads 425
Release Time : 3/2/2022
Model Overview
This model is a large-scale cross-lingual speech representation model (XLSR) trained on the Wav2Vec 2.0 architecture, specifically optimized for Portuguese, capable of accurately converting Portuguese speech into text.
Model Features
Cross-lingual Speech Representation
Based on the XLSR-53 architecture, capable of learning universal speech feature representations across languages.
Portuguese Optimization
Specifically optimized and trained for Portuguese speech characteristics.
End-to-End Recognition
Directly generates text output from raw audio input without intermediate feature extraction steps.
Model Capabilities
Portuguese speech recognition
Speech-to-text
Automatic speech transcription
Use Cases
Speech Transcription
Portuguese Speech-to-Text
Automatically converts Portuguese speech content into editable text format
Achieves a WER of 27.1% on the Common Voice Portuguese test set
Voice Assistants
Portuguese Voice Command Recognition
Used for building Portuguese voice assistants and voice control applications
Featured Recommended AI Models
Š 2025AIbase