W

Wav2vec2 Large Xlsr 53 Portuguese

Developed by facebook
A large-scale Portuguese automatic speech recognition (ASR) model developed by Facebook based on the Wav2Vec 2.0 architecture, supporting Portuguese speech-to-text tasks.
Downloads 425
Release Time : 3/2/2022

Model Overview

This model is a large-scale cross-lingual speech representation model (XLSR) trained on the Wav2Vec 2.0 architecture, specifically optimized for Portuguese, capable of accurately converting Portuguese speech into text.

Model Features

Cross-lingual Speech Representation
Based on the XLSR-53 architecture, capable of learning universal speech feature representations across languages.
Portuguese Optimization
Specifically optimized and trained for Portuguese speech characteristics.
End-to-End Recognition
Directly generates text output from raw audio input without intermediate feature extraction steps.

Model Capabilities

Portuguese speech recognition
Speech-to-text
Automatic speech transcription

Use Cases

Speech Transcription
Portuguese Speech-to-Text
Automatically converts Portuguese speech content into editable text format
Achieves a WER of 27.1% on the Common Voice Portuguese test set
Voice Assistants
Portuguese Voice Command Recognition
Used for building Portuguese voice assistants and voice control applications
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase