W

Wav2vec2 Large Xlsr Open Brazilian Portuguese V2

Developed by lgris
This is a Wav2vec2 model optimized for Brazilian Portuguese, trained on multiple open datasets for automatic speech recognition tasks.
Downloads 1,825
Release Time : 3/2/2022

Model Overview

This model is an automatic speech recognition (ASR) model based on the Wav2vec2 architecture, specifically fine-tuned for Brazilian Portuguese. It integrates multiple publicly available Brazilian Portuguese speech datasets and can convert Portuguese speech into text.

Model Features

Multi-dataset training
Integrates multiple Brazilian Portuguese datasets including CETUC, MLS, VoxForge, Common Voice, and Lapsbm, improving the model's generalization capability.
High performance
Achieves a word error rate (WER) of 10.69% on the Common Voice test set.
Open license
Released under the Apache 2.0 license, allowing for commercial and research use.

Model Capabilities

Brazilian Portuguese speech recognition
Speech-to-text
Supports multiple audio sampling rates

Use Cases

Speech transcription
Meeting minutes
Automatically transcribe Brazilian Portuguese meeting recordings into text records.
Performs well in formal speech scenarios.
Subtitle generation
Automatically generate subtitles for Brazilian Portuguese video content.
High accuracy on clear speech.
Voice assistants
Portuguese voice command recognition
Used as a foundational speech recognition component for Brazilian Portuguese voice assistants.
Suitable for command and control scenarios.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase