Wav2vec2 Large 100k Voxpopuli Ft Common Voice Plus TTS Dataset Portuguese

Developed by Edresson
This is an automatic speech recognition (ASR) model based on Facebook's Wav2vec2 Large 100k VoxPopuli, fine-tuned for Portuguese on Common Voice 7.0 and the TTS-Portuguese Corpus.
Downloads: 20
Release date: 3/2/2022

Model Overview

This model is primarily used for automatic speech recognition tasks in Portuguese, capable of converting Portuguese speech into text.
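
As a rough illustration of how a checkpoint like this is typically loaded, here is a minimal sketch using the Hugging Face transformers `pipeline` API. The Hub identifier in `MODEL_ID` is an assumption inferred from the model title and author name on this page, and `transcribe` is a hypothetical helper name; check both against the actual model repository before relying on them.

```python
# Assumed Hugging Face Hub identifier, inferred from the title and author on
# this page; verify it against the real repository before use.
MODEL_ID = "Edresson/wav2vec2-large-100k-voxpopuli-ft-Common-Voice_plus_TTS-Dataset-portuguese"


def transcribe(audio_path: str, model_id: str = MODEL_ID) -> str:
    """Return the transcription of a single Portuguese audio file.

    Hypothetical helper, not part of the original model card.
    """
    # Imported lazily so that defining this function does not require
    # transformers (and a backend such as PyTorch) to be installed.
    from transformers import pipeline

    # Builds an ASR pipeline; the checkpoint is downloaded on first use.
    asr = pipeline("automatic-speech-recognition", model=model_id)

    # When given a file path, the pipeline decodes the audio (this needs
    # ffmpeg) and resamples it to the rate the model expects.
    result = asr(audio_path)
    return result["text"]
```

Calling `transcribe("sample.wav")` on a local recording (the file name is a placeholder) would return the recognized text as a string. For repeated calls, building the pipeline once and reusing it avoids reloading the model each time.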

Model Features

Portuguese Optimization
Specifically fine-tuned for Portuguese speech, improving recognition accuracy.
Multi-dataset Training
Trained on both Common Voice 7.0 and the TTS-Portuguese Corpus, with the aim of improving the model's generalization.
High Performance
Achieves a word error rate of 20.39% on the Common Voice 7.0 test set.
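
For readers unfamiliar with the metric, word error rate (WER) is the number of word-level substitutions, deletions and insertions needed to turn the model output into the reference transcript, divided by the number of reference words. The sketch below computes it for a single sentence pair with a standard edit-distance recurrence. It is only an illustration of the definition: `word_error_rate` is a hypothetical helper, and this page does not say which tool or text normalization produced the 20.39% figure.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = (substitutions + deletions + insertions) / reference word count."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion: reference word missing from output
                curr[j - 1] + 1,     # insertion: extra word in output
                prev[j - 1] + cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


# One substituted word out of five reference words.
print(word_error_rate("o gato está em casa", "o gato esta em casa"))  # 0.2
```

A score over a whole test set is usually obtained by summing the errors and the reference word counts across all utterances before dividing, rather than by averaging per-sentence scores.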

Model Capabilities

Portuguese speech recognition
Audio to text conversion
Automatic speech recognition

Use Cases

Speech transcription
Portuguese speech to text
Automatically convert Portuguese speech content into text format
Word error rate 20.39%
Voice assistants
Portuguese voice command recognition
Used for developing Portuguese voice assistants and control systems