B

Base 10k 8khz Pt

Developed by lgris
A Portuguese automatic speech recognition model fine-tuned from facebook/wav2vec2-base-10k-voxpopuli, supporting 8kHz sampling rate
Downloads 28
Release Time : 3/2/2022

Model Overview

This is an optimized automatic speech recognition (ASR) model for Portuguese, based on the Wav2vec 2.0 architecture, fine-tuned using multiple Portuguese speech datasets.

Model Features

Multi-dataset Fine-tuning
Fine-tuned using multiple Portuguese speech datasets including CETUC, Common Voice, and Lapsbm to improve recognition accuracy
8kHz Sampling Rate Support
Optimized to support 8kHz sampling rate audio input, suitable for more real-world application scenarios
Brazilian Portuguese Optimization
Specifically optimized for Brazilian Portuguese variants, delivering better recognition performance

Model Capabilities

Portuguese speech recognition
Audio-to-text conversion
Supports 8kHz sampling rate input

Use Cases

Speech Transcription
Automatic Meeting Transcription
Automatically convert Portuguese meeting recordings into text transcripts
Voice Note Conversion
Convert Portuguese voice notes into editable text
Accessibility Applications
Real-time Caption Generation
Generate real-time captions for Portuguese video content
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase