Wav2vec2 Large Romance Voxpopuli V2
Facebook's Wav2Vec2 large model, pretrained only on 101.5 hours of unlabeled data from the Romance language VoxPopuli corpus, suitable for speech recognition tasks.
Downloads 26
Release Time : 3/2/2022
Model Overview
This model is an automatic speech recognition model pretrained on 16kHz sampled speech audio, requiring fine-tuning with a tokenizer and labeled data for use.
Model Features
Multilingual Support
Focuses on speech recognition for Romance languages, supporting multiple related languages.
Efficient Pretraining
Pretrained using only 101.5 hours of unlabeled data, achieving high data efficiency.
16kHz Audio Support
Optimized for 16kHz sampled speech audio to ensure recognition quality.
Model Capabilities
Speech feature extraction
Automatic speech recognition
Use Cases
Speech Technology
Multilingual Speech Recognition System
Build a speech recognition system supporting Romance languages
Requires fine-tuning with labeled data for use
Speech Data Analysis
Used for feature extraction and analysis of Romance language speech data
Featured Recommended AI Models
Š 2025AIbase