Wav2vec2 Large Uralic Voxpopuli V2
Wav2Vec2 large speech model pre-trained on 42.5 hours of unannotated Uralic language data from the VoxPopuli corpus
Downloads 46
Release Time : 3/2/2022
Model Overview
This is a large speech model based on Facebook's Wav2Vec2 architecture, pre-trained specifically on Uralic languages. It is intended as a starting point for speech recognition in these languages once fine-tuned on annotated data.
Model Features
Specialized for Uralic languages
Specifically pre-trained for Uralic languages, suitable for speech recognition tasks in this language family
Based on VoxPopuli corpus
Pre-trained using 42.5 hours of Uralic language data from the VoxPopuli multilingual speech corpus
16 kHz audio support
The model was pre-trained on speech audio sampled at 16 kHz. Make sure input audio uses the same sampling rate, resampling it first if necessary.
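As a minimal sketch of the 16 kHz requirement, the snippet below reads the sampling rate of a WAV file and resamples a list of samples by linear interpolation. It uses only the Python standard library. The helper names `wav_sample_rate` and `resample_linear` are hypothetical and not part of any model tooling. Linear interpolation applies no anti-aliasing filter, so a dedicated audio resampler is likely a better choice for real data.

```python
import wave

TARGET_RATE = 16_000  # sampling rate the model was pre-trained on


def wav_sample_rate(path):
    """Return the sampling rate (Hz) stored in a WAV file header."""
    with wave.open(path, "rb") as wav_file:
        return wav_file.getframerate()


def resample_linear(samples, source_rate, target_rate=TARGET_RATE):
    """Resample a mono signal by linear interpolation (illustrative only)."""
    if source_rate <= 0 or target_rate <= 0:
        raise ValueError("sampling rates must be positive")
    if source_rate == target_rate or len(samples) < 2:
        return list(samples)

    out_len = int(len(samples) * target_rate / source_rate)
    step = source_rate / target_rate  # input samples per output sample
    last = len(samples) - 1
    result = []
    for i in range(out_len):
        pos = i * step
        left = min(int(pos), last)
        right = min(left + 1, last)
        frac = pos - left
        # Weighted average of the two neighbouring input samples.
        result.append(samples[left] * (1.0 - frac) + samples[right] * frac)
    return result
```

For example, one second of 48 kHz audio passed to `resample_linear(samples, 48_000)` comes back as 16,000 samples.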
Model Capabilities
Speech feature extraction
Speech representation learning
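The capabilities above can be sketched as follows, under stated assumptions. `encoder_frame_count` estimates how many feature frames the model emits for a given number of input samples, assuming the standard Wav2Vec2 convolutional encoder configuration (kernel sizes 10, 3, 3, 3, 3, 2, 2 and strides 5, 2, 2, 2, 2, 2, 2). `extract_features` shows one way to obtain hidden states with the Hugging Face `transformers` library. The model identifier is an assumption inferred from the model name and should be verified before use. Both function names are hypothetical.

```python
# Assumed standard Wav2Vec2 feature-encoder configuration.
CONV_KERNELS = (10, 3, 3, 3, 3, 2, 2)
CONV_STRIDES = (5, 2, 2, 2, 2, 2, 2)

# Assumption: Hugging Face Hub identifier inferred from the model name.
ASSUMED_MODEL_ID = "facebook/wav2vec2-large-uralic-voxpopuli-v2"


def encoder_frame_count(num_samples):
    """Number of feature frames produced for `num_samples` input samples."""
    length = num_samples
    for kernel, stride in zip(CONV_KERNELS, CONV_STRIDES):
        # Output length of an unpadded 1-D convolution.
        length = (length - kernel) // stride + 1
    return max(length, 0)


def extract_features(waveform, model_id=ASSUMED_MODEL_ID):
    """Return hidden states for a mono 16 kHz waveform (list or array of floats).

    Requires `torch` and `transformers`; imported lazily so the helper
    above works without them.
    """
    import torch
    from transformers import AutoFeatureExtractor, Wav2Vec2Model

    extractor = AutoFeatureExtractor.from_pretrained(model_id)
    model = Wav2Vec2Model.from_pretrained(model_id)
    model.eval()

    inputs = extractor(waveform, sampling_rate=16_000, return_tensors="pt")
    with torch.no_grad():
        outputs = model(**inputs)
    # Shape: (batch, frames, hidden_size)
    return outputs.last_hidden_state
```

Under this configuration, one second of 16 kHz audio yields 49 frames, roughly one frame every 20 ms.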
Use Cases
Speech technology
Uralic language speech recognition
Can be used to develop automatic speech recognition systems for Uralic languages
Requires fine-tuning on annotated data to achieve optimal performance