Wav2vec2 Large Uralic Voxpopuli V2

Developed by facebook
Wav2Vec2 large speech model pre-trained on 42.5k hours of unlabeled Uralic language data from the VoxPopuli corpus
Downloads 46
Release Time : 3/2/2022

Model Overview

This is a large speech model based on Facebook's Wav2Vec2 architecture, pre-trained on unlabeled Uralic-language speech. It learns general speech representations and can be fine-tuned for speech recognition tasks.

Model Features

Specialized for Uralic languages
Pre-trained only on Uralic-language speech, making it a starting point for speech recognition in this language family
Based on VoxPopuli corpus
Pre-trained on 42.5k hours of unlabeled Uralic language data from the VoxPopuli multilingual speech corpus
16kHz audio support
The model was pre-trained on speech audio sampled at 16kHz. Make sure input audio uses the same sampling rate.
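The 16kHz requirement can be checked before any audio reaches the model. The following is a minimal sketch using only Python's standard `wave` module. The helper names `wav_sample_rate` and `check_sample_rate` are hypothetical, and the sketch assumes the input is an uncompressed WAV file.

```python
import wave

TARGET_SAMPLE_RATE = 16_000  # sampling rate the model was pre-trained on


def wav_sample_rate(path):
    """Return the sampling rate (in Hz) stored in a WAV file header."""
    with wave.open(path, "rb") as wav_file:
        return wav_file.getframerate()


def check_sample_rate(path, expected=TARGET_SAMPLE_RATE):
    """Raise ValueError if the WAV file is not sampled at the expected rate."""
    actual = wav_sample_rate(path)
    if actual != expected:
        raise ValueError(
            f"{path}: sample rate is {actual} Hz, expected {expected} Hz; "
            "resample the audio before passing it to the model"
        )
    return actual
```

Files that fail the check need to be resampled with an audio library of your choice before use.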

Model Capabilities

Speech feature extraction
Speech representation learning
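Feature extraction with the pre-trained model might look like the sketch below, which uses the Hugging Face `transformers` library. The Hub identifier in `MODEL_ID` is an assumption inferred from the model name and developer listed here, and `extract_features` is a hypothetical helper, not part of any library.

```python
# Assumed Hugging Face Hub identifier, inferred from the model name on this page.
MODEL_ID = "facebook/wav2vec2-large-uralic-voxpopuli-v2"
TARGET_SAMPLE_RATE = 16_000


def extract_features(waveform, model_id=MODEL_ID):
    """Return frame-level hidden states for a mono 16 kHz waveform.

    `waveform` is a 1-D sequence or array of float samples.
    """
    # Imported lazily so torch/transformers are only required when called.
    import torch
    from transformers import Wav2Vec2FeatureExtractor, Wav2Vec2Model

    feature_extractor = Wav2Vec2FeatureExtractor.from_pretrained(model_id)
    model = Wav2Vec2Model.from_pretrained(model_id)
    model.eval()

    inputs = feature_extractor(
        waveform, sampling_rate=TARGET_SAMPLE_RATE, return_tensors="pt"
    )
    with torch.no_grad():
        outputs = model(**inputs)
    # Tensor of shape (batch=1, frames, hidden_size)
    return outputs.last_hidden_state
```

The returned hidden states can serve as input features for a downstream classifier or be inspected directly as learned speech representations.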

Use Cases

Speech technology
Uralic language speech recognition
Can be used to develop automatic speech recognition systems for Uralic languages
Requires fine-tuning on annotated data to achieve optimal performance
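Because the checkpoint is pre-trained on unlabeled audio only, it has no output vocabulary, so fine-tuning for recognition starts by defining one. The sketch below shows one common approach: a character-level vocabulary with a CTC head added through `transformers`. The helpers `build_char_vocab` and `load_ctc_model` are hypothetical, the Hub identifier is assumed, and the training loop itself is omitted.

```python
# Assumed Hugging Face Hub identifier, inferred from the model name on this page.
MODEL_ID = "facebook/wav2vec2-large-uralic-voxpopuli-v2"


def build_char_vocab(transcripts):
    """Build a character-to-id mapping from annotated transcripts."""
    chars = sorted({ch for text in transcripts for ch in text.lower()})
    vocab = {ch: idx for idx, ch in enumerate(chars)}
    # Represent the space as a visible word delimiter token.
    if " " in vocab:
        vocab["|"] = vocab.pop(" ")
    vocab["[UNK]"] = len(vocab)
    vocab["[PAD]"] = len(vocab)  # also used as the CTC blank token
    return vocab


def load_ctc_model(vocab, model_id=MODEL_ID):
    """Load the pre-trained encoder with a freshly initialized CTC head."""
    # Imported lazily so transformers is only required when called.
    from transformers import Wav2Vec2ForCTC

    model = Wav2Vec2ForCTC.from_pretrained(
        model_id,
        vocab_size=len(vocab),
        pad_token_id=vocab["[PAD]"],
        ctc_loss_reduction="mean",
    )
    # Keep the convolutional feature encoder fixed during fine-tuning.
    model.freeze_feature_encoder()
    return model
```

Freezing the feature encoder is a design choice that is often made when annotated data is scarce; it is not a requirement stated for this model.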