Wav2vec2-large-uralic-voxpopuli-v2 Open-source Speech Model - Supports Uralic Language Family Speech Processing

Home

Wav2vec2 Large Uralic Voxpopuli V2

Developed by facebook

Wav2Vec2 large speech model pre-trained on 42.5 hours of unannotated Uralic language data from the VoxPopuli corpus

Speech Recognition

Transformers

#Uralic language speech recognition #Unsupervised pre-training #16kHz audio processing

Downloads 46

Release Time : 3/2/2022

Model Overview

This is a large speech model based on Facebook's Wav2Vec2 architecture, specifically pre-trained for Uralic languages and suitable for speech recognition tasks.

Model Features

Specialized for Uralic languages

Specifically pre-trained for Uralic languages, suitable for speech recognition tasks in this language family

Based on VoxPopuli corpus

Pre-trained using 42.5 hours of Uralic language data from the VoxPopuli multilingual speech corpus

16kHz audio support

The model was pre-trained using speech audio with a 16kHz sampling rate, ensure input audio matches this sampling rate when using

Model Capabilities

Speech feature extraction

Speech representation learning

Use Cases

Speech technology

Uralic language speech recognition

Can be used to develop automatic speech recognition systems for Uralic languages

Requires fine-tuning on annotated data to achieve optimal performance

Property	Details
Tags	audio, automatic - speech - recognition, voxpopuli - v2
Datasets	voxpopuli
License	cc - by - nc - 4.0
Inference	false
Model Type	Wav2Vec2 large model pretrained in Uralic
Training Data	42.5 hours of unlabeled data from the VoxPopuli corpus

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

Wav2vec2 Large Uralic Voxpopuli V2

Model Overview

Model Features

Model Capabilities

Use Cases

🚀 Wav2Vec2-large-VoxPopuli-V2

🚀 Quick Start

📚 Documentation

Paper

Authors

More Information

📄 License

📦 Information Table