wav2vec2-large-romance-voxpopuli-v2 Open-Source Speech Recognition Model - Specially Designed for the Romance Languages

Wav2vec2 Large Romance Voxpopuli V2

Developed by facebook

Facebook's Wav2Vec2 large model, pretrained only on 101.5 hours of unlabeled data from the Romance language VoxPopuli corpus, suitable for speech recognition tasks.

Speech Recognition

Transformers

#Romance language speech recognition #Unsupervised pretraining #16kHz audio processing

Downloads 26

Release Time : 3/2/2022

Model Overview

This model is an automatic speech recognition model pretrained on 16kHz sampled speech audio, requiring fine-tuning with a tokenizer and labeled data for use.

Model Features

Multilingual Support

Focuses on speech recognition for Romance languages, supporting multiple related languages.

Efficient Pretraining

Pretrained using only 101.5 hours of unlabeled data, achieving high data efficiency.

16kHz Audio Support

Optimized for 16kHz sampled speech audio to ensure recognition quality.

Model Capabilities

Speech feature extraction

Automatic speech recognition

Use Cases

Speech Technology

Multilingual Speech Recognition System

Build a speech recognition system supporting Romance languages

Requires fine-tuning with labeled data for use

Speech Data Analysis

Used for feature extraction and analysis of Romance language speech data

Property	Details
Model Type	Wav2Vec2-large-VoxPopuli-V2
Training Data	101.5 unlabeled data of the VoxPopuli corpus in romance
Sampling Rate	16kHz

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

Wav2vec2 Large Romance Voxpopuli V2

Model Overview

Model Features

Model Capabilities

Use Cases

🚀 Wav2Vec2-large-VoxPopuli-V2

🚀 Quick Start

✨ Features

📚 Documentation

Model Information

Important Notes

Paper Reference

More Information

📄 License