Wav2vec2-large-mt-voxpopuli-v2 Open-source Speech Recognition Model - Customized for Maltese Speech Recognition

Wav2vec2 Large Mt Voxpopuli V2

Developed by facebook

Facebook's Wav2Vec2 large model, pretrained exclusively on unlabeled data from the VoxPopuli corpus for Maltese (mt), suitable for speech recognition tasks.

Speech Recognition

Transformers

Other#Multilingual speech recognition #Unsupervised pretraining #16kHz audio processing

Downloads 25

Release Time : 3/2/2022

Model Overview

This model is a large-scale speech model based on the Wav2Vec2 architecture, specifically pretrained for Maltese, primarily used for automatic speech recognition (ASR) tasks.

Model Features

Multilingual pretraining

The model is pretrained on the VoxPopuli corpus, supporting Maltese.

16kHz audio support

The model is pretrained on speech audio sampled at 16kHz; ensure input audio matches this sampling rate during use.

Unsupervised pretraining

The model uses unlabeled data for pretraining, making it suitable for speech recognition tasks in low-resource languages.

Model Capabilities

Speech recognition

Audio feature extraction

Use Cases

Speech recognition

Maltese speech-to-text

Convert Maltese speech input into text output.

Property	Details
Model Type	Wav2Vec2-large-VoxPopuli-V2
Training Data	9.1 hours of unlabeled data from the VoxPopuli corpus
Sampling Rate	16kHz

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

Wav2vec2 Large Mt Voxpopuli V2

Model Overview

Model Features

Model Capabilities

Use Cases

🚀 Wav2Vec2-large-VoxPopuli-V2

🚀 Quick Start

📚 Documentation

Model Information

Paper

Authors

Official Website

📄 License