Wav2vec2 Large Mt Voxpopuli V2
Facebook's Wav2Vec2 large model, pretrained exclusively on unlabeled Maltese (mt) audio from the VoxPopuli corpus. Because pretraining used no transcripts, the model must be fine-tuned on labeled data before it can be used for speech recognition.
Downloads 25
Release Time: 3/2/2022
Model Overview
This is a large speech model based on the Wav2Vec2 architecture and pretrained specifically on Maltese audio. It is intended mainly as a starting point for automatic speech recognition (ASR).
Model Features
VoxPopuli pretraining
The model is pretrained on the Maltese (mt) portion of the multilingual VoxPopuli corpus.
16kHz audio support
The model is pretrained on speech audio sampled at 16kHz. Input audio must use the same sampling rate, so resample any recording made at a different rate before passing it to the model.
Unsupervised pretraining
Pretraining uses only unlabeled audio, so transcribed data is needed only for the fine-tuning step. This makes the model a good fit for low-resource languages such as Maltese, where labeled speech is scarce.
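The 16kHz requirement above can be checked before any audio reaches the model. The following is a minimal sketch using only the Python standard library. The helper names wav_sample_rate and linear_resample are hypothetical and not part of any model API. The resampler uses plain linear interpolation with no anti-aliasing filter, so it only illustrates the idea; a dedicated audio library would normally be preferable for real data.

```python
import wave

TARGET_RATE = 16_000  # sampling rate (Hz) stated in the model description


def wav_sample_rate(path):
    """Return the sampling rate in Hz stored in a WAV file header."""
    with wave.open(path, "rb") as wav_file:
        return wav_file.getframerate()


def linear_resample(samples, source_rate, target_rate=TARGET_RATE):
    """Resample a mono sequence of floats by linear interpolation.

    Illustrative only: no low-pass filtering is applied, so downsampling
    real recordings this way can introduce aliasing.
    """
    samples = list(samples)
    if source_rate == target_rate or len(samples) < 2:
        return samples
    n_out = max(1, round(len(samples) * target_rate / source_rate))
    step = source_rate / target_rate  # input samples per output sample
    last = len(samples) - 1
    resampled = []
    for i in range(n_out):
        pos = i * step
        left = min(int(pos), last)
        right = min(left + 1, last)
        frac = pos - left
        resampled.append(samples[left] * (1.0 - frac) + samples[right] * frac)
    return resampled
```

A typical use would be to call wav_sample_rate on each file and resample only when the result differs from TARGET_RATE.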
Model Capabilities
Speech recognition
Audio feature extraction
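For audio feature extraction, it helps to know how many feature frames a clip will yield. The sketch below assumes this checkpoint keeps the default convolutional feature-encoder layout of the Wav2Vec2 architecture (seven layers with the kernel sizes and strides listed in the code). That is an assumption about this model, not something stated on this page. The function name feature_frame_count is hypothetical.

```python
# Assumed values: the default convolutional feature-encoder layout of the
# Wav2Vec2 architecture (kernel size and stride for each of the 7 layers).
CONV_KERNELS = (10, 3, 3, 3, 3, 2, 2)
CONV_STRIDES = (5, 2, 2, 2, 2, 2, 2)


def feature_frame_count(num_samples, kernels=CONV_KERNELS, strides=CONV_STRIDES):
    """Return how many feature frames the encoder emits for num_samples samples."""
    length = num_samples
    for kernel, stride in zip(kernels, strides):
        if length < kernel:
            return 0  # input too short for this layer to produce any output
        length = (length - kernel) // stride + 1
    return length


if __name__ == "__main__":
    # One second of 16 kHz audio
    print(feature_frame_count(16_000))  # prints 49
```

Under these assumptions the combined stride is 320 samples, so one second of 16kHz audio produces 49 frames, roughly one every 20 ms.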
Use Cases
Speech recognition
Maltese speech-to-text
Convert Maltese speech input into text output, once the model has been fine-tuned on transcribed Maltese speech.