W

Wav2vec2 Large Mt Voxpopuli V2

Developed by facebook
Facebook's Wav2Vec2 large model, pretrained exclusively on unlabeled data from the VoxPopuli corpus for Maltese (mt), suitable for speech recognition tasks.
Downloads 25
Release Time : 3/2/2022

Model Overview

This model is a large-scale speech model based on the Wav2Vec2 architecture, specifically pretrained for Maltese, primarily used for automatic speech recognition (ASR) tasks.

Model Features

Multilingual pretraining
The model is pretrained on the VoxPopuli corpus, supporting Maltese.
16kHz audio support
The model is pretrained on speech audio sampled at 16kHz; ensure input audio matches this sampling rate during use.
Unsupervised pretraining
The model uses unlabeled data for pretraining, making it suitable for speech recognition tasks in low-resource languages.

Model Capabilities

Speech recognition
Audio feature extraction

Use Cases

Speech recognition
Maltese speech-to-text
Convert Maltese speech input into text output.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase