Wav2vec2 Base Bg Voxpopuli V2
A speech model based on Facebook's Wav2Vec2 architecture, specifically pretrained for Bulgarian language, suitable for speech recognition tasks.
Downloads 30
Release Time : 3/2/2022
Model Overview
This model is the base version of Wav2Vec2, pretrained on 17.6k hours of unlabeled Bulgarian data from the VoxPopuli corpus, suitable for speech recognition tasks.
Model Features
Bulgarian Language Specialized
Specifically pretrained for Bulgarian language, optimizing speech recognition performance for this language.
Based on VoxPopuli Corpus
Trained using the large-scale multilingual VoxPopuli speech corpus, ensuring high data quality.
16kHz Sampling Rate
The model is pretrained on 16kHz sampled speech audio; ensure input audio matches this sampling rate.
Model Capabilities
Speech recognition
Audio feature extraction
Use Cases
Speech Recognition
Bulgarian Speech-to-Text
Convert Bulgarian speech into text
Featured Recommended AI Models
Š 2025AIbase