Wav2vec2 Base Hr Voxpopuli V2
Speech model based on Facebook's Wav2Vec2 architecture, pre-trained on the Croatian VoxPopuli corpus
Downloads 30
Release Time : 3/2/2022
Model Overview
This is a speech model based on the Wav2Vec2 architecture, specifically pre-trained for Croatian language, suitable for speech recognition tasks.
Model Features
Croatian Language Optimization
Specifically pre-trained using the Croatian VoxPopuli corpus
16kHz Audio Support
The model is pre-trained on 16kHz sampled speech audio, requiring matching sampling rate during usage
Lightweight Pre-training
Pre-trained using only 8.1k unlabeled data samples
Model Capabilities
Speech feature extraction
Croatian speech recognition
Use Cases
Speech Technology
Croatian Speech Recognition System
Can be used to build Croatian speech-to-text applications
Additional fine-tuning and tokenizer required for optimal performance
Featured Recommended AI Models
Š 2025AIbase