wav2vec2-base-hr-voxpopuli-v2 Open-source Speech Model - Achieve Precise Speech Processing in Croatian

Wav2vec2 Base Hr Voxpopuli V2

Developed by facebook

Speech model based on Facebook's Wav2Vec2 architecture, pre-trained on the Croatian VoxPopuli corpus

Downloads 30

Release Time : 3/2/2022

Model Overview

This is a speech model based on the Wav2Vec2 architecture, specifically pre-trained for Croatian language, suitable for speech recognition tasks.

Croatian Language Optimization

Specifically pre-trained using the Croatian VoxPopuli corpus

16kHz Audio Support

The model is pre-trained on 16kHz sampled speech audio, requiring matching sampling rate during usage

Lightweight Pre-training

Pre-trained using only 8.1k unlabeled data samples

Speech feature extraction

Croatian speech recognition

Speech Technology

Croatian Speech Recognition System

Can be used to build Croatian speech-to-text applications

Additional fine-tuning and tokenizer required for optimal performance

Property	Details
Model Type	Wav2Vec2-base-VoxPopuli-V2
Training Data	8.1k unlabeled data from the VoxPopuli corpus
Sampling Rate	16kHz

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base