Wav2vec2 Large Slavic Parlaspeech Hr
This is an automatic speech recognition system for Croatian, based on a Slavic language pre-trained model, specifically optimized for parliamentary speech scenarios
Downloads 5,768
Release Time : 4/28/2022
Model Overview
The model is fine-tuned from the facebook/wav2vec2-large-slavic-voxpopuli-v2 pre-trained model using 300 hours of Croatian parliamentary speech data from ParlaSpeech-HR v1.0, specifically designed for speech recognition tasks in Croatian parliamentary settings
Model Features
Slavic language pre-training
Fine-tuned from a Slavic language pre-trained model, providing better adaptability for Croatian
Parliamentary speech optimization
Specifically optimized for the acoustic characteristics of Croatian parliamentary scenarios
High performance metrics
Achieves 2.22% character error rate and 6.79% word error rate on test sets
Model Capabilities
Croatian speech recognition
Parliamentary speech transcription
Long audio processing
Use Cases
Government agencies
Parliament meeting minutes
Automatically transcribe Croatian parliamentary meeting content
Efficiently generates meeting transcripts with over 93% accuracy
Academic research
Political discourse analysis
Provide political scientists with textual data of parliamentary speeches
Supports large-scale political discourse analysis research
Featured Recommended AI Models