Wav2vec2 Base 10k Voxpopuli Ft Sl

Developed by Facebook
Based on Facebook's Wav2Vec2 base model, pretrained on the 10K-hour unlabeled subset of the VoxPopuli corpus and fine-tuned on Slovenian transcription data for automatic speech recognition.
Downloads 26
Release Time: 3/2/2022

Model Overview

This model is an automatic speech recognition (ASR) system for Slovenian: it converts spoken Slovenian audio into text.

Model Features

Multilingual pretraining
Pretrained on the multilingual VoxPopuli corpus, so the learned speech representations can transfer across languages
Slovenian optimization
Fine-tuned on Slovenian transcription data to improve recognition accuracy for this language
End-to-end model
Learns speech representations directly from raw audio, eliminating the need for manual feature extraction in traditional speech recognition pipelines

Model Capabilities

Speech recognition
Audio-to-text conversion
Slovenian language processing
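
The audio-to-text capability above could be exercised roughly as follows. This is a minimal sketch, not an official usage guide: it assumes the model is published on the Hugging Face Hub under the identifier `facebook/wav2vec2-base-10k-voxpopuli-ft-sl` (inferred from the title), that the `transformers` and `torch` packages are installed, and that the input is mono audio sampled at 16 kHz. The function name `transcribe` is hypothetical.

```python
# Assumed Hub identifier, inferred from the model title; verify before use.
MODEL_ID = "facebook/wav2vec2-base-10k-voxpopuli-ft-sl"
# Wav2Vec2 models expect 16 kHz mono audio.
TARGET_SAMPLE_RATE = 16_000


def transcribe(waveform, sample_rate):
    """Return a Slovenian transcript for a 1-D sequence of float samples."""
    if sample_rate != TARGET_SAMPLE_RATE:
        raise ValueError(
            f"expected {TARGET_SAMPLE_RATE} Hz audio, got {sample_rate} Hz; "
            "resample the recording first"
        )

    # Heavy imports are deferred so the sample-rate check works without them.
    import torch
    from transformers import Wav2Vec2ForCTC, Wav2Vec2Processor

    processor = Wav2Vec2Processor.from_pretrained(MODEL_ID)
    model = Wav2Vec2ForCTC.from_pretrained(MODEL_ID)
    model.eval()

    # The processor normalizes the raw waveform; no hand-crafted features.
    inputs = processor(waveform, sampling_rate=sample_rate, return_tensors="pt")
    with torch.no_grad():
        logits = model(inputs.input_values).logits

    # Greedy decoding: most likely token per frame, then CTC collapse.
    predicted_ids = torch.argmax(logits, dim=-1)
    return processor.batch_decode(predicted_ids)[0]
```

Loading the processor and model inside the function keeps the sketch short; an application that transcribes many files would load them once and reuse them.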

Use Cases

Speech transcription
Automated meeting minutes
Automatically convert Slovenian meeting recordings into written transcripts
Voice assistant development
Provide speech recognition capabilities for Slovenian voice assistants
Accessibility technology
Real-time caption generation
Generate real-time captions for Slovenian video content
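
For long meeting recordings or caption generation, one common approach is to split the audio into short overlapping windows and transcribe each window separately. The sketch below illustrates only that windowing step under stated assumptions: the helper name `split_into_chunks`, the 10-second window, and the 1-second overlap are illustrative choices, not settings taken from the model's documentation.

```python
def split_into_chunks(samples, sample_rate=16_000,
                      chunk_seconds=10.0, overlap_seconds=1.0):
    """Split a sample sequence into overlapping windows.

    Returns a list of (start_time_in_seconds, chunk) pairs. The start time
    can be used to place each transcript on a caption timeline.
    """
    if overlap_seconds >= chunk_seconds:
        raise ValueError("overlap must be shorter than the chunk length")

    chunk_len = int(chunk_seconds * sample_rate)
    step = chunk_len - int(overlap_seconds * sample_rate)

    chunks = []
    for start in range(0, len(samples), step):
        chunks.append((start / sample_rate, samples[start:start + chunk_len]))
        # Stop once a window has reached the end of the recording.
        if start + chunk_len >= len(samples):
            break
    return chunks
```

Each chunk can then be passed to whatever transcription routine is in use. The overlap reduces the chance of cutting a word at a window boundary, at the cost of some duplicated text that the caller has to merge.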