wav2vec2-large-xls-r-300m-slovenian Open-source Speech Recognition Model

Wav2vec2 Large Xls R 300m Slovenian

Developed by bekirbakar

This model is a speech recognition model fine-tuned on the Common Voice Slovenian dataset based on facebook/wav2vec2-xls-r-300m, with a word error rate of 0.3271.

Speech Recognition

Transformers

Open Source License:Apache-2.0 #Slovenian speech recognition #High-accuracy speech-to-text #Multilingual pre-training fine-tuning

Downloads 278

Release Time : 6/6/2022

Model Overview

A speech recognition model optimized for Slovenian, fine-tuned on the wav2vec2-xls-r-300m architecture, suitable for speech-to-text tasks.

Model Features

High-performance speech recognition

Achieved a word error rate of 0.3271 on the Common Voice Slovenian dataset

Fine-tuned on a large model

Fine-tuned on the 300-million-parameter wav2vec2-xls-r-300m model, inheriting the powerful feature extraction capabilities of the original model

Optimized training process

Used linear learning rate scheduling with 500 warmup steps, trained for 20 epochs to achieve optimal results

Model Capabilities

Slovenian speech recognition

Audio-to-text conversion

Speech content analysis

Use Cases

Speech transcription

Automated meeting minutes

Automatically convert Slovenian meeting recordings into text transcripts

Accuracy approximately 67.29%

Voice assistant

Provide voice interaction support for Slovenian-speaking users

Educational technology

Language learning applications

Help learners practice Slovenian pronunciation and listening

Training Loss	Epoch	Step	Validation Loss	Wer
4.3681	4.93	400	0.7067	0.6486
0.2311	9.87	800	0.5155	0.4341
0.0833	14.81	1200	0.4996	0.3799
0.0455	19.75	1600	0.4462	0.3271

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

Wav2vec2 Large Xls R 300m Slovenian

Model Overview

Model Features

Model Capabilities

Use Cases

🚀 wav2vec2-large-xls-r-300m-slovenian

🚀 Quick Start

📚 Documentation

Training procedure

Training Hyper - parameters

Training Results

Framework Versions

📄 License