Wav2vec2 Large Xls R 300m Bulgarian

Developed by infinitejoy
A Bulgarian speech recognition model, based on facebook/wav2vec2-xls-r-300m and fine-tuned on the MOZILLA-FOUNDATION/COMMON_VOICE_7_0 - BG dataset
Downloads 10.59k
Release Time: 3/2/2022

Model Overview

This is a model for Bulgarian Automatic Speech Recognition (ASR), based on the XLS-R architecture and fine-tuned on the Bulgarian dataset from Common Voice 7.0.

Model Features

Multilingual Pretraining
Fine-tuned from the multilingual XLS-R-300M model, which provides strong pretrained speech representations
Bulgarian Language Optimization
Specifically fine-tuned for Bulgarian to adapt to its unique linguistic features
Medium Scale
About 300M parameters, balancing recognition performance against resource consumption

Model Capabilities

Bulgarian Speech Recognition
Speech-to-Text Conversion
Conversation Transcription
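
As a rough illustration of the speech-to-text capability, the sketch below wraps the Hugging Face transformers ASR pipeline. It is not taken from the model's own documentation. The repository id in MODEL_ID is an assumption inferred from the developer and model names on this page, and the helper names load_asr and transcribe are hypothetical. wav2vec2-based models generally expect 16 kHz mono audio; when given a file path, the pipeline decodes and resamples the audio, which requires ffmpeg to be installed.

```python
# Assumed Hub repository id, inferred from the developer and model name above.
MODEL_ID = "infinitejoy/wav2vec2-large-xls-r-300m-bulgarian"


def load_asr(model_id=MODEL_ID):
    """Build a transformers ASR pipeline (downloads the weights on first use)."""
    # Imported lazily so that the module can be loaded without transformers.
    from transformers import pipeline

    return pipeline("automatic-speech-recognition", model=model_id)


def transcribe(audio_path, asr=None):
    """Return the transcript for one audio file.

    `asr` is any callable mapping an audio path to a dict with a "text" key,
    which is the shape the transformers ASR pipeline returns. Passing one in
    lets a caller reuse a loaded pipeline across many files.
    """
    if asr is None:
        asr = load_asr()
    result = asr(audio_path)
    return result["text"].strip()


# Example usage (hypothetical file name; requires network access and ffmpeg):
#   asr = load_asr()
#   print(transcribe("memo_bg.wav", asr=asr))
```

Loading the pipeline once and passing it to transcribe avoids reloading the 300M-parameter model for every file.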

Use Cases

Speech Transcription
Voice Memo Transcription
Convert Bulgarian voice memos into text
WER 46.68% on Common Voice 7 test set
Customer Service Dialogue Recording
Automatically transcribe Bulgarian customer service conversations
WER 64.08% on Robust Speech Event test data
Assistive Technology
Voice Control Applications
Provide voice control interfaces for Bulgarian users
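
The WER figures quoted in the use cases above are word error rates: the number of word substitutions, deletions, and insertions needed to turn the model's transcript into the reference transcript, divided by the number of reference words. The sketch below is a minimal pure-Python version of that calculation, assuming whitespace tokenization and no text normalization. The function name word_error_rate is illustrative, and this is not necessarily the scoring procedure used to produce the reported numbers.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words processed
    # so far (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # i deletions turn i reference words into an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr.append(
                min(
                    prev[j] + 1,                      # deletion
                    curr[j - 1] + 1,                  # insertion
                    prev[j - 1] + substitution_cost,  # match or substitution
                )
            )
        prev = curr
    return prev[-1] / len(ref)


# One missing word out of four reference words.
print(word_error_rate("добър ден на всички", "добър ден всички"))  # 0.25
```

Read this way, a WER of 46.68% corresponds to roughly 47 word errors per 100 reference words.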