wav2vec2-base-bg-voxpopuli-v2: Open-source Speech Model for Bulgarian Speech Recognition

Wav2vec2 Base Bg Voxpopuli V2

Developed by Facebook

A speech model based on Facebook's Wav2Vec2 architecture, pretrained specifically on Bulgarian speech and intended as a starting point for speech recognition tasks.

Speech Recognition

Transformers

Other

#Bulgarian speech recognition #Unsupervised pretraining #16kHz audio processing

Downloads 30

Release Time: 3/2/2022

Model Overview

This model is the base version of Wav2Vec2, pretrained on 17.6k hours of unlabeled Bulgarian speech from the VoxPopuli corpus. Because pretraining used audio alone, the model needs to be fine-tuned on labeled data before it can be used for speech recognition.

Model Features

Bulgarian Language Specialized

Pretrained only on Bulgarian speech, so its learned representations are tailored to this language.

Based on VoxPopuli Corpus

Pretrained on the Bulgarian portion of VoxPopuli, a large-scale multilingual speech corpus built from European Parliament event recordings.

16kHz Sampling Rate

The model was pretrained on speech audio sampled at 16kHz; make sure input audio is also sampled at 16kHz.
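One way to check the sampling rate before passing audio to the model is to read it from the file header. The sketch below uses Python's standard `wave` module and assumes uncompressed WAV input. The helper names `wav_sample_rate` and `needs_resampling` are illustrative and not part of the model's own tooling.

```python
import wave

EXPECTED_RATE = 16_000  # sampling rate stated in the model card


def wav_sample_rate(path):
    """Return the sampling rate (Hz) recorded in a WAV file header."""
    with wave.open(path, "rb") as wav_file:
        return wav_file.getframerate()


def needs_resampling(path, expected_rate=EXPECTED_RATE):
    """Return True when the file's rate differs from the rate the model expects."""
    return wav_sample_rate(path) != expected_rate
```

Files for which `needs_resampling` returns True should be resampled to 16kHz with an audio library of your choice before inference or fine-tuning.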

Model Capabilities

Speech recognition (after fine-tuning on labeled data)

Audio feature extraction
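To give a sense of what feature extraction produces, the sketch below estimates how many feature vectors the model emits for a waveform of a given length. It assumes the standard Wav2Vec2 base convolutional encoder (the kernel sizes and strides below), which this page does not list, so treat the numbers as an approximation to verify against the model's configuration.

```python
# Kernel sizes and strides of the convolutional feature encoder.
# Assumption: standard Wav2Vec2 base values; not stated in the model card.
CONV_KERNELS = (10, 3, 3, 3, 3, 2, 2)
CONV_STRIDES = (5, 2, 2, 2, 2, 2, 2)
MIN_SAMPLES = 400  # shortest input that yields one frame with these layers


def num_feature_frames(num_samples):
    """Number of feature vectors produced for a waveform of num_samples samples."""
    if num_samples < MIN_SAMPLES:
        raise ValueError(f"need at least {MIN_SAMPLES} samples, got {num_samples}")
    length = num_samples
    for kernel, stride in zip(CONV_KERNELS, CONV_STRIDES):
        # Output length of an unpadded 1-D convolution.
        length = (length - kernel) // stride + 1
    return length


one_second = 16_000  # one second of audio at 16kHz
print(num_feature_frames(one_second))  # 49
```

Under these assumptions, one second of 16kHz audio maps to 49 frames, or roughly one feature vector every 20 ms.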

Use Cases

Speech Recognition

Bulgarian Speech-to-Text

Convert Bulgarian speech into text
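Since the model was pretrained on unlabeled audio, turning it into a speech-to-text system typically involves fine-tuning it with a CTC output layer, which in turn needs a character vocabulary built from labeled transcripts. The following is a minimal sketch of that preparation step, not the authors' procedure. The function name `build_char_vocab`, the special tokens, and the sample transcripts are all assumptions chosen for illustration.

```python
def build_char_vocab(transcripts):
    """Map each character in the transcripts to an integer id for CTC training."""
    chars = sorted({ch for text in transcripts for ch in text.lower()})
    # Replace the space with a visible word delimiter, a common CTC convention.
    chars = ["|" if ch == " " else ch for ch in chars]
    vocab = {ch: idx for idx, ch in enumerate(chars)}
    # Special tokens (assumed names): unknown character and padding.
    # In many CTC setups the padding token doubles as the blank symbol.
    vocab["[UNK]"] = len(vocab)
    vocab["[PAD]"] = len(vocab)
    return vocab


# Hypothetical Bulgarian transcripts, for illustration only.
sample_transcripts = ["добър ден", "благодаря"]
vocab = build_char_vocab(sample_transcripts)
print(vocab)
```

A vocabulary like this would usually be saved to a JSON file and used to create a tokenizer before fine-tuning on labeled Bulgarian speech.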

Property	Details
Model Type	Wav2Vec2-base-VoxPopuli-V2
Training Data	17.6k hours of unlabeled Bulgarian speech from the VoxPopuli corpus
Sampling Rate	16kHz
