wav2vec2-base-lv-voxpopuli-v2 Open-Source Speech Recognition Model - Tailored for Precise Latvian Language Recognition

Wav2vec2 Base Lv Voxpopuli V2

Developed by facebook

A foundational speech recognition model based on Facebook's Wav2Vec2 architecture, specifically pretrained for Latvian (lv) using 13.1k hours of unlabeled data from the VoxPopuli corpus.

Speech Recognition

Transformers

Other#Latvian speech recognition #Unsupervised pretraining #16kHz audio processing

Downloads 29

Release Time : 3/2/2022

Model Overview

This model is a foundational speech recognition model that learns speech representations from raw audio through self-supervised learning, suitable for Latvian speech processing tasks.

Model Features

Latvian-specific

Specifically pretrained for Latvian, optimizing speech feature extraction capabilities for this language

Self-supervised learning

Uses 13.1k hours of unlabeled data for self-supervised pretraining, eliminating the need for large amounts of labeled data

16kHz audio support

Optimized for 16kHz sampled speech audio; ensure input audio sampling rate matches when using

Model Capabilities

Speech representation learning

Speech feature extraction

Foundational speech recognition model

Use Cases

Speech technology

Latvian speech recognition system

Can serve as a foundational model for further fine-tuning and development of Latvian speech recognition systems

Speech data analysis

Used for feature extraction and analysis of Latvian speech data

Property	Details
Language	lv
Tags	audio, automatic-speech-recognition, voxpopuli-v2
Datasets	voxpopuli
Inference	false

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

Wav2vec2 Base Lv Voxpopuli V2

Model Overview

Model Features

Model Capabilities

Use Cases

🚀 Wav2Vec2-base-VoxPopuli-V2

🚀 Quick Start

📚 Documentation

Model Details

Paper

Authors

More Information

📄 License

📦 Metadata