W

Wav2vec2 Large Es Voxpopuli

Developed by facebook
Large-scale speech pre-training model trained on the Spanish subset of the VoxPopuli corpus, suitable for Spanish speech recognition tasks
Downloads 117.04k
Release Time : 3/2/2022

Model Overview

This model is a Spanish speech recognition model developed by Facebook based on the Wav2Vec2 architecture, pre-trained using unannotated Spanish data from the VoxPopuli corpus, and can be used for Spanish speech-to-text tasks.

Model Features

Large-scale pre-training
Pre-trained on the Spanish subset of the VoxPopuli corpus, with a large data scale
Unsupervised learning
Uses unannotated speech data for pre-training, reducing reliance on labeled data
Multilingual support
Although focused on Spanish, it can be extended to support other languages based on the Wav2Vec2 architecture
Easy to fine-tune
Provides fine-tuning guidelines for optimization on specific Spanish speech recognition tasks

Model Capabilities

Spanish speech recognition
Speech feature extraction
Speech-to-text

Use Cases

Speech transcription
Spanish meeting minutes
Automatically transcribe Spanish meeting recordings into text records
Spanish media subtitle generation
Automatically generate subtitles for Spanish video content
Voice assistants
Spanish voice assistant
Build a voice interaction system that supports Spanish
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase