Wav2vec2 Base Toy Train Data Masked Audio
W
Wav2vec2 Base Toy Train Data Masked Audio
Developed by scasutt
A speech recognition model fine-tuned from facebook/wav2vec2-base, trained on toy dataset, supporting audio masking tasks
Downloads 22
Release Time : 3/26/2022
Model Overview
This model is a variant based on the wav2vec2-base architecture, specifically optimized for audio masking tasks, suitable for speech recognition and audio feature extraction scenarios
Model Features
Audio Masking Capability
Specifically optimized for audio masking tasks, capable of effectively processing masked audio inputs
Lightweight Fine-tuning
Fine-tuned based on the pre-trained wav2vec2-base model, suitable for small-scale datasets
Progressive Performance Improvement
Word error rate gradually decreased from 1.0 to 0.7340 during training, showing a good learning curve
Model Capabilities
Speech Recognition
Audio Feature Extraction
Masked Audio Prediction
Use Cases
Speech Processing
Noisy Environment Speech Recognition
Performing speech recognition when audio is partially masked or interfered by noise
Word error rate 0.7340
Audio Data Augmentation
Used to generate training data for audio masking tasks
Featured Recommended AI Models
Š 2025AIbase