Wav2vec2 Large Xlsr 53 Toy Train Data Masked Audio 10ms
Speech recognition model fine-tuned based on facebook/wav2vec2-large-xlsr-53, optimized on 10ms audio masked training data
Downloads 22
Release Time : 3/28/2022
Model Overview
This model is an optimized version for speech recognition tasks, with improved recognition accuracy under specific conditions through fine-tuning
Model Features
10ms audio masked training
Uses a special training method with 10ms audio masking, potentially improving the model's ability to recognize short-term audio features
Fine-tuning optimization
Fine-tuned based on a pre-trained model, achieving better performance on specific datasets
Model Capabilities
Speech recognition
Audio feature extraction
Use Cases
Speech-to-text
Speech transcription
Convert speech content into text
Word error rate 0.4929
Featured Recommended AI Models
Š 2025AIbase