W

Wav2vec2 Large Xlsr 53 Toy Train Data Masked Audio 10ms

Developed by scasutt
Speech recognition model fine-tuned based on facebook/wav2vec2-large-xlsr-53, optimized on 10ms audio masked training data
Downloads 22
Release Time : 3/28/2022

Model Overview

This model is an optimized version for speech recognition tasks, with improved recognition accuracy under specific conditions through fine-tuning

Model Features

10ms audio masked training
Uses a special training method with 10ms audio masking, potentially improving the model's ability to recognize short-term audio features
Fine-tuning optimization
Fine-tuned based on a pre-trained model, achieving better performance on specific datasets

Model Capabilities

Speech recognition
Audio feature extraction

Use Cases

Speech-to-text
Speech transcription
Convert speech content into text
Word error rate 0.4929
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase