Wavlm Basic S R 5c 8batch 5sec 0.0001lr Unfrozen
Developed by reralle
A speech processing model fine-tuned from microsoft/wavlm-large. It reaches 75% accuracy on the evaluation set.
Downloads 16
Release Time: 4/30/2023
Model Overview
This model is a fine-tuned variant of the WavLM architecture for speech processing tasks. It is suited to analyzing short audio segments.
Model Features
Efficient fine-tuning
Fine-tuned at a learning rate of 0.0001, chosen to preserve the core capabilities of the pre-trained model
Short audio processing
Optimized for 5-second audio clips, which suits short-segment and near-real-time processing
Stable training
Uses gradient accumulation (4 steps) and a linear learning rate schedule to keep training stable
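The training settings above can be illustrated with a small sketch. It assumes the "8batch" in the model name is a per-device batch size of 8, which with 4 accumulation steps would give 32 examples per optimizer update, and it assumes the linear schedule decays to zero with no warmup unless one is passed in. The function names are hypothetical and not taken from the author's training script.

```python
def effective_batch_size(per_device_batch: int, accumulation_steps: int) -> int:
    """Examples contributing to each optimizer update on one device."""
    return per_device_batch * accumulation_steps


def linear_lr(step: int, total_steps: int, base_lr: float = 1e-4,
              warmup_steps: int = 0) -> float:
    """Optional linear warmup, then linear decay from base_lr to zero.

    warmup_steps=0 is an assumption; the card does not say whether
    warmup was used.
    """
    if warmup_steps > 0 and step < warmup_steps:
        return base_lr * step / warmup_steps
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


if __name__ == "__main__":
    print(effective_batch_size(8, 4))
    for step in (0, 250, 500, 750, 1000):
        print(step, linear_lr(step, total_steps=1000))
```

Accumulating gradients over several small batches approximates a larger batch without the memory cost, which is one common reason to pair it with a large backbone such as wavlm-large.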
Model Capabilities
Voice feature extraction
Short audio classification
Speech pattern recognition
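Because the model targets 5-second clips, inputs would typically be brought to a fixed length before classification. The sketch below shows one way to do that, assuming 16 kHz mono audio (the rate the WavLM checkpoints expect), truncation from the start of the clip, and zero padding at the end. The author's actual preprocessing is not documented here, and `fix_length` is a hypothetical helper.

```python
SAMPLE_RATE = 16_000  # WavLM checkpoints expect 16 kHz audio
CLIP_SECONDS = 5      # clip length stated on this card


def fix_length(samples, sample_rate=SAMPLE_RATE, seconds=CLIP_SECONDS):
    """Truncate or zero-pad a sequence of samples to exactly `seconds`."""
    target = sample_rate * seconds
    clipped = list(samples[:target])          # keep at most `target` samples
    padding = [0.0] * (target - len(clipped)) # empty when already long enough
    return clipped + padding


if __name__ == "__main__":
    short_clip = [0.1] * 48_000   # 3 seconds of audio
    long_clip = [0.1] * 112_000   # 7 seconds of audio
    print(len(fix_length(short_clip)), len(fix_length(long_clip)))
```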
Use Cases
Speech analysis
Emotion recognition
Analyze emotional tendencies in short speech segments
75% accuracy
Voice command classification
Identify categories of short voice commands
F1 score 0.75
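For reference, accuracy and F1 figures like those above are computed from predicted and true labels. The sketch below uses macro averaging for F1, which is an assumption since the card does not state the averaging mode. The labels are toy data for illustration only and are unrelated to the model's evaluation set.

```python
def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true label."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 scores."""
    labels = sorted(set(y_true) | set(y_pred))
    scores = []
    for label in labels:
        tp = sum(t == label and p == label for t, p in zip(y_true, y_pred))
        fp = sum(t != label and p == label for t, p in zip(y_true, y_pred))
        fn = sum(t == label and p != label for t, p in zip(y_true, y_pred))
        denom = 2 * tp + fp + fn
        # F1 = 2*TP / (2*TP + FP + FN); defined as 0 when the class never appears
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)


if __name__ == "__main__":
    toy_true = [0, 1, 2, 2, 1]  # illustrative labels only
    toy_pred = [0, 1, 2, 1, 1]
    print(accuracy(toy_true, toy_pred), macro_f1(toy_true, toy_pred))
```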