Wavlm Basic N F N 8batch 5sec 0.0001lr Unfrozen
W
Wavlm Basic N F N 8batch 5sec 0.0001lr Unfrozen
Developed by reralle
A speech processing model fine-tuned based on microsoft/wavlm-large, achieving an accuracy of 73.33% on the evaluation set
Downloads 14
Release Time : 4/27/2023
Model Overview
This model is a speech processing model based on the WavLM architecture, fine-tuned for specific speech recognition or classification tasks
Model Features
Efficient fine-tuning
Fine-tuned with a learning rate of 0.0001, achieving good results on limited data
Stable training
Accuracy steadily improved during training, from an initial 16.67% to 73.33%
Batch optimization
Adopted a batch size of 8 and gradient accumulation steps of 4, resulting in a total training batch size of 32
Model Capabilities
Voice feature extraction
Speech classification
Speech recognition
Use Cases
Speech processing
Speech emotion recognition
Identify emotion categories in speech
Accuracy 73.33%, F1 score 73.08%
Voice command classification
Classify voice commands
Featured Recommended AI Models