W

Wavlm Basic N F N 8batch 5sec 0.0001lr Unfrozen

Developed by reralle
A speech processing model fine-tuned based on microsoft/wavlm-large, achieving an accuracy of 73.33% on the evaluation set
Downloads 14
Release Time : 4/27/2023

Model Overview

This model is a speech processing model based on the WavLM architecture, fine-tuned for specific speech recognition or classification tasks

Model Features

Efficient fine-tuning
Fine-tuned with a learning rate of 0.0001, achieving good results on limited data
Stable training
Accuracy steadily improved during training, from an initial 16.67% to 73.33%
Batch optimization
Adopted a batch size of 8 and gradient accumulation steps of 4, resulting in a total training batch size of 32

Model Capabilities

Voice feature extraction
Speech classification
Speech recognition

Use Cases

Speech processing
Speech emotion recognition
Identify emotion categories in speech
Accuracy 73.33%, F1 score 73.08%
Voice command classification
Classify voice commands
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase