W

Wavlm Basic S R 5c 8batch 5sec 0.0001lr Unfrozen

Developed by reralle
A speech processing model fine-tuned based on microsoft/wavlm-large, achieving 75% accuracy on the evaluation set
Downloads 16
Release Time : 4/30/2023

Model Overview

This model is a variant of the WavLM architecture optimized for speech processing tasks, suitable for short audio segment analysis

Model Features

Efficient fine-tuning
Fine-tuned with a learning rate of 0.0001 to preserve the core capabilities of the pre-trained model
Short audio processing
Optimized for 5-second audio clips, suitable for real-time processing scenarios
Stable training
Utilizes gradient accumulation (4 steps) and linear learning rate scheduling to ensure training stability

Model Capabilities

Voice feature extraction
Short audio classification
Speech pattern recognition

Use Cases

Speech analysis
Emotion recognition
Analyze emotional tendencies in short speech segments
75% accuracy
Voice command classification
Identify categories of short voice commands
F1 score 0.75
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase