Dptnet Libri1Mix Enhsingle 16k
Audio enhancement model trained based on the Asteroid framework, focusing on mono speech enhancement tasks
Downloads 4,446
Release Time : 3/2/2022
Model Overview
This model uses the DPTNet architecture and is trained on the enhanced mono task of the Libri1Mix dataset, aiming to improve the clarity and intelligibility of mono audio.
Model Features
Efficient audio processing
Trained with a 16kHz sampling rate and 3-second segment length, suitable for real-time audio processing scenarios
Deep time-frequency transformation network
Uses the DPTNet architecture, combining time-frequency transformation and deep neural networks for audio feature learning
Significant performance improvement
Achieves an SI-SDR improvement of 11.38dB and an STOI improvement of 0.13 on the test set
Model Capabilities
Mono speech enhancement
Audio quality improvement
Speech clarity enhancement
Use Cases
Voice communication
Voice call quality enhancement
Improves voice call clarity in noisy environments
SI-SDR improvement of 11.38dB, significantly improving speech intelligibility
Audio post-processing
Recording quality restoration
Enhances low-quality recordings
STOI improved to 0.93, approaching the level of original clear speech
Featured Recommended AI Models
Š 2025AIbase