Dcunet Libri1Mix Enhsingle 16k
Audio enhancement model trained based on the Asteroid framework, specifically designed for mono speech enhancement tasks
Downloads 69
Release Time : 3/2/2022
Model Overview
This model adopts the DCUNet-20 architecture, trained on the Libri1Mix dataset, aimed at improving mono audio quality, especially suitable for speech enhancement scenarios
Model Features
High-performance speech enhancement
Achieved a 13.15dB SI-SDR improvement and a 0.92 STOI score on the Libri1Mix test set
Deep Complex U-Net architecture
Utilizes a 20-layer DCUNet structure, specifically designed for processing complex spectrogram audio signals
Fixed-length processing
Supports fixed-length audio processing in padding mode, suitable for batch processing
Model Capabilities
Mono speech enhancement
Audio quality improvement
Noise suppression
Use Cases
Speech processing
Call quality enhancement
Improves speech clarity and intelligibility in voice calls
SI-SDR improvement of 9.7dB, STOI improvement of 12.4%
Speech recognition preprocessing
Serves as a front-end processing module for ASR systems to improve recognition accuracy
Featured Recommended AI Models
Š 2025AIbase