S

Sepformer Dns4 16k Enhancement

Developed by speechbrain
This is a speech enhancement model based on the SepFormer architecture, specifically designed for denoising tasks. It was trained on the Microsoft DNS-4 dataset and supports audio processing at a 16kHz sampling rate.
Downloads 1,669
Release Time : 8/6/2023

Model Overview

The model utilizes the SepFormer architecture to achieve speech enhancement, primarily for removing background noise and improving speech quality. It was trained on 1300 hours of the Microsoft DNS 4 dataset and is suitable for audio with a 16kHz sampling rate.

Model Features

High-performance Denoising
Excellent performance on the DNS4 2022 baseline development set, with DNSMOS SIG score of 2.999, BAK score of 3.076, and OVRL score of 2.437
Multilingual Support
Supports multiple languages including English, German, Russian, French, Italian, and Spanish
Transformer-based Architecture
Utilizes the advanced SepFormer architecture, combining the advantages of Transformer for speech separation and enhancement

Model Capabilities

Audio Denoising
Speech Quality Enhancement
Background Noise Suppression

Use Cases

Voice Communication
VoIP Call Enhancement
Improves the quality of internet voice calls by reducing background noise interference
Significantly improves call clarity
Audio Post-processing
Recording Denoising
Reduces noise in field recordings to improve speech intelligibility
Enhances recording quality, making speech clearer
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase