S

Sepformer Wsj03mix

Developed by speechbrain
This is an audio source separation model using the SepFormer architecture, trained on the WSJ0-3Mix dataset, capable of separating mixed speech into independent speech sources.
Downloads 158
Release Time : 3/2/2022

Model Overview

This model is based on the Transformer-architecture SepFormer implementation, specifically designed for audio source separation tasks, capable of separating multiple independent speech signals from mixed audio.

Model Features

High-performance separation
Achieves separation performance of 19.8 dB SI-SNRi and 20.0 dB SDRi on the WSJ0-3Mix test set
Transformer-based architecture
Utilizes the advanced SepFormer architecture with attention mechanisms for efficient speech separation
Ready-to-use model
Provides simple and easy-to-use interfaces for direct audio file separation

Model Capabilities

Speech separation
Multi-speaker separation
Audio source separation

Use Cases

Speech processing
Meeting recording separation
Separate independent speech from multi-speaker meeting recordings
Can clearly separate 3 simultaneous speech sources
Audio enhancement
Extract target speech signals from noisy mixed audio
Improves speech clarity and intelligibility
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase