T

TIGER DnR

Developed by JusperLee
TIGER is a lightweight speech separation model that achieves efficient audio processing through frequency band segmentation and multi-scale feature extraction
Downloads 134
Release Time : 1/22/2025

Model Overview

TIGER is an efficient speech separation model that employs frequency band segmentation and interleaved modeling architecture, significantly reducing computational costs while maintaining high performance. Primarily used for speech separation, noise reduction, and reverberation elimination tasks.

Model Features

Efficient frequency band segmentation
Divides frequency bands using prior knowledge and compresses frequency information, significantly reducing computational costs
Multi-scale feature extraction
Utilizes multi-scale selective attention (MSA) modules to effectively extract contextual features
Lightweight design
Reduces parameter count by 94.3% and MACs by 95.3% while maintaining high performance
Real-world scenario adaptation
Performs exceptionally well on the EchoSet dataset containing complex noise and reverberation

Model Capabilities

Speech separation
Background noise elimination
Reverberation elimination
Multi-speaker speech separation

Use Cases

Speech enhancement
Meeting recording enhancement
Separates clear individual speech from recordings with multiple people speaking simultaneously
Outperforms the TF-GridNet model on the EchoSet dataset
Noisy environment speech processing
Eliminates background noise and reverberation to improve speech clarity
Effectively handles real-world reverberation influenced by object occlusion and material properties
Audio post-production
Film and TV audio restoration
Separates and enhances target speech from field recordings
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase