L

Lang Id Voxlingua107 Ecapa

Developed by apenasissso
ECAPA-TDNN based spoken language identification model trained on VoxLingua107 dataset, supporting classification of 107 languages
Downloads 19
Release Time : 10/23/2023

Model Overview

This model is used for spoken language identification and speech segment-level feature extraction, employing the ECAPA-TDNN architecture and trained on the VoxLingua107 dataset

Model Features

Multilingual support
Supports recognition of 107 different languages, covering major global languages
ECAPA-TDNN architecture
Utilizes advanced ECAPA-TDNN architecture optimized for speech embedding extraction performance
Automatic audio processing
Automatically processes audio at 16kHz sampling rate, including resampling and mono conversion
Dual purpose
Can be directly used for language identification or as a feature extractor for downstream tasks

Model Capabilities

Spoken language identification
Speech feature extraction
Multilingual classification

Use Cases

Language identification
Multilingual speech classification
Identify the language category of speech segments
Achieves 6.7% error rate on VoxLingua107 development set
Speech processing
Speech embedding extraction
Extract feature vectors from speech segments for downstream tasks
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase