L

Lang Id Voxlingua107 Ecapa

Developed by speechbrain
A speech language identification model based on the SpeechBrain framework and ECAPA-TDNN architecture, supporting recognition and speech embedding extraction for 107 languages.
Downloads 330.01k
Release Time : 3/2/2022

Model Overview

This model adopts the ECAPA-TDNN architecture and is trained on the VoxLingua107 dataset. It can be used for speech language identification or as a speech segment feature extractor. Supports mono audio input with a 16kHz sampling rate.

Model Features

Multilingual Support
Supports recognition of 107 languages, covering major global languages and some minority languages.
Dual Purpose
Can be directly used for language identification or as a feature extractor for building specialized models.
High-Performance Architecture
Uses ECAPA-TDNN architecture with an error rate of only 6.7% on the VoxLingua107 development set.
Automatic Audio Processing
Built-in audio normalization, automatically handles sampling rate and channel conversion.

Model Capabilities

Speech Language Identification
Speech Feature Extraction
Multilingual Processing

Use Cases

Speech Processing
Multilingual Speech Classification
Identify the language category of speech segments.
6.7% error rate on the VoxLingua107 development set.
Speech Feature Extraction
Extract embedding vectors from speech segments for downstream tasks.
256-dimensional feature vectors.
Content Management
Multilingual Content Classification
Classify user-generated multilingual speech content for management.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase