S

Seamless M4t V2 Large Speech Encoder

Developed by WueNLP
Speech encoder module extracted from SeamlessM4Tv2-Large, excelling in cross-language and multilingual sequence-level audio classification tasks
Downloads 67
Release Time : 11/18/2024

Model Overview

This model is a multilingual speech encoder specifically designed for audio classification tasks, supporting over 100 languages.

Model Features

Multilingual support
Supports speech encoding and classification for over 100 languages
Audio classification
Excels in cross-language and multilingual sequence-level audio classification tasks
Efficient processing
Optimized for processing 16kHz audio waveforms

Model Capabilities

Audio feature extraction
Multilingual audio classification
Speech encoding

Use Cases

Speech recognition
Multilingual speech classification
Classifying speech in multiple languages
Performs excellently on the SIB-Fleurs dataset
Speech processing
Speech feature extraction
Extracting useful features from speech
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase