
mHuBERT Base 25Hz

Developed by slprl
A variant of Meta's HuBERT model introduced in the TWIST paper, where it proved valuable as a speech tokenizer for training speech language models (SpeechLMs).
Downloads 10.63k
Release Date: 10/24/2024

Model Overview

This HuBERT model extracts speech features and is suited to tasks such as spoken language modeling and speaking style conversion.

Model Features

25Hz feature rate
Adds a stride-2 convolutional layer to the CNN encoder, which halves the feature rate so that the model outputs features at 25Hz.
Multilingual support
Trained on a combination of multilingual datasets.
Speech tokenizer
Proved valuable as a speech tokenizer for training SpeechLMs in the TWIST paper.
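
The 25Hz figure can be checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that this page does not state: the input audio is sampled at 16 kHz, and the base CNN encoder uses the standard HuBERT Base strides (5, 2, 2, 2, 2, 2, 2). Under those assumptions the base encoder emits one frame every 320 samples (50Hz), and one extra stride-2 layer doubles that hop to 640 samples (25Hz).

```python
# Assumptions (not stated on this page): 16 kHz input audio and the
# standard HuBERT Base convolutional strides.
SAMPLE_RATE_HZ = 16_000
BASE_STRIDES = (5, 2, 2, 2, 2, 2, 2)
EXTRA_STRIDE = 2  # the added stride-2 convolutional layer


def total_stride(strides):
    """Multiply the per-layer strides to get the hop size in samples."""
    product = 1
    for stride in strides:
        product *= stride
    return product


def feature_rate_hz(sample_rate_hz, strides):
    """Frames per second produced by a stack of strided convolutions."""
    return sample_rate_hz / total_stride(strides)


base_rate = feature_rate_hz(SAMPLE_RATE_HZ, BASE_STRIDES)
reduced_rate = feature_rate_hz(SAMPLE_RATE_HZ, BASE_STRIDES + (EXTRA_STRIDE,))

print(base_rate)     # 50.0
print(reduced_rate)  # 25.0
```

A lower feature rate means fewer tokens per second of audio, so a SpeechLM trained on these features sees shorter sequences for the same utterance.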

Model Capabilities

Speech feature extraction
Spoken language modeling
Speaking style conversion

Use Cases

Speech processing
Spoken language modeling
Used for building spoken language models
Speaking style conversion
Used for converting the speaking style of speech
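
For spoken language modeling, continuous features are usually turned into discrete units before a language model is trained on them. The sketch below shows one common recipe, nearest-centroid quantization followed by collapsing repeated units. It is an assumption for illustration, not a procedure confirmed by this page: the centroids, the 2-dimensional toy frames, and the function names are all hypothetical, and real feature vectors have many more dimensions.

```python
def nearest_centroid(frame, centroids):
    """Return the index of the centroid closest to one feature frame."""
    best_index, best_dist = 0, float("inf")
    for index, centroid in enumerate(centroids):
        # Squared Euclidean distance between the frame and this centroid.
        dist = sum((a - b) ** 2 for a, b in zip(frame, centroid))
        if dist < best_dist:
            best_index, best_dist = index, dist
    return best_index


def tokenize(frames, centroids):
    """Map each feature frame to a discrete unit id."""
    return [nearest_centroid(frame, centroids) for frame in frames]


def collapse_repeats(units):
    """Merge runs of identical consecutive units into a single unit."""
    collapsed = []
    for unit in units:
        if not collapsed or collapsed[-1] != unit:
            collapsed.append(unit)
    return collapsed


# Hypothetical toy data: three centroids and five frames.
toy_centroids = [(0.0, 0.0), (1.0, 1.0), (0.0, 1.0)]
toy_frames = [(0.1, 0.0), (0.2, 0.1), (0.9, 1.1), (0.1, 0.9), (0.0, 1.0)]

units = tokenize(toy_frames, toy_centroids)
deduped = collapse_repeats(units)

print(units)    # [0, 0, 1, 2, 2]
print(deduped)  # [0, 1, 2]
```

In practice the centroids would come from clustering features extracted over a training corpus, and the resulting unit sequences would serve as the vocabulary for the spoken language model.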