C

Chinese Hubert Large

Developed by TencentGameMate
This is a speech model pretrained on a 10,000-hour WenetSpeech L subset, released under the MIT license.
Downloads 2,966
Release Time : 6/2/2022

Model Overview

This model is a pretrained model for Chinese speech data, primarily used for feature extraction and representation learning in speech-related tasks. The model does not include a tokenizer; an additional tokenizer needs to be constructed for downstream tasks such as speech recognition.

Model Features

Large-scale Chinese Speech Pretraining
Pretrained on a 10,000-hour WenetSpeech L subset, with robust speech feature extraction capabilities.
Half-precision Support
Supports half-precision (FP16) inference, improving computational efficiency and reducing memory usage.
Flexible Downstream Task Adaptation
Can serve as a foundational model for various speech-related tasks, such as speech recognition and speech classification.

Model Capabilities

Speech Feature Extraction
Speech Representation Learning

Use Cases

Speech Processing
Speech Recognition Foundation Model
Can be used as a feature extractor for speech recognition systems, requiring additional tokenizer construction and fine-tuning.
Speech Classification Tasks
Can be applied to classification tasks such as speech emotion analysis and speaker recognition.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase