Chinese-Hubert-Large Open-Source Speech Model - Trained with Massive Speech Data and High Practical Value!

Home

Chinese Hubert Large

Developed by TencentGameMate

This is a speech model pretrained on a 10,000-hour WenetSpeech L subset, released under the MIT license.

Speech Recognition

Transformers

Open Source License:MIT #Chinese Speech Pretraining #WenetSpeech Dataset #Half-precision Inference

Downloads 2,966

Release Time : 6/2/2022

Model Overview

This model is a pretrained model for Chinese speech data, primarily used for feature extraction and representation learning in speech-related tasks. The model does not include a tokenizer; an additional tokenizer needs to be constructed for downstream tasks such as speech recognition.

Model Features

Large-scale Chinese Speech Pretraining

Pretrained on a 10,000-hour WenetSpeech L subset, with robust speech feature extraction capabilities.

Half-precision Support

Supports half-precision (FP16) inference, improving computational efficiency and reducing memory usage.

Flexible Downstream Task Adaptation

Can serve as a foundational model for various speech-related tasks, such as speech recognition and speech classification.

Model Capabilities

Speech Feature Extraction

Speech Representation Learning

Use Cases

Speech Processing

Speech Recognition Foundation Model

Can be used as a feature extractor for speech recognition systems, requiring additional tokenizer construction and fine-tuning.

Speech Classification Tasks

Can be applied to classification tasks such as speech emotion analysis and speaker recognition.

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

Chinese Hubert Large

Model Overview

Model Features

Model Capabilities

Use Cases

🚀 Chinese Speech Pretrained Model

🚀 Quick Start

📦 Installation

💻 Usage Examples

Basic Usage

📄 License