X

Xcodec2

Developed by HKUSTAudio
XCodec2 is a voice tokenizer supporting multilingual voice semantic understanding and high-quality voice reconstruction
Downloads 32.36k
Release Time : 1/7/2025

Model Overview

XCodec2 is a voice tokenizer optimized for training and inference computation scale based on LLaMA voice synthesis, featuring single vector quantization and 50 tokens per second, supporting multilingual voice semantic understanding and high-quality voice reconstruction.

Model Features

Single Vector Quantization
Supports efficient voice encoding and decoding
Efficient Token Generation
Generates 50 tokens per second for fast voice processing
Multilingual Support
Supports multilingual voice semantic understanding and reconstruction
High-Quality Reconstruction
Achieves high-quality voice reconstruction

Model Capabilities

Voice Encoding
Voice Decoding
Voice Semantic Understanding
Voice Reconstruction

Use Cases

Voice Processing
Voice Compression and Reconstruction
Compresses voice signals into tokens and reconstructs them into high-quality voice
High-quality voice reconstruction
Multilingual Voice Processing
Supports semantic understanding and processing of multilingual voice
Cross-language voice applications
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase