Musicgen Stereo Melody Large

Developed by facebook
MusicGen is a text-to-music generation model that supports stereo and melody guidance, capable of producing high-quality music samples based on text descriptions or audio prompts.
Downloads 61
Release Time: 10/23/2023

Model Overview

MusicGen is an autoregressive music generation model based on the Transformer architecture. It generates 32 kHz stereo audio conditioned on a text description, optionally guided by a reference melody. The model operates on tokens from the EnCodec audio tokenizer and predicts all codebooks in a single pass rather than in separate stages.

Model Features

Stereo Support
Stereo generation is obtained by fine-tuning for 200,000 iterations; the two token streams are interleaved using the delay pattern
Melody Guidance
Generates music in the requested style from an input melody while preserving the melody's characteristics
Efficient Generation
Predicts the codebooks in parallel, so only 50 autoregressive steps are needed per second of audio, which reduces the number of sequential steps during generation
Multi-codebook Joint Prediction
Generates all 4 codebooks simultaneously without requiring staged processing
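
The delay pattern and the step count mentioned above can be illustrated with a small sketch. This is not the model's actual implementation: it assumes a frame rate of 50 token frames per second (taken from the "50 autoregressive steps per second" figure), 4 codebooks, and a delay of k steps for codebook k. The function names and the PAD placeholder are hypothetical and chosen only for this example.

```python
# Illustrative sketch of a codebook delay pattern (not the model's own code).
# Assumptions: 50 frames per second, 4 codebooks, codebook k delayed by k steps.

PAD = None  # placeholder for positions where a codebook has no token yet

FRAME_RATE_HZ = 50   # assumed from "50 autoregressive steps per second"
NUM_CODEBOOKS = 4


def apply_delay_pattern(codes, delays):
    """Shift each codebook stream right by its delay.

    codes:  list of K streams, each a list of T tokens
    delays: list of K non-negative integer delays
    Returns K rows of length T + max(delays), padded with PAD.
    """
    n_steps = len(codes[0]) + max(delays)
    delayed = []
    for stream, delay in zip(codes, delays):
        row = [PAD] * delay + list(stream)
        row += [PAD] * (n_steps - len(row))
        delayed.append(row)
    return delayed


def undo_delay_pattern(delayed, delays):
    """Inverse of apply_delay_pattern: realign all streams frame by frame."""
    n_frames = len(delayed[0]) - max(delays)
    return [row[d:d + n_frames] for row, d in zip(delayed, delays)]


def steps_with_delay(seconds, num_codebooks=NUM_CODEBOOKS):
    """Sequential steps when every step emits one token per codebook."""
    return seconds * FRAME_RATE_HZ + (num_codebooks - 1)


def steps_if_flattened(seconds, num_codebooks=NUM_CODEBOOKS):
    """Sequential steps if the codebooks were predicted one after another."""
    return seconds * FRAME_RATE_HZ * num_codebooks


if __name__ == "__main__":
    codes = [[1, 2, 3], [4, 5, 6], [7, 8, 9], [10, 11, 12]]
    for row in apply_delay_pattern(codes, [0, 1, 2, 3]):
        print(row)
    print(steps_with_delay(10), steps_if_flattened(10))
```

Each column of the delayed grid is one autoregressive step, so a clip of T frames needs T plus a few extra steps instead of 4 × T. For the stereo variant, one plausible layout (an assumption, not confirmed by this page) is to treat the two streams as 8 codebooks and pass delays such as [0, 0, 1, 1, 2, 2, 3, 3] to the same function.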

Model Capabilities

Text-to-music generation
Melody-guided music generation
Stereo audio synthesis
Multiple music style generation

Use Cases

Creative content generation
Background music creation
Generates customized background music for videos, games, and other content
Produces music matched to a scene from a text description
Melody expansion
Generate complete arrangements based on existing melody fragments
Enriches musical expression while preserving original melodic features
Music research
Music generation algorithm research
Used to explore cutting-edge AI music generation technologies