Musicgen Stereo Melody

Developed by facebook
MusicGen is a text-to-music model developed by Meta AI that generates stereo music samples from text descriptions or audio prompts.
Downloads 82
Release Time: 10/23/2023

Model Overview

A Transformer-based autoregressive music generation model that produces 32 kHz stereo music from text descriptions or melody prompts. It does not rely on a self-supervised semantic representation and generates all audio codebooks in a single pass.

Model Features

Stereo generation
Produces stereo output by interleaving two streams of EnCodec tokens, giving a wider spatial image than the mono versions.
Melody control
Supports input reference melodies, with generated music maintaining the original melodic contour.
Efficient generation
Introduces a small delay between codebooks so that they can be predicted in parallel, requiring only 50 autoregressive steps per second of audio.
Parallel multi-codebook
Predicts all 4 EnCodec codebooks in a single pass, with no staged or cascaded generation.
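
The delay idea behind the last two features can be illustrated with a small sketch. It assumes the simplest layout, in which codebook k lags k steps behind codebook 0, and it ignores stereo interleaving and special start tokens. The function names and the padding convention are illustrative and are not taken from the model's code.

```python
FRAME_RATE_HZ = 50   # autoregressive steps per second of audio (from the model description)
NUM_CODEBOOKS = 4    # EnCodec codebooks predicted together


def delay_pattern(num_codebooks, num_frames, pad=None):
    """Return grid[step][codebook] holding the frame index predicted at that step.

    Assumed layout: codebook k is shifted k steps later than codebook 0, so a
    single step emits one token for every codebook, each for a different frame.
    Slots with no frame to predict hold `pad`.
    """
    num_steps = num_frames + num_codebooks - 1
    grid = []
    for step in range(num_steps):
        row = []
        for k in range(num_codebooks):
            frame = step - k  # codebook k lags k steps behind codebook 0
            row.append(frame if 0 <= frame < num_frames else pad)
        grid.append(row)
    return grid


def autoregressive_steps(duration_s, frame_rate=FRAME_RATE_HZ, num_codebooks=NUM_CODEBOOKS):
    """Number of decoding steps for a clip under the assumed delay layout."""
    frames = int(duration_s * frame_rate)
    return frames + num_codebooks - 1


if __name__ == "__main__":
    # Show the staggered layout for a tiny 3-frame example.
    for step, row in enumerate(delay_pattern(NUM_CODEBOOKS, 3)):
        print(step, row)
    print(autoregressive_steps(8), "steps for an 8-second clip")
```

Under this layout the step count grows with the clip length plus a small constant, rather than with the clip length multiplied by the number of codebooks, which is what a fully flattened token sequence would require.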

Model Capabilities

Text-to-music generation
Melody-guided music generation
Stereo audio synthesis
Music style transfer
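
As a rough illustration of melody-guided generation, the sketch below uses the audiocraft library and the checkpoint name facebook/musicgen-stereo-melody. It assumes audiocraft and torchaudio are installed and that the weights can be downloaded. The function name, its parameters, and the output file prefix are hypothetical choices made for this example.

```python
def generate_from_melody(descriptions, melody_path, duration_s=8, out_prefix="sample"):
    """Generate one clip per text description, each conditioned on the same melody.

    Heavy dependencies are imported inside the function so that importing this
    module does not require them to be installed.
    """
    import torchaudio
    from audiocraft.models import MusicGen
    from audiocraft.data.audio import audio_write

    # Load the stereo melody checkpoint and set the clip length in seconds.
    model = MusicGen.get_pretrained("facebook/musicgen-stereo-melody")
    model.set_generation_params(duration=duration_s)

    # melody has shape [channels, samples]; sr is its sample rate.
    melody, sr = torchaudio.load(melody_path)
    # Repeat the melody once per description: [batch, channels, samples].
    melody_batch = melody[None].expand(len(descriptions), -1, -1)

    # Condition generation on both the text and the melody.
    wav = model.generate_with_chroma(descriptions, melody_batch, sr)

    stems = []
    for idx, one_wav in enumerate(wav):
        stem = f"{out_prefix}_{idx}"
        # Write each clip with loudness normalization at the model's sample rate.
        audio_write(stem, one_wav.cpu(), model.sample_rate, strategy="loudness")
        stems.append(stem)
    return stems
```

A call such as generate_from_melody(["upbeat 80s pop with heavy drums"], "melody.wav") would write one audio file, where melody.wav is a placeholder for a user-provided recording. For text-only generation, model.generate(descriptions) can be used in place of generate_with_chroma.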

Use Cases

Creative assistance
Background music generation
Automatically generates matching background music based on scene descriptions
Produces 8- to 30-second music clips in various styles
Melody expansion
Develops complete arrangements based on user-provided simple melodies
Adds harmony and rhythm while preserving original melodic characteristics
Research applications
Generative model research
Explores architectures and control methods for audio generation models
Provides comparable baseline models