M

Musicgen Stereo Large

Developed by facebook
MusicGen is a text-to-music generation model developed by Meta AI, supporting stereo generation and capable of producing high-quality music samples based on text descriptions or audio prompts.
Downloads 382
Release Time : 10/23/2023

Model Overview

MusicGen employs a single-stage autoregressive Transformer architecture, trained on a 32kHz sampled EnCodec tokenizer, supporting stereo effect generation without requiring self-supervised semantic representations to generate all codebooks at once.

Model Features

Stereo support
Achieves stereo effects through dual token streams and delay mode interleaving, enhancing spatial perception and directionality.
Efficient generation
Utilizes parallel prediction technology, requiring only 50 autoregressive steps per second of audio, significantly improving generation efficiency.
Melody guidance
Supports music generation via text descriptions or existing melody prompts, enhancing creative control.
Multi-scale models
Offers three parameter scales: 300M/1.5B/3.3B, catering to different computational resource needs.

Model Capabilities

Text-to-music generation
Melody-guided generation
Stereo generation
High-quality music sample generation

Use Cases

Music creation
Background music generation
Automatically generates matching background music based on scene descriptions
Produces stereo audio at 32kHz sampling rate
Melody extension
Generates complete arrangements based on existing melody fragments
Maintains diverse variations of original melody features
Academic research
Generative model research
Explores limitations and improvement directions of audio generation models
Provides quantifiable objective evaluation metrics
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase