M

Musicgen Stereo Small

Developed by facebook
AI model that generates high-quality stereo music samples based on text descriptions, supporting 300M parameter scale
Downloads 7,091
Release Time : 10/23/2023

Model Overview

MusicGen is a text-to-music model that generates music through text prompts or audio references, using stereo technology to enhance spatial perception

Model Features

Stereo generation
Creates auditory experiences with directional and layered perception through dual-channel audio systems
Efficient parallel prediction
Uses delayed interleaved pattern processing for codebooks, requiring only 50 autoregressive steps per second of audio
Multi-scale options
Offers three parameter scales (300M/1.5B/3.3B) and two variants (text/melody)

Model Capabilities

Generate music based on text descriptions
Support style mixing (e.g., hip-hop + funk)
Generate stereo audio at 32kHz sampling rate
Supports generation of up to 256 new tokens

Use Cases

Music creation
Background music generation
Quickly generate customized soundtracks for videos/podcasts
Produces stereo music that matches the scene atmosphere
Music inspiration
Explore new music genres through style-mixing prompts
Generates experimental music segments blending multiple styles
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase