M

Musicgen Large

Developed by facebook
MusicGen is a text-to-music generation model capable of producing high-quality music samples based on text descriptions or audio prompts.
Downloads 5,125
Release Time : 6/8/2023

Model Overview

MusicGen employs a single-stage autoregressive Transformer architecture, trained on a 32kHz EnCodec tokenizer using four 50Hz sampled codebooks. It generates all four codebooks in one pass without requiring self-supervised semantic representations.

Model Features

Efficient Generation
Parallel prediction of codebooks requires only 50 autoregressive steps per second of audio, significantly improving generation efficiency.
High-Quality Output
Trained on a 32kHz EnCodec tokenizer, it produces high-quality music samples.
Flexible Control
Supports both text descriptions and audio prompts as input, offering more flexible control over music generation.

Model Capabilities

Text-to-Music Generation
Music Style Transfer
Melody-Guided Generation

Use Cases

Music Creation
Background Music Generation
Generates custom background music for videos, games, and other content.
Can produce music that matches specific styles and moods.
Music Inspiration
Provides musicians with creative inspiration and material.
Quickly generates music snippets in various styles.
Research Applications
Generative Model Research
Explores the limitations and possibilities of music generation models.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase