# Text-to-Music
Mustango
Apache-2.0
Mustango is a novel multimodal large language model specifically designed for controllable music generation, combining Latent Diffusion Model (LDM), Flan-T5, and music features to achieve high-quality text-to-music generation.
Text-to-Audio
Transformers

M
declare-lab
165
40
Musicgen Medium
MusicGen is a text-to-music model that generates high-quality music samples based on text descriptions or audio prompts, utilizing a 1.5-billion-parameter autoregressive Transformer architecture.
Audio Generation
Transformers

M
facebook
1.5M
118
Featured Recommended AI Models