M

Magnet Small 30secs

Developed by facebook
MAGNeT is a text-to-music and text-to-sound model capable of generating high-quality audio samples from text descriptions.
Downloads 215
Release Time : 1/10/2024

Model Overview

MAGNeT is a masked generative non-autoregressive Transformer based on a 32kHz EnCodec tokenizer, using 4 codebooks sampled at 50Hz. It requires no semantic token conditioning or model cascading, using a single non-autoregressive Transformer to generate all 4 codebooks.

Model Features

Single non-autoregressive Transformer
Uses a single non-autoregressive Transformer to generate all 4 codebooks without model cascading.
High-quality audio generation
Capable of generating high-quality audio samples from text descriptions.
Multi-codebook generation
Uses 4 codebooks sampled at 50Hz for audio generation.

Model Capabilities

Text-to-music generation
Text-to-sound generation
High-quality audio synthesis

Use Cases

Music creation
Generate music in specific styles
Generate music in specific styles based on text descriptions, such as 80s hip-hop style funky house music.
Produces 30-second high-quality music samples.
Podcast background music
Generate engaging rhythms suitable for podcast intros.
Produces 30-second high-quality background music.
Sound effect generation
Generate specific sound effects
Generate specific sound effects based on text descriptions, such as natural environment sounds or mechanical sounds.
Produces 30-second high-quality sound effect samples.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase