M

Magnet Small 10secs

Developed by facebook
MAGNeT is a text-to-music and text-to-audio model capable of generating high-quality audio samples from text descriptions.
Downloads 976
Release Time : 1/10/2024

Model Overview

MAGNeT is a masked generative non-autoregressive Transformer based on a 32kHz EnCodec tokenizer, trained using 4 codebooks sampled at 50 Hz. It does not require semantic token conditioning or model cascading, using a single non-autoregressive Transformer to generate all 4 codebooks.

Model Features

Non-autoregressive generation
Uses a single non-autoregressive Transformer to generate all codebooks without cascading models
High-quality audio generation
Capable of generating high-quality audio samples at 32kHz sampling rate from text descriptions
Diverse style support
Supports generating various music styles including hip-hop, funk house, lo-fi, etc.

Model Capabilities

Text-to-music generation
Text-to-sound effects generation
Short audio clip generation (10 seconds)

Use Cases

Music creation
Background music generation
Generate background music for videos, podcasts, etc.
Produces 10-second music clips
Music inspiration exploration
Explore creative possibilities of different music styles through text prompts
Generates diverse music samples
Sound design
Game sound effects generation
Generate environmental sound effects for game scenes
Produces 10-second sound effect clips
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase