AIbase
Home
AI Tools
AI Models
MCP
AI NEWS
EN
Model Selection
Tags
Multimodal Diffusion

# Multimodal Diffusion

Mmada 8B MixCoT
MIT
MMaDA is a novel class of multimodal diffusion foundation models, excelling in various domains such as text reasoning, multimodal understanding, and text-to-image generation.
Text-to-Image Transformers
M
Gen-Verse
601
3
Mmada 8B Base
MIT
MMaDA is a novel multimodal diffusion foundation model that excels in text reasoning, multimodal understanding, and text-to-image generation.
Text-to-Image Transformers
M
Gen-Verse
6,304
56
Text To Video Lvd Ms
This model combines large language models with video diffusion technology, supporting text-to-video generation and allowing control over video content through bounding box conditional input.
Text-to-Video
T
longlian
91
2
Altdiffusion M9
Openrail
AltDiffusion-m9 is a multilingual text-to-image generation model based on the Stable Diffusion framework, supporting 9 languages and powered by the AltCLIP-m9 multilingual CLIP model.
Text-to-Image Supports Multiple Languages
A
BAAI
46
70
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
English简体中文繁體中文にほんご
© 2025AIbase