Doohickey Mega
A stable diffusion model series optimized for high-resolution image synthesis, fine-tuned based on Stable Diffusion v1-5, supporting multiple aspect ratios
Downloads 186
Release Time : 11/12/2022
Model Overview
A text-to-image generation model fine-tuned from runwayml/stable-diffusion-v1-5, specially optimized for image synthesis quality around 768x768 resolution, supporting multiple output aspect ratios
Model Features
High-resolution optimization
Specially fine-tuned for resolutions around 768x768 to generate high-quality images
Multi-aspect ratio support
Supports various output ratios from 640x640 to 768x768 (e.g., 768x640/704x768)
Hand detail optimization
Versions v3-6000 and later specifically optimize hand detail performance
Improved CLIP model
Version v3 uses the laion/CLIP-ViT-L-14-laion2B-s32B-b82K model with synchronized fine-tuning
Model Capabilities
Text-to-image generation
High-resolution image synthesis
Multi-aspect ratio image generation
Use Cases
Creative design
Concept art creation
Generate high-resolution concept art images based on text descriptions
High-quality artwork at 768x768 resolution
Digital illustration generation
Quickly generate illustrations in various styles
Professional-grade illustrations supporting multiple aspect ratios
Commercial applications
Advertising material generation
Quickly generate high-quality visual materials for marketing campaigns
HD images ready for commercial use
Featured Recommended AI Models