Hitokomoru-diffusion Open Source AI Painting Model - Supports Non-square Images and Tag Input, Superb at Drawing Anime!

Hitokomoru Diffusion

Developed by Linaqruf

A latent diffusion model trained on works by the Japanese artist Hitokomoru, supporting non-square resolution generation and Danbooru tag input

Image Generation | Open Source | License: OpenRAIL | Tags: anime-style generation, Danbooru tag support, aspect ratio adaptation

Downloads 130

Release Time: 11/21/2022

Model Overview

This is a latent diffusion model trained on works by the Japanese artist Hitokomoru and designed for generating anime-style images. The model supports text-to-image generation and allows content control through Danbooru tags.

Model Features

Multi-stage training checkpoints

Provides four model variants at 5000, 10000, 15000, and 20000 training steps, allowing users to choose models from different training stages based on their needs

Aspect ratio support

Trained using NovelAI's aspect ratio bucketing tool, supporting non-square resolution image generation

Tag-controlled generation

Supports precise control over image style and content using Danbooru tags
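
As a rough illustration of the aspect ratio bucketing mentioned above, the sketch below groups training images into resolution buckets so that non-square images do not need to be cropped to a square. It is not NovelAI's actual tool: the function names and the default limits (a 512x768 maximum area, sides that are multiples of 64, sides between 256 and 1024) are assumptions chosen for illustration.

```python
from typing import List, Tuple


def make_buckets(max_area: int = 512 * 768, step: int = 64,
                 min_side: int = 256, max_side: int = 1024) -> List[Tuple[int, int]]:
    """Enumerate (width, height) buckets whose sides are multiples of `step`
    and whose area does not exceed `max_area` (all limits are assumed values)."""
    buckets = set()
    for width in range(min_side, max_side + 1, step):
        # Largest height (a multiple of `step`) that keeps width * height <= max_area.
        height = min(max_side, (max_area // width) // step * step)
        if height >= min_side:
            buckets.add((width, height))
            buckets.add((height, width))  # mirrored portrait/landscape bucket
    return sorted(buckets)


def assign_bucket(image_size: Tuple[int, int],
                  buckets: List[Tuple[int, int]]) -> Tuple[int, int]:
    """Pick the bucket whose aspect ratio is closest to the image's."""
    width, height = image_size
    ratio = width / height
    return min(buckets, key=lambda b: abs(b[0] / b[1] - ratio))


buckets = make_buckets()
# A 1000x1500 portrait image is assigned to the portrait bucket nearest to 2:3.
print(assign_bucket((1000, 1500), buckets))
```

During training, each batch would then be drawn from a single bucket so that all images in the batch share one resolution.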

Model Capabilities

Text-to-image generation

Anime-style image generation

Flexible resolution control

Tag-controlled image content

Use Cases

Anime creation

Anime character design

Generate anime characters with specific styles using Danbooru tags

Can generate anime characters with specific hairstyles, eye colors, and clothing styles

Scene creation

Generate anime scenes with complex backgrounds

Can generate anime scenes featuring complex backgrounds such as flower fields and skies

🚀 Hitokomoru Diffusion

Hitokomoru Diffusion is a latent diffusion model trained on artwork by the Japanese artist Hitokomoru, enabling high-quality text-to-image generation.

🚀 Quick Start

Hitokomoru Diffusion is a latent diffusion model trained on artwork by the Japanese artist ヒトこもる/Hitokomoru. The current model was fine-tuned with a learning rate of 2.0e-6 for 20,000 training steps (80 epochs) on 255 images collected from Danbooru. It was trained with the NovelAI Aspect Ratio Bucketing Tool, which allows training at non-square resolutions. Like other anime-style Stable Diffusion models, it also supports Danbooru tags for generating images.

e.g. 1girl, white hair, golden eyes, beautiful eyes, detail, flower meadow, cumulonimbus clouds, lighting, detailed sky, garden
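
A tag prompt like the one above is just a comma-separated string, so it can be assembled from a list of tags. The helper below is a minimal sketch; `build_prompt` is a hypothetical name, and dropping duplicate tags is a convenience assumed here, not something the model requires.

```python
from typing import Iterable


def build_prompt(tags: Iterable[str]) -> str:
    """Join Danbooru-style tags into a comma-separated prompt, dropping blanks
    and case-insensitive duplicates while keeping first-seen order."""
    seen = set()
    cleaned = []
    for tag in tags:
        tag = " ".join(tag.split())  # trim and collapse whitespace
        key = tag.lower()
        if tag and key not in seen:
            seen.add(key)
            cleaned.append(tag)
    return ", ".join(cleaned)


prompt = build_prompt(["1girl", "white hair", "golden eyes", " flower meadow ", "white hair"])
negative = build_prompt("lowres, bad anatomy, bad hands, missing fingers, bad hands".split(","))
print(prompt)
print(negative)
```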

There are 4 variations of this model available so far:

hitokomoru-5000.ckpt, the checkpoint trained for 5,000 steps.
hitokomoru-10000.ckpt, the checkpoint trained for 10,000 steps.
hitokomoru-15000.ckpt, the checkpoint trained for 15,000 steps.
hitokomoru-20000.ckpt, the checkpoint trained for 20,000 steps.
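
To try a specific training stage, one option is to download the corresponding .ckpt file and load it locally. The sketch below assumes the file names listed above (the files actually published may be named differently, for example with a "-pruned" suffix) and a recent diffusers release that provides `StableDiffusionPipeline.from_single_file`. The helper names and the `model_dir` parameter are hypothetical.

```python
from pathlib import Path

# Training stages listed above.
CHECKPOINT_STEPS = (5000, 10000, 15000, 20000)


def checkpoint_filename(steps: int) -> str:
    """Return the checkpoint file name for a given number of training steps."""
    if steps not in CHECKPOINT_STEPS:
        raise ValueError(f"steps must be one of {CHECKPOINT_STEPS}, got {steps}")
    return f"hitokomoru-{steps}.ckpt"


def load_checkpoint(model_dir: str, steps: int = 20000):
    """Load a locally downloaded checkpoint as a diffusers pipeline.

    Assumes torch and diffusers are installed and that the file already
    exists in `model_dir`; nothing is downloaded here.
    """
    import torch
    from diffusers import StableDiffusionPipeline

    path = Path(model_dir) / checkpoint_filename(steps)
    return StableDiffusionPipeline.from_single_file(str(path), torch_dtype=torch.float16)


print(checkpoint_filename(10000))
```

Earlier checkpoints have seen less fine-tuning, so comparing the same prompt and seed across stages is a simple way to decide which one suits a given use.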

📦 Installation

This model can be used just like any other Stable Diffusion model. For more information, please have a look at the Stable Diffusion documentation.

You can also export the model to ONNX, MPS and/or FLAX/JAX.

💻 Usage Examples

Basic Usage

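from diffusers import StableDiffusionPipeline
import torch

# Load the weights from the Hugging Face Hub in half precision to save GPU memory.
model_id = "Linaqruf/hitokomoru-diffusion"
pipe = StableDiffusionPipeline.from_pretrained(model_id, torch_dtype=torch.float16)
pipe = pipe.to("cuda")  # float16 inference expects a CUDA GPU

# Danbooru tags work as prompts.
prompt = "hatsune_miku"
image = pipe(prompt).images[0]  # first image of the returned batch

image.save("./hatsune_miku.png")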

Advanced Usage

The following shows the prompt and settings for generating example images. You can adjust these parameters according to your needs to generate different images.

Anime Girl:

(masterpiece:1.05),illustration,beautiful detailed,colourful,finely detailed,dramatic light,intricate details,1 girl, 1990, 1980, hatsune miku

Negative prompt:
nsfw, lowres, bad anatomy, bad hands, text, error, missing fingers, extra digit, fewer digits, cropped, worst quality, low quality, normal quality, jpeg artifacts, signature, watermark, username, blurry,missing fingers,bad hands,missing arms, long neck, Humpbacked,shadow,long body, Abnormal fingers,

Steps: 32, Sampler: Euler, CFG scale: 10, Seed: 2319346364, Size: 512x768, Model hash: 2700c435, Batch size: 2, Batch pos: 0, Clip skip: 2

Anime Boy:

Authentic and detailed face(man:1.2763)(boymasterpiece:1.1025), (best quality:1.1025), (ultra-detailed:1.1025), (illustration:1.1025), (tousled hair:1.1025), (frill:0.907) , white cutter shirt, (one boy:1.05), (solo:1.05) chest, detailed wet clothes, empty stare, pants, (flowers:1.05), beautifully detailed sky, beautifully detailed water, leaves, detailed and beautiful sea

Negative prompt: 
(big breasts:1.2763)(breast:1.1025)}(woman:1.2155)} little girl,(3d:1.1576)(girl:1.629), nsfw, lowres, bad anatomy, bad hands, text, error, missing fingers, extra digit, fewer digits, cropped, worst quality, low quality, normal quality, jpeg artifacts, signature, watermark, username, blurry,missing fingers,bad hands,missing arms, long neck, Humpbacked

Steps: 40, Sampler: DPM++ 2M Karras, CFG scale: 7, Seed: 1870332858, Size: 512x768, Model hash: 2700c435, Batch size: 2, Batch pos: 0, Clip skip: 2
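
The settings lines above appear to be in the format written by Stable Diffusion web UIs. The sketch below shows one way they might be mapped onto diffusers arguments. The helper names are hypothetical, and several points are assumptions: the `clip_skip` argument needs a recent diffusers release, a web UI "Clip skip: 2" is taken to correspond to `clip_skip=1` in diffusers, and the same seed will generally not reproduce a web UI image exactly. Weighted prompt syntax such as (masterpiece:1.05) is a web UI feature and is not interpreted by a plain diffusers pipeline.

```python
def parse_settings(line: str) -> dict:
    """Parse a 'Key: value, Key: value' settings line into a dict of strings."""
    settings = {}
    for part in line.split(","):
        key, sep, value = part.partition(":")
        if sep:
            settings[key.strip()] = value.strip()
    return settings


def to_pipeline_kwargs(settings: dict) -> dict:
    """Translate parsed settings into keyword arguments for a pipeline call."""
    width, height = (int(v) for v in settings["Size"].split("x"))  # width x height
    if width % 8 or height % 8:
        raise ValueError("width and height must be divisible by 8")
    kwargs = {
        "num_inference_steps": int(settings["Steps"]),
        "guidance_scale": float(settings["CFG scale"]),
        "width": width,
        "height": height,
    }
    skip = int(settings.get("Clip skip", 1))
    if skip > 1:
        kwargs["clip_skip"] = skip - 1  # assumed web UI to diffusers offset
    return kwargs


def apply_sampler(pipe, sampler: str) -> None:
    """Swap the pipeline scheduler for the two samplers used above."""
    from diffusers import DPMSolverMultistepScheduler, EulerDiscreteScheduler

    if sampler == "Euler":
        pipe.scheduler = EulerDiscreteScheduler.from_config(pipe.scheduler.config)
    elif sampler == "DPM++ 2M Karras":
        pipe.scheduler = DPMSolverMultistepScheduler.from_config(
            pipe.scheduler.config, use_karras_sigmas=True
        )


def generate(pipe, prompt: str, negative_prompt: str, settings: dict):
    """Run an already loaded pipeline with the parsed settings and a fixed seed."""
    import torch

    apply_sampler(pipe, settings.get("Sampler", ""))
    generator = torch.Generator(device=pipe.device).manual_seed(int(settings["Seed"]))
    return pipe(prompt, negative_prompt=negative_prompt, generator=generator,
                **to_pipeline_kwargs(settings)).images[0]


girl_settings = parse_settings(
    "Steps: 32, Sampler: Euler, CFG scale: 10, Seed: 2319346364, Size: 512x768, "
    "Model hash: 2700c435, Batch size: 2, Batch pos: 0, Clip skip: 2"
)
print(to_pipeline_kwargs(girl_settings))
```

Only `parse_settings` and `to_pipeline_kwargs` run without a model; `generate` expects a pipeline loaded as in the basic usage example.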

📚 Documentation

Examples

Example images (Anime Girl and Anime Boy) were generated with each of the following checkpoints:

Hitokomoru-5000-pruned.ckpt
Hitokomoru-10000-pruned.ckpt
Hitokomoru-15000-pruned.ckpt
Hitokomoru-20000-pruned.ckpt

Dataset

You can find the datasets used to train this model, along with the last-state folder for resuming training, here

📄 License

This model is open access and available to all, with a CreativeML OpenRAIL-M license further specifying rights and usage. The CreativeML OpenRAIL-M license specifies:

You can't use the model to deliberately produce or share illegal or harmful outputs or content.
The authors claim no rights on the outputs you generate; you are free to use them and are accountable for their use, which must not go against the provisions set in the license.
You may redistribute the weights and use the model commercially and/or as a service. If you do, please be aware that you have to include the same use restrictions as the ones in the license and share a copy of the CreativeML OpenRAIL-M with all your users (please read the license entirely and carefully).

Please read the full license here

What's next?

Hitokomoru Diffusion V2 will be released soon. It will add scenery datasets from MidJourney. I hope it works well because I'm tired of the simple backgrounds in the current results.

Credit

ヒトこもる/Hitokomoru for Datasets
Just for my part

Big Thanks to

Kohya with their Kohya Trainer
Peeps on SD Training Labs Discord Server
ptsearch.info for prompt references
