Japanese Stable Diffusion XL: Open-Source Image Generation Model for Drawing Japanese-Style Images from Japanese Prompts

Japanese Stable Diffusion XL

Developed by Stability AI

A Japanese-specialized text-to-image generation model based on the SDXL architecture, capable of generating Japanese-style images from Japanese prompts

Text-to-Image | Japanese | Open Source License: Other | #Japanese text-to-image generation #Japanese style specialization #SDXL architecture optimization

Downloads 100

Release Time: 11/1/2023

Model Overview

Japanese Stable Diffusion XL is a Japanese-specialized model based on the SDXL architecture. It is fine-tuned parameter-efficiently with a Japanese-specific text encoder to improve its understanding of the Japanese language and of Japanese cultural expressions.

Model Features

Japanese Specialization

Uses a Japanese-specific text encoder with parameter-efficient fine-tuning to improve understanding of the Japanese language and of Japanese cultural expressions

High-Quality Image Generation

Based on the SDXL architecture, capable of generating high-quality Japanese-style images

Orthogonal Fine-Tuning Technology

Employs the Orthogonal Fine-Tuning (OFT) method for better results and training stability

Model Capabilities

Japanese text-to-image generation

Japanese-style image generation

Artistic creation

Use Cases

Artistic Creation

Artwork Generation

Generate Japanese-style artworks

High-quality Japanese-style images

Design Creation

Used for design creation such as illustrations, posters, etc.

Diverse design works

Educational Tools

Creative Tool Development

Develop educational or creative tools

Enhanced creative expression and learning experience

Research

Generative Model Research

Used for generative model-related research

Advancing generative model technologies

🚀 Japanese Stable Diffusion XL

A Japanese-specific SDXL model capable of generating Japanese-style images from Japanese prompts.

🚀 Quick Start

Note: for commercial use of this model, see https://stability.ai/license. For Japanese-language inquiries regarding commercial use, contact partners-jp@stability.ai.

✨ Features

Japanese Stable Diffusion XL (JSDXL) is a Japanese-specific SDXL model that accepts prompts written in Japanese and generates Japanese-style images.

💻 Usage Examples

Basic Usage

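from diffusers import DiffusionPipeline
import torch

# Load the model; trust_remote_code=True allows diffusers to run the
# custom pipeline code shipped in the model repository.
pipeline = DiffusionPipeline.from_pretrained(
    "stabilityai/japanese-stable-diffusion-xl", trust_remote_code=True
)
pipeline.to("cuda")  # requires a CUDA-capable GPU

# if using torch < 2.0
# pipeline.enable_xformers_memory_efficient_attention()

# Japanese prompt, roughly "Shiba Inu, colorful art"
prompt = "柴犬、カラフルアート"

# The pipeline returns a list of PIL images; take the first one.
image = pipeline(prompt=prompt).images[0]
image.save("output.png")  # illustrative file name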
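
When generating several images, the basic call above can be wrapped in a small helper. The following is a minimal sketch, not part of the original model card: the function names `needs_xformers` and `generate_images`, the `jsdxl_000.png` file naming, and the `outputs` directory are all hypothetical. It relies only on the call pattern shown above (`pipeline(prompt=...).images[0]`) and assumes the returned image object has a `save` method, as PIL images do.

```python
from pathlib import Path


def needs_xformers(torch_version):
    """Return True when the torch major version is below 2.

    Mirrors the "if using torch < 2.0" comment in the basic example.
    Accepts version strings such as "1.13.1+cu117" or "2.1.0".
    """
    major = int(torch_version.split(".")[0])
    return major < 2


def generate_images(pipeline, prompts, out_dir):
    """Run the pipeline once per prompt and save each first image as PNG.

    `pipeline` is any callable following the basic example's pattern:
    pipeline(prompt=...) returns an object whose `.images` is a list of
    images with a `.save(path)` method. File names are hypothetical.
    """
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    paths = []
    for i, prompt in enumerate(prompts):
        image = pipeline(prompt=prompt).images[0]
        path = out / f"jsdxl_{i:03d}.png"
        image.save(path)
        paths.append(path)
    return paths


# Possible usage with the real model (requires diffusers, torch, and a GPU):
#
#   import torch
#   from diffusers import DiffusionPipeline
#
#   pipeline = DiffusionPipeline.from_pretrained(
#       "stabilityai/japanese-stable-diffusion-xl", trust_remote_code=True
#   )
#   pipeline.to("cuda")
#   if needs_xformers(torch.__version__):
#       pipeline.enable_xformers_memory_efficient_attention()
#   generate_images(pipeline, ["柴犬、カラフルアート"], "outputs")
```

Keeping the pipeline as a parameter means the helper can be exercised with a stub object, without downloading the model.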

📚 Documentation

Model Details

Property	Details
Developed by	Stability AI
Model Type	Diffusion-based text-to-image generative model
Model Description	This model is fine-tuned from SDXL 1.0. To maximize understanding of the Japanese language and of Japanese culture and expressions while preserving the versatility of the pre-trained model, we performed parameter-efficient fine-tuning (PEFT) using one Japanese-specific compatible text encoder. As the PEFT method, we applied Orthogonal Fine-tuning (OFT) for better results and training stability.
License	STABILITY AI COMMUNITY LICENSE
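
The model card does not describe how OFT was configured for this model. As a general illustration of the idea only, the sketch below shows the core of orthogonal fine-tuning in two dimensions: the pre-trained weight matrix is left-multiplied by a learned orthogonal matrix, here produced from a single parameter `a` by the Cayley transform R = (I + Q)(I - Q)^-1 with Q skew-symmetric. All names (`cayley_rotation`, `matmul`, `angle_between`, the example matrix `W`) are hypothetical, and practical OFT implementations use much larger, typically block-diagonal, orthogonal matrices.

```python
import math


def cayley_rotation(a):
    """2x2 orthogonal matrix R = (I + Q)(I - Q)^-1 for Q = [[0, -a], [a, 0]].

    In closed form this is a rotation with cos = (1 - a^2) / (1 + a^2)
    and sin = 2a / (1 + a^2). At a = 0 it is the identity, so training
    would start from the unchanged pre-trained weights.
    """
    d = 1.0 + a * a
    c, s = (1.0 - a * a) / d, 2.0 * a / d
    return [[c, -s], [s, c]]


def matmul(A, B):
    """Plain-Python matrix product of two nested-list matrices."""
    return [
        [sum(A[i][k] * B[k][j] for k in range(len(B))) for j in range(len(B[0]))]
        for i in range(len(A))
    ]


def column(M, j):
    """Return column j of M as a list (one 'neuron' weight vector)."""
    return [row[j] for row in M]


def angle_between(u, v):
    """Angle in radians between two 2-D vectors."""
    dot = sum(x * y for x, y in zip(u, v))
    return math.acos(dot / (math.hypot(*u) * math.hypot(*v)))


# Hypothetical pre-trained weights; each column is one neuron's weight vector.
W = [[1.0, 0.5],
     [0.0, 2.0]]

R = cayley_rotation(0.3)   # the only trainable parameter in this toy case
W_tuned = matmul(R, W)     # fine-tuned weights: rotate every neuron by R

before = angle_between(column(W, 0), column(W, 1))
after = angle_between(column(W_tuned, 0), column(W_tuned, 1))
print(abs(before - after) < 1e-9)
```

Because R is orthogonal, the angle between any two neurons and the norm of each neuron are unchanged. This kind of constraint is a common explanation for why orthogonal fine-tuning tends to preserve what a pre-trained model already knows and to train stably.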

Uses

Direct Use

Commercial use: for commercial usage of this model, please see https://stability.ai/license. For Japanese inquiries regarding commercial use, please contact partners-jp@stability.ai.

Research: possible research areas/tasks include:

Generation of artworks and use in design and other artistic processes.
Applications in educational or creative tools.
Research on generative models.
Safe deployment of models which have the potential to generate harmful content.
Probing and understanding the limitations and biases of generative models.

Excluded uses are described below.

Out-of-Scope Use

The model was not trained to produce factual or true representations of people or events, so using it to generate such content is beyond its abilities and out of scope.

Limitations and Bias

Limitations

The model does not achieve perfect photorealism.
The model cannot render legible text.
The model struggles with more difficult tasks which involve compositionality, such as rendering an image corresponding to “A red cube on top of a blue sphere”.
Faces and people in general may not be generated properly.
The autoencoding part of the model is lossy.

Bias

While the capabilities of image generation models are impressive, they can also reinforce or exacerbate social biases.

How to cite

@misc{JSDXL, 
    url    = {https://huggingface.co/stabilityai/japanese-stable-diffusion-xl},
    title  = {Japanese Stable Diffusion XL}, 
    author = {Shing, Makoto and Akiba, Takuya and Chi, Jerry}
}

Contact

For questions and comments about the model, please join Stable Community Japan.
For future announcements / information about Stability AI models, research, and events, please follow https://twitter.com/StabilityAI_JP.
For business and partnership inquiries, please contact partners-jp@stability.ai.

📄 License

The model is licensed under the STABILITY AI COMMUNITY LICENSE.
