🚀 Japanese Stable Diffusion XL
A Japanese-specific SDXL model capable of generating Japanese-style images from Japanese prompts.
🚀 Quick Start
Please note: for commercial usage of this model, please see https://stability.ai/license.
For Japanese inquiries regarding commercial use, please contact sales-jp@stability.ai.
✨ Features
Japanese Stable Diffusion XL (JSDXL) is a Japanese-specific SDXL model that is capable of inputting prompts in Japanese and generating Japanese-style images.
📦 Installation
No specific installation steps provided in the original document.
💻 Usage Examples
Basic Usage
from diffusers import DiffusionPipeline
import torch
pipeline = DiffusionPipeline.from_pretrained(
"stabilityai/japanese-stable-diffusion-xl", trust_remote_code=True
)
pipeline.to("cuda")
prompt = "柴犬、カラフルアート"
image = pipeline(prompt=prompt).images[0]
📚 Documentation
Model Details
Property |
Details |
Developed by |
Stability AI |
Model Type |
Diffusion-based text-to-image generative model |
Model Description |
This model is a fine-tuned model based on SDXL 1.0. In order to maximize the understanding of the Japanese language and Japanese culture/expressions while preserving the versatility of the pre-trained model, we performed a PEFT training using one Japanese-specific compatible text encoder. As a PEFT method, we applied Orthogonal Fine-tuning (OFT) for better results and training stability. |
License |
STABILITY AI COMMUNITY LICENSE |
Uses
Direct Use
Commercial use: for commercial usage of this model, please see https://stability.ai/license.
For Japanese inquiries regarding commercial use, please contact partners-jp@stability.ai.
Research: possible research areas/tasks include:
- Generation of artworks and use in design and other artistic processes.
- Applications in educational or creative tools.
- Research on generative models.
- Safe deployment of models which have the potential to generate harmful content.
- Probing and understanding the limitations and biases of generative models.
Excluded uses are described below.
Out-of-Scope Use
The model was not trained to be factual or true representations of people or events, and therefore using the model to generate such content is out-of-scope for the abilities of this model.
Limitations and Bias
Limitations
- The model does not achieve perfect photorealism.
- The model cannot render legible text.
- The model struggles with more difficult tasks which involve compositionality, such as rendering an image corresponding to “A red cube on top of a blue sphere”.
- Faces and people in general may not be generated properly.
- The autoencoding part of the model is lossy.
Bias
While the capabilities of image generation models are impressive, they can also reinforce or exacerbate social biases.
How to cite
@misc{JSDXL,
url = {[https://huggingface.co/stabilityai/japanese-stable-diffusion-xl](https://huggingface.co/stabilityai/japanese-stable-diffusion-xl)},
title = {Japanese Stable Diffusion XL},
author = {Shing, Makoto and Akiba, Takuya and Chi, Jerry}
}
Contact
- For questions and comments about the model, please join Stable Community Japan.
- For future announcements / information about Stability AI models, research, and events, please follow https://twitter.com/StabilityAI_JP.
- For business and partnership inquiries, please contact partners-jp@stability.ai.
📄 License
The model is licensed under the STABILITY AI COMMUNITY LICENSE.