Paint Journey V2 Open-source Text-to-Image Model - Free Support for Multi-resolution Oil Painting Style Art Creation

Paint Journey V2

Developed by FredZhang7

Paint Journey V2 is an improved text-to-image model based on the V1 version, focusing on oil painting styles and high-quality art creation, supporting multiple resolution outputs.

Image Generation
Supports Multiple Languages
Open Source License: OpenRAIL
#Oil painting style generation #High-resolution art creation #Multi-style fusion

Downloads 60

Release Time : 1/3/2023

Model Overview

This model is fine-tuned from Paint Journey V1 on 768x768 oil paintings from Midjourney V4, Open Journey V2, Disco Diffusion, and licensed artists. It excels at generating oil painting-style artworks.

Model Features

Oil painting style generation

Enable oil painting effects by adding ((oil painting)) at the beginning of the prompt, generating artworks with oil painting textures.

High-resolution support

Generates images at 768x768 or higher resolutions with reduced noise levels, which suits artistic work.

Diverse style fusion

The model achieves natural fusion of digital and oil painting styles through fine-tuned text encoders, producing more dynamic artworks.

Portrait generation capability

Generates portraits at 768x1136 resolution without duplicated faces when run through Camenduru's WebUI.

Model Capabilities

Text-to-image generation

Oil painting style creation

High-resolution image generation

Portrait painting generation

Landscape painting generation

Use Cases

Art creation

Portraits

Generate high-quality portrait paintings

Stunning portraits without repeated facial features at 768x1136 resolution

Natural landscapes

Generate oil painting-style landscape paintings

Exquisite landscape works at 1280x768 resolution

Sci-fi scenes

Generate artistic sci-fi scenes, such as outer space

Space landscapes at 1152x768 resolution

Commercial design

Product rendering

Generate artistic product renderings

Lamborghini renderings at 1280x768 resolution

Character design

Generate game or anime character designs

Eevee character designs at 960x768 resolution

🚀 Paint Journey V2

Paint Journey V2 is a fine-tuned model based on V1, trained on 768x768 oil paintings from Midjourney V4, Open Journey V2, Disco Diffusion, and authorized artists. It offers high-quality image generation in various styles.

📦 Installation

Camenduru's WebUI

git clone -b v1.6 https://github.com/camenduru/stable-diffusion-webui

Alternatively, use Automatic1111's WebUI, though its output may be less artistic:

git clone https://github.com/AUTOMATIC1111/stable-diffusion-webui.git

Download [checkpoint](./paint_journey_v2.ckpt) and [vae](./paint_journey_v2.vae.pt) to the `./stable-diffusion-webui/models/Stable-diffusion` folder. Run `webui-user.bat`.
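
Before launching the WebUI, it can help to confirm that both files landed in the right folder. The following is a minimal sketch, not part of the original instructions: the file names are taken from the download links above, and the helper name `missing_model_files` is hypothetical.

```python
from pathlib import Path

# File names taken from the download links above.
EXPECTED_FILES = ("paint_journey_v2.ckpt", "paint_journey_v2.vae.pt")


def missing_model_files(webui_root):
    """Return the expected model files that are absent from the WebUI model folder."""
    model_dir = Path(webui_root) / "models" / "Stable-diffusion"
    return [name for name in EXPECTED_FILES if not (model_dir / name).is_file()]
```

Calling `missing_model_files("./stable-diffusion-webui")` returns an empty list once both downloads are in place.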

Diffusers

pip install --upgrade diffusers transformers

🚀 Quick Start

Begin the prompt with ((oil painting)) to add the oil paint effect. For digital and other painting styles, use similar prompts as you would for Midjourney V4 (with some tweaks), Stable Diffusion v1.5 (add more styles), Open Journey V2, or Disco Diffusion.
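
As an illustration of the prefix rule above, here is a small sketch that adds the oil-paint tag only when a prompt does not already start with it. The helper name `with_oil_paint` is hypothetical and not part of the model's tooling.

```python
OIL_TAG = "((oil painting))"


def with_oil_paint(prompt):
    """Prefix the prompt with the oil-paint tag unless it already starts with it."""
    prompt = prompt.strip()
    if prompt.startswith(OIL_TAG):
        return prompt
    return f"{OIL_TAG}, {prompt}"


print(with_oil_paint("gentle waves, bright blue sky"))
# prints: ((oil painting)), gentle waves, bright blue sky
```

Prompts meant for digital or other painting styles can simply skip the helper.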

💻 Usage Examples

Basic Usage

# see more sampling algorithms at https://huggingface.co/docs/diffusers/using-diffusers/schedulers#changing-the-scheduler

from datetime import datetime
import random

import torch
from diffusers import StableDiffusionPipeline, EulerAncestralDiscreteScheduler

pipe = StableDiffusionPipeline.from_pretrained("FredZhang7/paint-journey-v2")
pipe.scheduler = EulerAncestralDiscreteScheduler.from_config(pipe.scheduler.config)
pipe = pipe.to("cuda")

def random_seed():
    return random.randint(0, 2**32 - 1)


prompt = "((oil painting)), gentle waves, bright blue sky, white sails billowing, sun glistening on the surface, salty sea air, distant horizon, calm breeze, birds soaring overhead, vibrant colors, artstation digital painting, high resolution, uhd, 4 k, 8k wallpaper"   # what you want to see
negative_prompt = "low-res, blurry, haze, dark clouds looming, choppy waves, engine failing, sails tattered, stormy winds"   # what you don't want to see (one string, matching the single prompt)
seed = random_seed()               # replace with the desired seed if needed
width, height = 1280, 768          # width and height of the generated image
cfg_scale = 7.5                    # classifier-free guidance scale, smaller means more creative, 7 to 11 is usually a good range
num_inference_steps = 40           # sampling steps, 30 to 40 is usually good for Euler Ancestral


generator = torch.Generator("cuda").manual_seed(seed)
with torch.autocast("cuda"):
    image = pipe(prompt=prompt,
                 negative_prompt=negative_prompt,
                 num_inference_steps=num_inference_steps,
                 width=width, height=height,
                 generator=generator,
                 guidance_scale=cfg_scale).images[0]

def generate_filename(string, seed, max_len=100):
    # remove characters that are invalid in file names
    invalid_chars = ["<", ">", ":", '"', "/", "\\", "|", "?", "*"]
    for char in invalid_chars:
        string = string.replace(char, "")
    # truncate the prompt so the name stays under common 255-character file name limits
    return f"{datetime.now().strftime('%Y-%m-%d_%H-%M-%S')}_{seed}_{string[:max_len]}"

image.save(f"./{generate_filename(prompt, seed)}.png")
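
When the negative terms are kept as a list for easier editing, they need to be joined back into one string before the call: with a single prompt string, diffusers treats a list of negative prompts as a batch and rejects it if the sizes differ. The sketch below shows one way to do that. The helper name `join_terms` is hypothetical, and the de-duplication step is an added convenience rather than something the model requires.

```python
def join_terms(terms):
    """Join prompt terms into one comma-separated string, dropping blanks and repeats."""
    seen = set()
    kept = []
    for term in terms:
        term = term.strip()
        key = term.lower()  # compare case-insensitively so "Blurry" and "blurry" count once
        if term and key not in seen:
            seen.add(key)
            kept.append(term)
    return ", ".join(kept)


negative_prompt = join_terms(["low-res", "blurry", "haze", "blurry "])
print(negative_prompt)
# prints: low-res, blurry, haze
```

The resulting string can be passed directly as `negative_prompt` to the pipeline.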

✨ Features

Style Blending: Paint Journey V2 can seamlessly blend digital and oil painting styles into various other types of prompts, resulting in a more natural and dynamic output.
High-Resolution Output: It can generate 768x768 or higher resolution images with reduced noise levels.
Portrait Generation: It can generate stunning portraits at 768x1136 resolution without duplicated faces using Camenduru's WebUI.

📚 Documentation

Examples

All examples were generated using Camenduru's WebUI (see the Colab file)

🎨 768x1136 portraits, generated using descriptive prompts and without face restoration, generation parameters

🎨 1280x768 (mostly) natural landscapes, used shorter prompts, generation parameters

🎨 1152x768 outerspace landscapes, used descriptive prompts, generation parameters

🎨 1280x768 lamborghini, generation parameters

🎨 960x768 Eevee, generation parameters

Comparisons

Paint Journey V2's paintings are closer to human-drawn art than Open Journey V2's.
Compared to models like Dreamlike Diffusion 1.0, PJ V2 tends to generate 768x768 or higher resolution images with reduced noise levels.
At lower resolutions, DreamShaper 3.3 tends to generate higher quality portraits than PJ V2 in terms of noise levels, given the same (short) positive and negative prompts. However, PJ V2 can craft more stunning masterpieces with more descriptive positive and negative prompts and can still generate beautiful landscapes with shorter prompts.

Training

Instead of fine-tuning only its UNet, Paint Journey V2 focuses on fine-tuning its text encoder with a diverse range of prompts. The model was trained on a curated dataset of roughly 300 images hand-picked from Midjourney, Prompt Hero, PixaBay, Open Journey V2, and Reddit. Before training, R-ESRGAN 4x was used on many images to increase their resolution and reduce noise.

Output Dimensions

Portrait sizes include, but are not limited to, 512x768, 768x768, and 768x1136.
Landscape sizes include, but are not limited to, 768x512, 768x768, 1152x768, and 1280x768.
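
For sizes outside the lists above, a quick check can save a failed run. This sketch assumes the usual Stable Diffusion constraint in diffusers that width and height must be divisible by 8; that constraint is not stated in this document, and the helper names are hypothetical.

```python
# Sizes listed in this section, as (width, height).
PORTRAIT_SIZES = [(512, 768), (768, 768), (768, 1136)]
LANDSCAPE_SIZES = [(768, 512), (768, 768), (1152, 768), (1280, 768)]


def is_valid_size(width, height, multiple=8):
    """Check that both dimensions are positive and divisible by `multiple`."""
    return width > 0 and height > 0 and width % multiple == 0 and height % multiple == 0


def snap_to_multiple(value, multiple=8):
    """Round a dimension down to the nearest multiple, never below one multiple."""
    return max(multiple, (value // multiple) * multiple)


print(all(is_valid_size(w, h) for w, h in PORTRAIT_SIZES + LANDSCAPE_SIZES))
print(snap_to_multiple(1130))
```

Every size listed in this section passes the check, so the helpers mainly matter for custom dimensions.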

Useful Resources

Running out of prompts? Use these resources: Lexica.art, Fast GPT PromptGen, Prompt Hero

Safety Checker V2

The official Stable Diffusion safety checker uses 1.22GB of VRAM. I recommend using Google Safesearch Mini V2 (220MB) instead to save 1.0GB of VRAM.
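
The saving quoted above follows from simple subtraction. The sketch below assumes decimal units (1GB = 1000MB) and takes the 220MB figure as the replacement checker's VRAM footprint, which the text implies but does not state outright.

```python
official_checker_gb = 1.22          # VRAM used by the official safety checker
safesearch_mini_gb = 220 / 1000     # 220MB expressed in GB (decimal units assumed)

saved_gb = official_checker_gb - safesearch_mini_gb
print(f"{saved_gb:.2f} GB saved")
# prints: 1.00 GB saved
```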

📄 License

This project is licensed under the creativeml-openrail-m license.
