đ Flux.1-Dev-Ctoon-LoRA
This LoRA model is designed for text - to - image generation, aiming to create high - quality cartoon - style images. It is currently in the training phase, and the final version may offer better performance and fewer artifacts.
đ Quick Start
Setting Up
import torch
from pipelines import DiffusionPipeline
base_model = "black-forest-labs/FLUX.1-dev"
pipe = DiffusionPipeline.from_pretrained(base_model, torch_dtype=torch.bfloat16)
lora_repo = "prithivMLmods/Flux.1-Dev-Ctoon-LoRA"
trigger_word = "ctoon"
pipe.load_lora_weights(lora_repo)
device = torch.device("cuda")
pipe.to(device)
Trigger words
You should use ctoon
to trigger the image generation.
⨠Features
- Text - to - Image Generation: Convert text descriptions into cartoon - style images.
- LoRA Integration: Utilize the LoRA technique to fine - tune the base model for specific styles.
đĻ Installation
The weights for this model are available in Safetensors format. You can Download them in the Files & versions tab.
đģ Usage Examples
Here are some examples of using the model:
- Input: 'ctoon, A cartoon drawing of a white cat with a black sunglasses around its eyes. The cats eyes are squinted and the cats ears are a light pink color. There are two yellow stars on the wall to the right of the cat.'
- Input: 'ctoon, A cartoon drawing of a white penguin sitting on a red chair. The penguin has a yellow beak and yellow feet. The background of the penguin is a bright blue color.'
- Input: 'ctoon, A cartoon drawing of a cat wearing sunglasses. The cat is facing towards the left side of the image. It is holding a wine glass with wine in it. There is a bottle of wine to the right of the cat. There are two stars to the left of the animal. The background is a light beige color.'
đ Documentation
Model description
prithivMLmods/Flux.1-Dev-Ctoon-LoRA
Image Processing Parameters
Parameter |
Value |
Parameter |
Value |
LR Scheduler |
constant |
Noise Offset |
0.03 |
Optimizer |
AdamW |
Multires Noise Discount |
0.1 |
Network Dim |
64 |
Multires Noise Iterations |
10 |
Network Alpha |
32 |
Repeat & Steps |
20 & 2700 |
Epoch |
15 |
Save Every N Epochs |
1 |
Labeling: florence2 - en(natural language & English)
Total Images Used for Training : 22
Best Dimensions
- 768 x 1024 (Best)
- 1024 x 1024 (Default)
Important Note
â ī¸ Important Note
The model is still in the training phase. This is not the final version and may contain artifacts and perform poorly in some cases.
đ License
The model is released under the creativeml-openrail-m
license.