Ultralytics
U
Ultralytics
Developed by MidnightRunner
This project integrates Ultralytics models into ComfyUI, facilitating users to perform operations such as object detection.
Downloads 1,247
Release Time : 4/26/2025
Model Overview
The main function of this project is to integrate models such as YOLOv8 and YOLO11 of Ultralytics into the ComfyUI workflow, providing convenient object detection and segmentation capabilities.
Model Features
ComfyUI Integration
Seamlessly integrate Ultralytics models into the ComfyUI workflow, providing a visual operation interface
Multi-model Support
Support multiple Ultralytics models such as YOLOv8 and YOLO11
Secure Loading Mechanism
Provide a description of the secure loading mechanism to prevent potential security risks
Model Capabilities
Object Detection
Image Segmentation
Bounding Box Detection
Face Detection
Anime Image Segmentation
Use Cases
Computer Vision
Object Detection
Detect objects in the image and mark the bounding boxes
Evaluation metrics such as mAP50 and mAP50-95
Image Segmentation
Perform semantic segmentation on the image
Anime Image Processing
Anime Character Segmentation
Segment the character area from the anime image
Featured Recommended AI Models
Qwen2.5 VL 7B Abliterated Caption It I1 GGUF
Apache-2.0
Quantized version of Qwen2.5-VL-7B-Abliterated-Caption-it, supporting multilingual image description tasks.
Image-to-Text
Transformers Supports Multiple Languages

Q
mradermacher
167
1
Nunchaku Flux.1 Dev Colossus
Other
The Nunchaku quantized version of the Colossus Project Flux, designed to generate high-quality images based on text prompts. This model minimizes performance loss while optimizing inference efficiency.
Image Generation English
N
nunchaku-tech
235
3
Qwen2.5 VL 7B Abliterated Caption It GGUF
Apache-2.0
This is a static quantized version based on the Qwen2.5-VL-7B model, focusing on image captioning generation tasks and supporting multiple languages.
Image-to-Text
Transformers Supports Multiple Languages

Q
mradermacher
133
1
Olmocr 7B 0725 FP8
Apache-2.0
olmOCR-7B-0725-FP8 is a document OCR model based on the Qwen2.5-VL-7B-Instruct model. It is fine-tuned using the olmOCR-mix-0225 dataset and then quantized to the FP8 version.
Image-to-Text
Transformers English

O
allenai
881
3
Lucy 128k GGUF
Apache-2.0
Lucy-128k is a model developed based on Qwen3-1.7B, focusing on proxy-based web search and lightweight browsing, and can run efficiently on mobile devices.
Large Language Model
Transformers English

L
Mungert
263
2
Š 2025AIbase