Birefnet-portrait Open-source Image Segmentation Model - Applicable to Multiple Tasks, Achieving Precise Segmentation of High-resolution Images

Birefnet Portrait

Developed by aigchacker

BiRefNet is a high-resolution binary image segmentation model that employs a bilateral reference framework and demonstrates excellent performance across multiple image segmentation tasks.

Image Segmentation

Safetensors

#High-resolution image segmentation #Bilateral reference framework #Camouflaged object detection

Downloads 4,021

Release Time : 7/30/2024

Model Overview

BiRefNet is a deep learning model specialized in high-resolution binary image segmentation, designed with a bilateral reference framework. It effectively handles tasks such as background removal and mask generation, excelling in camouflaged object detection and salient object detection.

Model Features

High-resolution processing capability

Specially designed for segmentation tasks involving high-resolution images

Bilateral reference framework

Innovative bilateral reference framework design that enhances segmentation accuracy

Multi-task applicability

Demonstrates excellent performance in various tasks including background removal, mask generation, and camouflaged object detection

Model Capabilities

Image segmentation

Background removal

Mask generation

Camouflaged object detection

Salient object detection

Use Cases

Image editing

Portrait background removal

Precisely separates subjects from backgrounds in portrait photos

Generates high-quality portraits with transparent backgrounds

Product image processing

Background removal and subject extraction for e-commerce product images

Produces clean product display images

Computer vision

Camouflaged object detection

Detects objects that blend into their environment

Improves recognition rates of concealed targets

Salient object detection

Identifies the most attention-grabbing regions in an image

Accurately marks salient regions

🚀 BiRefNet

BiRefNet is a model for high - resolution dichotomous image segmentation. It has achieved SOTA performance on tasks such as image segmentation, salient object detection, and camouflaged object detection.

🚀 Quick Start

0. Install Packages:

pip install -qr https://raw.githubusercontent.com/ZhengPeng7/BiRefNet/main/requirements.txt

1. Load BiRefNet:

Use codes + weights from HuggingFace

Only use the weights on HuggingFace -- Pro: No need to download BiRefNet codes manually; Con: Codes on HuggingFace might not be latest version (I'll try to keep them always latest).

# Load BiRefNet with weights
from transformers import AutoModelForImageSegmentation
birefnet = AutoModelForImageSegmentation.from_pretrained('zhengpeng7/BiRefNet-portrait', trust_remote_code=True)

Use codes from GitHub + weights from HuggingFace

Only use the weights on HuggingFace -- Pro: codes are always latest; Con: Need to clone the BiRefNet repo from my GitHub.

# Download codes
git clone https://github.com/ZhengPeng7/BiRefNet.git
cd BiRefNet

# Use codes locally
from models.birefnet import BiRefNet

# Load weights from Hugging Face Models
birefnet = BiRefNet.from_pretrained('zhengpeng7/BiRefNet-portrait')

Use codes from GitHub + weights from HuggingFace

Only use the weights and codes both locally.

# Use codes and weights locally
import torch
from utils import check_state_dict

birefnet = BiRefNet(bb_pretrained=False)
state_dict = torch.load(PATH_TO_WEIGHT, map_location='cpu')
state_dict = check_state_dict(state_dict)
birefnet.load_state_dict(state_dict)

Use the loaded BiRefNet for inference

# Imports
from PIL import Image
import matplotlib.pyplot as plt
import torch
from torchvision import transforms
from models.birefnet import BiRefNet

birefnet = ... # -- BiRefNet should be loaded with codes above, either way.
torch.set_float32_matmul_precision(['high', 'highest'][0])
birefnet.to('cuda')
birefnet.eval()

def extract_object(birefnet, imagepath):
    # Data settings
    image_size = (1024, 1024)
    transform_image = transforms.Compose([
        transforms.Resize(image_size),
        transforms.ToTensor(),
        transforms.Normalize([0.485, 0.456, 0.406], [0.229, 0.224, 0.225])
    ])

    image = Image.open(imagepath)
    input_images = transform_image(image).unsqueeze(0).to('cuda')

    # Prediction
    with torch.no_grad():
        preds = birefnet(input_images)[-1].sigmoid().cpu()
    pred = preds[0].squeeze()
    pred_pil = transforms.ToPILImage()(pred)
    mask = pred_pil.resize(image.size)
    image.putalpha(mask)
    return image, mask

# Visualization
plt.axis("off")
plt.imshow(extract_object(birefnet, imagepath='PATH-TO-YOUR_IMAGE.jpg')[0])
plt.show()

✨ Features

Multiple Application Scenarios: BiRefNet can be applied to various image segmentation tasks, including background removal, mask generation, dichotomous image segmentation, camouflaged object detection, and salient object detection.
SOTA Performance: It has achieved state - of - the - art performance on three tasks (DIS, HRSOD, and COD).

📚 Documentation

This repo is the official implementation of "Bilateral Reference for High - Resolution Dichotomous Image Segmentation" (CAAI AIR 2024).

Visit our GitHub repo: https://github.com/ZhengPeng7/BiRefNet for more details -- codes, docs, and model zoo!

This repo contains the weights of BiRefNet proposed in our paper, which has achieved the SOTA performance on three tasks (DIS, HRSOD, and COD).

Go to my GitHub page for BiRefNet codes and the latest updates: https://github.com/ZhengPeng7/BiRefNet :)

Try our online demos for inference:

Online Single Image Inference on Colab:
Online Inference with GUI on Hugging Face with adjustable resolutions:
Inference and evaluation of your given weights:

📄 License

This project is under the MIT license. You can check the details here.

Acknowledgement:

Many thanks to @fal for their generous support on GPU resources for training better BiRefNet models.
Many thanks to @not - lain for his help on the better deployment of our BiRefNet model on HuggingFace.

Citation

@article{BiRefNet,
  title={Bilateral Reference for High-Resolution Dichotomous Image Segmentation},
  author={Zheng, Peng and Gao, Dehong and Fan, Deng-Ping and Liu, Li and Laaksonen, Jorma and Ouyang, Wanli and Sebe, Nicu},
  journal={CAAI Artificial Intelligence Research},
  year={2024}
}

Authors

Peng Zheng^1,4,5,6, Dehong Gao², Deng-Ping Fan^1*, Li Liu³, Jorma Laaksonen⁴, Wanli Ouyang⁵, Nicu Sebe⁶

¹Nankai University ²Northwestern Polytechnical University ³National University of Defense Technology ⁴Aalto University ⁵Shanghai AI Laboratory ⁶University of Trento