ai-vs-human-image-detector: an open-source image detector that identifies high-quality AI-generated images with high accuracy

Ai Vs Human Image Detector

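Developed by Ateeqq

This model was fine-tuned on 120,000 images (60,000 AI-generated and 60,000 human-created) and is designed specifically to detect high-quality AI-generated images.

Image Classification

Transformers

Other: #AI image detection #high-accuracy classification #multi-model recognition

Downloads: 1,299

Release date: 3/30/2025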

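Model Overview

This model distinguishes AI-generated images, including those produced by recent generators such as Midjourney v6.1, Flux 1.1 Pro, Stable Diffusion 3.5, and GPT-4o, from real images created by humans.

Model Features

High accuracy

Achieves 99.23% accuracy on the test set when separating AI-generated images from human-created ones.

Broad compatibility

Supports detection of images generated by a range of current AI models, including Midjourney, Stable Diffusion, and GPT-4o.

Efficient training

Training finished in about 2 hours 39 minutes at 69.053 samples per second.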

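Model Capabilities

AI-generated image detection

Human-created image recognition

Image classification

Use Cases

Content moderation

AI-generated content identification

Identify AI-generated images on social media platforms.

Accuracy: 99.23% (test set)

Digital forensics

Image authenticity verification

Help verify whether news photos, evidentiary images, and similar material are AI-generated.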

🚀 AI vs. Human Image Classification Model

This model was fine-tuned on 60,000 AI-generated images and 60,000 human-created images. It is aimed at detecting high-quality images from recent generators such as Midjourney v6.1, Flux 1.1 Pro, Stable Diffusion 3.5, and GPT-4o.

The detailed training code is available here: [blog/ai/fine-tuning-siglip2](https://exnrt.com/blog/ai/fine-tuning-siglip2/)

🚀 Quick Start

📦 Installation

pip install -q transformers torch Pillow accelerate
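
After installing, it can help to confirm which versions were actually picked up. The sketch below uses only the standard library's `importlib.metadata`; the helper name `installed_version` is hypothetical, and the package list simply mirrors the `pip install` line above.

```python
from importlib import metadata


def installed_version(package: str):
    """Return the installed version string, or None if the package is missing."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


# Package names taken from the pip command above.
for name in ("transformers", "torch", "Pillow", "accelerate"):
    version = installed_version(name)
    print(f"{name}: {version if version else 'NOT INSTALLED'}")
```

Any package reported as missing needs to be installed before running the usage example.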

💻 Usage Example

Basic usage

import torch
from PIL import Image as PILImage
from transformers import AutoImageProcessor, SiglipForImageClassification

MODEL_IDENTIFIER = r"Ateeqq/ai-vs-human-image-detector"

# Device: Use GPU if available, otherwise CPU
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
print(f"Using device: {device}")

# Load Model and Processor
try:
    print(f"Loading processor from: {MODEL_IDENTIFIER}")
    processor = AutoImageProcessor.from_pretrained(MODEL_IDENTIFIER)

    print(f"Loading model from: {MODEL_IDENTIFIER}")
    model = SiglipForImageClassification.from_pretrained(MODEL_IDENTIFIER)
    model.to(device)
    model.eval()
    print("Model and processor loaded successfully.")

except Exception as e:
    print(f"Error loading model or processor: {e}")
    raise SystemExit(1)  # non-zero exit status so callers can detect the failure

# Load and Preprocess the Image

IMAGE_PATH = "/content/images.jpg"  # replace with the path to your own image
try:
    print(f"Loading image: {IMAGE_PATH}")
    image = PILImage.open(IMAGE_PATH).convert("RGB")
except FileNotFoundError:
    print(f"Error: Image file not found at {IMAGE_PATH}")
    raise SystemExit(1)
except Exception as e:
    print(f"Error opening image: {e}")
    raise SystemExit(1)

print("Preprocessing image...")
# Use the processor to prepare the image for the model
inputs = processor(images=image, return_tensors="pt").to(device)

# Perform Inference
print("Running inference...")
with torch.no_grad(): # Disable gradient calculations for inference
    outputs = model(**inputs)
    logits = outputs.logits

# Interpret the Results
# Get the index of the highest logit score -> this is the predicted class ID
predicted_class_idx = logits.argmax(-1).item()

# Use the model's config to map the ID back to the label string ('ai' or 'hum')
predicted_label = model.config.id2label[predicted_class_idx]

# Optional: Get probabilities using softmax
probabilities = torch.softmax(logits, dim=-1)
predicted_prob = probabilities[0, predicted_class_idx].item()

print("-" * 30)
print(f"Image: {IMAGE_PATH}")
print(f"Predicted Label: {predicted_label}")
print(f"Confidence Score: {predicted_prob:.4f}")
print("-" * 30)

# You can also print the scores for all classes:
print("Scores per class:")
for i, label in model.config.id2label.items():
    print(f"  - {label}: {probabilities[0, i].item():.4f}")

Example output

Using device: cpu
Model and processor loaded successfully.
Loading image: /content/images.jpg
Preprocessing image...
Running inference...
------------------------------
Image: /content/images.jpg
Predicted Label: ai
Confidence Score: 0.9996
------------------------------
Scores per class:
  - ai: 0.9996
  - hum: 0.0004
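
The confidence score in the output above is a softmax probability over the two class logits. As a rough illustration of that step without loading the model, the sketch below reimplements softmax in plain Python and applies it to made-up logits (the values are hypothetical, chosen only so the result resembles the sample output). The `label_with_threshold` helper and its 0.9 cutoff are likewise assumptions, not part of the model.

```python
import math


def softmax(logits):
    """Convert raw logits to probabilities that sum to 1."""
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def label_with_threshold(probs, labels, threshold=0.9):
    """Return the top label, or 'uncertain' if its probability is below threshold."""
    best = max(range(len(probs)), key=probs.__getitem__)
    return labels[best] if probs[best] >= threshold else "uncertain"


labels = ["ai", "hum"]        # same label strings as in the sample output
example_logits = [3.9, -3.9]  # hypothetical logits, NOT real model output
probs = softmax(example_logits)

for label, p in zip(labels, probs):
    print(f"{label}: {p:.4f}")
print("Decision:", label_with_threshold(probs, labels))
```

A cutoff like this is one way to route borderline images to manual review instead of trusting the argmax alone; the right value depends on the application.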

📚 Documentation

🔍 Evaluation Metrics

Training Results

🏋️‍♂️ Training Metrics

Epochs: 5.0
Total FLOPs: 51,652,280,821 GF
Training loss: 0.0799
Training time: 2:39:49.46
Training samples per second: 69.053
Training steps per second: 4.316
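
The throughput figures above also let you back out a couple of numbers the card does not state directly. The sketch below is simple arithmetic on the reported values. It assumes samples per second and steps per second are averages over the same run, so their ratio approximates the number of samples per training step. The helper name `hms_to_seconds` is hypothetical.

```python
def hms_to_seconds(hms: str) -> float:
    """Convert an 'H:MM:SS.ss' string to seconds."""
    hours, minutes, seconds = hms.split(":")
    return int(hours) * 3600 + int(minutes) * 60 + float(seconds)


train_runtime_s = hms_to_seconds("2:39:49.46")  # training time reported above
samples_per_second = 69.053
steps_per_second = 4.316

# Ratio of the two rates: samples consumed per training step.
samples_per_step = samples_per_second / steps_per_second
# Rate times duration: approximate number of training steps in the run.
total_steps = steps_per_second * train_runtime_s

print(f"Runtime: {train_runtime_s:.2f} s")
print(f"Samples per step: {samples_per_step:.2f}")
print(f"Approximate total steps: {total_steps:.0f}")
```

With these figures the ratio comes out at roughly 16 samples per step, which may reflect the effective batch size. That is an inference from the numbers, not something the card states.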

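📊 Evaluation Metrics (fine-tuned model on the test set)

Epochs: 5.0
Evaluation accuracy: 0.9923
Evaluation loss: 0.0551
Evaluation time: 0:02:35.78
Evaluation samples per second: 212.533
Evaluation steps per second: 6.644

🔦 Prediction Metrics (test set)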

{
  "test_loss": 0.05508904904127121,
  "test_accuracy": 0.9923283699296264,
  "test_runtime": 167.1844,
  "test_samples_per_second": 198.039,
  "test_steps_per_second": 6.191
}
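
To get a feel for what these prediction metrics imply, the sketch below parses the JSON and derives an approximate test-set size and misclassification count. It assumes `test_samples_per_second` is the number of test samples divided by `test_runtime`, which is how the Hugging Face `Trainer` typically reports speed metrics. The derived counts are estimates, not figures published by the author.

```python
import json

# Prediction metrics copied from the block above.
metrics_json = """
{
  "test_loss": 0.05508904904127121,
  "test_accuracy": 0.9923283699296264,
  "test_runtime": 167.1844,
  "test_samples_per_second": 198.039,
  "test_steps_per_second": 6.191
}
"""
metrics = json.loads(metrics_json)

# samples/second * seconds ~= number of test samples (assumption, see lead-in).
n_test = round(metrics["test_samples_per_second"] * metrics["test_runtime"])
error_rate = 1.0 - metrics["test_accuracy"]
n_errors = round(n_test * error_rate)

print(f"Estimated test images: {n_test}")
print(f"Error rate: {error_rate:.4%}")
print(f"Estimated misclassified images: {n_errors}")
```

Under that assumption, the numbers point to a test set of roughly 33,000 images with about 250 misclassified.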

Final test accuracy: 0.9923
Final test F1 score (macro): 0.9923
Final test F1 score (weighted): 0.9923
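
Accuracy, macro F1, and weighted F1 all landing on 0.9923 is consistent with roughly balanced classes and similar error rates in each class. The sketch below shows how the three metrics are computed from a two-class confusion matrix. The matrix is a made-up example for illustration only; the card does not publish the model's confusion matrix, and the function name `f1_scores` is hypothetical.

```python
def f1_scores(confusion):
    """Compute accuracy, macro F1, and weighted F1.

    confusion[i][j] = number of samples with true class i predicted as class j.
    """
    n = len(confusion)
    total = sum(sum(row) for row in confusion)
    correct = sum(confusion[i][i] for i in range(n))

    per_class_f1, supports = [], []
    for i in range(n):
        tp = confusion[i][i]
        fn = sum(confusion[i]) - tp                        # missed samples of class i
        fp = sum(confusion[r][i] for r in range(n)) - tp   # wrongly predicted as class i
        denom = 2 * tp + fp + fn
        per_class_f1.append(2 * tp / denom if denom else 0.0)
        supports.append(sum(confusion[i]))

    macro = sum(per_class_f1) / n  # unweighted mean over classes
    weighted = sum(f * s for f, s in zip(per_class_f1, supports)) / total
    return correct / total, macro, weighted


# Hypothetical counts, NOT the model's real results.
confusion = [
    [90, 10],  # true "ai":  90 predicted ai, 10 predicted hum
    [5, 95],   # true "hum":  5 predicted ai, 95 predicted hum
]
accuracy, macro_f1, weighted_f1 = f1_scores(confusion)
print(f"Accuracy:    {accuracy:.4f}")
print(f"Macro F1:    {macro_f1:.4f}")
print(f"Weighted F1: {weighted_f1:.4f}")
```

Because both classes have the same number of samples in this toy matrix, macro and weighted F1 coincide; they diverge only when class sizes differ.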