Geometric-Shapes-Classification開源圖像分類模型，免費精準識別8種基本幾何形狀

首頁

Geometric Shapes Classification

由prithivMLmods開發

基於SigLIP2微調的圖像分類模型，專用於識別8種基本幾何形狀

圖像分類

Transformers

支持多種語言開源協議:Apache-2.0 #高精度形狀識別 #教育輔助工具 #SigLIP2架構

下載量 159

發布時間 : 4/4/2025

模型概述

該模型採用SiglipForImageClassification架構，可準確分類圓形、風箏形、平行四邊形、長方形、菱形、正方形、梯形和三角形等幾何形狀。

模型特點

高精度分類

在8類形狀識別任務中達到99.08%的準確率

符號化輸出

分類結果附帶幾何符號標識（如◯▲◼等）

教育友好

特別適合幾何教學場景的視覺識別

模型能力

幾何形狀識別

圖像分類

視覺特徵提取

使用案例

教育

幾何教學輔助

自動識別並標註教學材料中的幾何形狀

提升幾何概念可視化教學效率

計算機視覺

工程圖紙分析

識別技術圖紙中的基本幾何元素

準確率超過99%的形狀分類

🚀 幾何形狀分類模型

本項目的幾何形狀分類模型是一個圖像分類的視覺語言編碼器模型，它基於 google/siglip2-base-patch16-224 進行微調，用於多類形狀識別任務。該模型使用 SiglipForImageClassification 架構對各種幾何形狀進行分類。

🚀 快速開始

安裝依賴

!pip install -q transformers torch pillow gradio

運行代碼

import gradio as gr
from transformers import AutoImageProcessor
from transformers import SiglipForImageClassification
from PIL import Image
import torch

# Load model and processor
model_name = "prithivMLmods/Geometric-Shapes-Classification"
model = SiglipForImageClassification.from_pretrained(model_name)
processor = AutoImageProcessor.from_pretrained(model_name)

# Label mapping with symbols
labels = {
    "0": "Circle ◯",
    "1": "Kite ⬰",
    "2": "Parallelogram ▰",
    "3": "Rectangle ▭",
    "4": "Rhombus ◆",
    "5": "Square ◼",
    "6": "Trapezoid ⏢",
    "7": "Triangle ▲"
}

def classify_shape(image):
    """Classifies the geometric shape in the input image."""
    image = Image.fromarray(image).convert("RGB")
    inputs = processor(images=image, return_tensors="pt")

    with torch.no_grad():
        outputs = model(**inputs)
        logits = outputs.logits
        probs = torch.nn.functional.softmax(logits, dim=1).squeeze().tolist()

    predictions = {labels[str(i)]: round(probs[i], 3) for i in range(len(probs))}
    
    return predictions

# Gradio interface
iface = gr.Interface(
    fn=classify_shape,
    inputs=gr.Image(type="numpy"),
    outputs=gr.Label(label="Prediction Scores"),
    title="Geometric Shapes Classification",
    description="Upload an image to classify geometric shapes such as circle, triangle, square, and more."
)

# Launch the app
if __name__ == "__main__":
    iface.launch()

💻 使用示例

基礎用法

# 以下代碼展示瞭如何使用該模型進行幾何形狀分類
import gradio as gr
from transformers import AutoImageProcessor
from transformers import SiglipForImageClassification
from PIL import Image
import torch

# 加載模型和處理器
model_name = "prithivMLmods/Geometric-Shapes-Classification"
model = SiglipForImageClassification.from_pretrained(model_name)
processor = AutoImageProcessor.from_pretrained(model_name)

# 帶有符號的標籤映射
labels = {
    "0": "Circle ◯",
    "1": "Kite ⬰",
    "2": "Parallelogram ▰",
    "3": "Rectangle ▭",
    "4": "Rhombus ◆",
    "5": "Square ◼",
    "6": "Trapezoid ⏢",
    "7": "Triangle ▲"
}

def classify_shape(image):
    """對輸入圖像中的幾何形狀進行分類。"""
    image = Image.fromarray(image).convert("RGB")
    inputs = processor(images=image, return_tensors="pt")

    with torch.no_grad():
        outputs = model(**inputs)
        logits = outputs.logits
        probs = torch.nn.functional.softmax(logits, dim=1).squeeze().tolist()

    predictions = {labels[str(i)]: round(probs[i], 3) for i in range(len(probs))}
    
    return predictions

# Gradio 界面
iface = gr.Interface(
    fn=classify_shape,
    inputs=gr.Image(type="numpy"),
    outputs=gr.Label(label="Prediction Scores"),
    title="Geometric Shapes Classification",
    description="上傳一張圖像，對圓形、三角形、正方形等幾何形狀進行分類。"
)

# 啟動應用
if __name__ == "__main__":
    iface.launch()

📚 詳細文檔

分類報告

Classification Report:
                 precision    recall  f1-score   support

       Circle ◯     0.9921    0.9987    0.9953      1500
         Kite ⬰     0.9927    0.9927    0.9927      1500
Parallelogram ▰     0.9926    0.9840    0.9883      1500
    Rectangle ▭     0.9993    0.9913    0.9953      1500
      Rhombus ◆     0.9846    0.9820    0.9833      1500
       Square ◼     0.9914    0.9987    0.9950      1500
    Trapezoid ⏢     0.9966    0.9793    0.9879      1500
     Triangle ▲     0.9772    0.9993    0.9881      1500

       accuracy                         0.9908     12000
      macro avg     0.9908    0.9908    0.9907     12000
   weighted avg     0.9908    0.9908    0.9907     12000

模型分類的類別

該模型將圖像分類為以下類別：

類別 0：圓形 ◯
類別 1：風箏形 ⬰
類別 2：平行四邊形 ▰
類別 3：矩形 ▭
類別 4：菱形 ◆
類別 5：正方形 ◼
類別 6：梯形 ⏢
類別 7：三角形 ▲

預期用途

幾何形狀分類 模型旨在識別圖像中的基本幾何形狀。示例用例如下：

教育工具：用於以可視化方式學習和教授幾何知識。
計算機視覺項目：作為機器人或自動化中的形狀檢測器。
圖像分析：識別圖表或工程圖紙中的符號。
輔助技術：支持視障應用中的形狀識別。

📄 許可證

本項目採用 Apache-2.0 許可證。

📦 模型信息

屬性	詳情
模型類型	圖像分類視覺語言編碼器模型
基礎模型	google/siglip2-base-patch16-224
訓練數據集	prithivMLmods/Math-Shapes
庫名稱	transformers
標籤	Shapes、Geometric、SigLIP2、art
管道標籤	圖像分類