G

Git RSCLIP

Developed by lcybuaa
Git-RSCLIP is a vision-language model pretrained on the Git-10M dataset, specializing in multimodal understanding of remote sensing images.
Downloads 59.37k
Release Time : 3/3/2025

Model Overview

This model is a vision-language model specifically designed for tasks involving the association of remote sensing images and text, supporting functions such as zero-shot image classification and image-text retrieval.

Model Features

Global-Scale Remote Sensing Dataset
Pretrained on the Git-10M dataset, which contains 10 million remote sensing image-text pairs, covering a global scope.
High-Resolution Processing
Supports image processing at 256x256 resolution, suitable for high-precision requirements of remote sensing images.
Zero-Shot Learning Capability
Can be directly applied to zero-shot image classification and image-text retrieval tasks without fine-tuning.

Model Capabilities

Zero-Shot Image Classification
Image-Text Retrieval
Remote Sensing Image Understanding

Use Cases

Remote Sensing Image Analysis
Remote Sensing River Image Classification
Identify rivers and other geographical features in remote sensing images.
High-accuracy zero-shot classification capability
House and Road Detection
Detect artificial structures such as houses and roads from remote sensing images.
Supports multi-label classification
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase