C

Clip Rsicd V2

Developed by flax-community
A remote sensing image-specific model fine-tuned based on OpenAI CLIP, enhancing zero-shot classification and cross-modal retrieval capabilities
Downloads 3,229
Release Time : 3/2/2022

Model Overview

This model is optimized for remote sensing images and can perform zero-shot image classification, text-to-image, and image-to-image retrieval tasks, particularly suitable for geospatial analysis scenarios.

Model Features

Optimized for Remote Sensing
Fine-tuned with professional datasets like RSICD, significantly improving the understanding of satellite/aerial images
Zero-shot Classification
Performs image classification on new categories without specific training
Cross-modal Retrieval
Supports bidirectional retrieval between text-to-image and image-to-image
Efficient Training
Achieves rapid convergence using TPU-v3-8 hardware and Adafactor optimizer

Model Capabilities

Remote sensing image classification
Text-to-image retrieval
Image-to-image retrieval
Zero-shot learning

Use Cases

Geospatial Analysis
Land Use Classification
Automatically identifies land types such as residential areas, forests, and airports in satellite images
Achieves 88.3% top-1 accuracy on the RSICD dataset
Disaster Assessment
Retrieves images of disaster-affected areas through text descriptions
National Defense and Security
Critical Facility Monitoring
Automatically detects important facilities such as military bases and ports
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase