C

Colpali V1.3 Hf

Developed by vidore
ColPali is a vision-language model extended from PaliGemma-3B, capable of efficiently indexing documents through visual features and generating ColBERT-style multi-vector representations.
Downloads 790
Release Time : 11/28/2024

Model Overview

This model indexes documents via visual features, combining PaliGemma-3B's vision-language capabilities with ColBERT's multi-vector representation strategy to achieve efficient document retrieval.

Model Features

Multi-vector Representation
Adopts the ColBERT strategy to generate multi-vector representations for text and images, improving retrieval accuracy.
Vision-Language Fusion
Combines PaliGemma-3B's vision-language capabilities to achieve cross-modal understanding.
Efficient Retrieval
Optimizes retrieval efficiency by indexing documents through visual features.

Model Capabilities

Visual Document Retrieval
Cross-modal Understanding
Multi-vector Representation Generation

Use Cases

Document Retrieval
PDF Document Retrieval
Quickly retrieve relevant content from PDF documents via visual features.
Cross-modal Search
Image-Text Association Search
Retrieve related image content based on text queries or vice versa.
Featured Recommended AI Models
ยฉ 2025AIbase