C

Colpali V1.1

Developed by vidore
ColPali is a visual retrieval model based on PaliGemma-3B and the ColBERT strategy, used to efficiently index documents from visual features.
Downloads 196
Release Time : 8/21/2024

Model Overview

ColPali is a vision-language model (VLM) that can generate ColBERT-style multi-vector representations of text and images, mainly used for document retrieval tasks.

Model Features

Multi-vector representation
Generate ColBERT-style multi-vector representations of text and images to improve retrieval efficiency.
Vision-language fusion
Combine the advantages of SigLIP and PaliGemma-3B to achieve deep fusion of visual and language features.
Efficient retrieval
Calculate the interaction between text tokens and image blocks through the ColBERT strategy to significantly improve retrieval performance.

Model Capabilities

Visual document retrieval
Multimodal representation learning
Document indexing

Use Cases

Document retrieval
Academic document retrieval
Quickly retrieve relevant information from a large number of PDF documents.
Performed excellently in the ViDoRe benchmark test
Enterprise document management
Efficiently manage and retrieve enterprise internal documents.
Featured Recommended AI Models
ยฉ 2025AIbase