C

Colqwen2 V1.0

Developed by vidore
ColQwen2 is a visual retrieval model based on Qwen2-VL-2B-Instruct and the ColBERT strategy, designed for efficient indexing of document visual features.
Downloads 106.85k
Release Time : 11/3/2024

Model Overview

ColQwen2 is a Vision-Language Model (VLM) capable of generating ColBERT-style multi-vector representations for both text and images, primarily used for document retrieval tasks.

Model Features

Dynamic Input Image Resolution
Supports original aspect ratio input without resizing, with maximum resolution set to generate up to 768 image patches.
Multi-vector Representation
Utilizes ColBERT-style multi-vector representation to improve retrieval efficiency.
LoRA Adapter
Applies Low-Rank Adaptation (LoRA) with parameters alpha=32 and r=32 to the Transformer layers and projection layers of the language model.

Model Capabilities

Visual Document Retrieval
Multimodal Representation Learning
Cross-modal Retrieval

Use Cases

Document Retrieval
Academic Literature Retrieval
Retrieve relevant academic literature from a large collection of PDF documents.
Significantly improves retrieval efficiency.
Enterprise Document Management
Efficient indexing and retrieval of internal corporate documents.
Enhances document search efficiency.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase