C

Colqwen2 7b V1.0

Developed by yydxlv
A visual retrieval model based on Qwen2-VL-7B-Instruct and ColBERT strategy, supporting multi-vector text and image representation
Downloads 25
Release Time : 12/24/2024

Model Overview

ColQwen is a novel vision-language model architecture that efficiently indexes documents through visual features, particularly suitable for PDF document retrieval

Model Features

Dynamic Image Resolution Support
Supports dynamic input image resolution without adjusting aspect ratio, capable of generating up to 768 image patches
Multi-Vector Representation
Adopts ColBERT-style multi-vector text and image representation to enhance retrieval efficiency
Efficient Training Strategy
Utilizes LoRA adapter training to optimize computational resource usage

Model Capabilities

Visual Document Retrieval
Multimodal Embedding
Image Feature Extraction
Text Feature Extraction

Use Cases

Document Retrieval
PDF Document Retrieval
Content retrieval for PDF documents based on visual features
Improves document retrieval efficiency
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase