C

Colqwen2 7b V1.0

Developed by tsystems
A visual retrieval model based on Qwen2-VL-7B-Instruct using ColBERT strategy, focusing on efficient visual feature indexing for documents
Downloads 172
Release Time : 12/30/2024

Model Overview

ColQwen is a novel architecture based on vision-language models, capable of generating ColBERT-style multi-vector text and image representations for efficient document retrieval

Model Features

Dynamic Image Resolution Processing
Accepts dynamic resolution input without resizing, maintains original aspect ratio, and generates up to 768 image patches
Multi-Vector Representation
Adopts ColBERT-style multi-vector text and image representations to enhance retrieval efficiency
LoRA Fine-tuning
Uses Low-Rank Adaptation (LoRA) for efficient fine-tuning, with alpha=64 and r=64 set in the language model Transformer layers and projection layers

Model Capabilities

Visual Document Retrieval
Multimodal Embedding
Cross-modal Matching

Use Cases

Document Retrieval
PDF Document Retrieval
Quickly retrieves relevant content from a large number of PDF documents
Experiments show that increasing the number of image patches significantly improves performance
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase