C

Colqwen2.5 3b Multilingual V1.0

Developed by tsystems
A multilingual visual retrieval model based on Qwen2.5-VL-3B-Instruct and ColBERT strategy, supporting dynamic input image resolution and multilingual document retrieval.
Downloads 13.29k
Release Time : 3/9/2025

Model Overview

ColQwen is a novel vision-language model architecture that efficiently indexes documents through visual features, generating ColBERT-style multi-vector text and image representations, suitable for multilingual visual document retrieval tasks.

Model Features

Multilingual Support
Supports visual document retrieval in five languages: English, French, Spanish, Italian, and German
Dynamic Image Resolution
Supports dynamic input image resolution without changing aspect ratio, with a maximum limit of 768 image patches
Efficient Retrieval Architecture
Adopts ColBERT-style multi-vector representation strategy to improve document retrieval efficiency
Multimodal Embedding
Processes both text and image inputs simultaneously to generate joint multimodal embeddings

Model Capabilities

Multilingual visual document retrieval
Text-to-image retrieval
Multimodal embedding generation
Dynamic resolution image processing

Use Cases

Document Retrieval
Multilingual PDF Document Retrieval
Retrieves relevant documents from a multilingual PDF document library based on text queries
Efficiently retrieves documents containing visual content
Visual Question Answering System
Answers user questions based on document image content
Provides accurate answers by combining text and visual information
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase