C

Colqwen2.5 3b Multilingual

Developed by Metric-AI
A multilingual visual retriever based on Qwen2.5-VL-3B-Instruct, employing ColBERT strategy, with outstanding performance on the Vidore benchmark
Downloads 1,957
Release Time : 2/1/2025

Model Overview

ColQwen is a novel architecture and training strategy based on Vision-Language Models (VLM), capable of efficiently indexing documents from visual features. Supports multilingual and multimodal embeddings, suitable for text-to-visual document retrieval tasks.

Model Features

Multilingual Support
Supports visual document retrieval in multiple languages including English, French, Spanish, Italian, and German
Dynamic Input Resolution
Supports dynamic input image resolution without altering the original aspect ratio, with a maximum resolution limit of generating up to 768 image patches
Efficient Retrieval
Employs ColBERT strategy to generate multi-vector text and image representations, improving retrieval efficiency
High Performance
Ranked first among models with less than 7B parameters and third overall in the Vidore benchmark

Model Capabilities

Multimodal Embedding
Multilingual Embedding
Text-to-Visual Document Retrieval
Efficient Document Indexing

Use Cases

Document Retrieval
Multilingual Document Retrieval
Retrieve relevant content from multilingual documents
Efficient and accurate retrieval of multilingual documents
Visual Document Retrieval
Retrieve relevant content from visual documents
Supports dynamic input resolution, improving retrieval efficiency
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase