C

Colsmol 500M

Developed by vidore
A visual retrieval model based on SmolVLM-Instruct-500M and the ColBERT strategy, capable of efficiently indexing documents through visual features
Downloads 1,807
Release Time : 1/22/2025

Model Overview

ColSmolVLM is a novel architecture and training strategy based on Vision-Language Models (VLMs) that generates ColBERT-style multi-vector representations for text and images, enabling efficient document retrieval

Model Features

ColBERT-style Multi-vector Representation
Generates multi-vector representations for text and images, improving retrieval efficiency
Efficient Visual Feature Indexing
Efficiently indexes document content through visual features
LoRA Adapter Training
Applies LoRA adapters to the Transformer and projection layers of the language model for training

Model Capabilities

Visual Document Retrieval
Multi-vector Representation Generation
Image-Text Matching

Use Cases

Document Retrieval
Academic Literature Retrieval
Retrieve relevant content from PDF documents through visual features
Enterprise Document Management
Quickly locate relevant information within company internal documents
Featured Recommended AI Models
ยฉ 2025AIbase