Colsmol 256M

Developed by vidore
A visual retriever based on SmolVLM-Instruct-250M that uses the ColBERT strategy to efficiently index documents from their visual features
Downloads 42.84k
Release Time: 1/22/2025

Model Overview

ColSmolVLM is a vision-language model (VLM) built on a novel architecture and training strategy. It generates ColBERT-style multi-vector representations of text and images, which enables efficient document retrieval
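
To make "ColBERT-style multi-vector representation" concrete, here is a minimal sketch of late-interaction (MaxSim) scoring, the scoring rule usually associated with ColBERT. It assumes the query and each document page have already been encoded into lists of vectors; the tiny hand-written vectors and the function names (dot, maxsim_score, rank_documents) are illustrative assumptions, not part of the model's actual API.

```python
def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def maxsim_score(query_vecs, doc_vecs):
    """Late-interaction score: for each query vector, take its best
    match among the document vectors, then sum those maxima."""
    return sum(max(dot(q, d) for d in doc_vecs) for q in query_vecs)


def rank_documents(query_vecs, docs):
    """Return document indices sorted from highest to lowest score."""
    scores = [maxsim_score(query_vecs, d) for d in docs]
    return sorted(range(len(docs)), key=lambda i: scores[i], reverse=True)


# Toy 2-dimensional embeddings (hypothetical values for illustration).
query = [[1.0, 0.0], [0.0, 1.0]]                # two query token vectors
page_a = [[1.0, 0.0], [0.0, 1.0], [0.6, 0.8]]   # three image patch vectors
page_b = [[0.6, 0.8]]                           # one image patch vector

print(maxsim_score(query, page_a))
print(maxsim_score(query, page_b))
print(rank_documents(query, [page_b, page_a]))
```

In this toy case page_a scores 2.0 because each query vector finds an exact match, while page_b scores about 1.4, so page_a ranks first. Keeping one vector per token or image patch, rather than one pooled vector per document, is what lets each query term be matched against the most relevant region of a page.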

Model Features

ColBERT-Style Multi-Vector Representation
Generates multi-vector representations for both text and images to improve retrieval efficiency
Efficient Visual Document Retrieval
Optimized specifically for tasks that index documents from their visual features
LoRA Adapter Training
Trained using Low-Rank Adapters (LoRA) for parameter efficiency
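
As a rough illustration of why LoRA is parameter-efficient, the sketch below applies the standard low-rank update to a single linear layer: the frozen weight W is left untouched and only two small matrices A and B are trained. The layer sizes, rank, and alpha used here are assumptions chosen for illustration; this page does not state the adapter configuration actually used for the model.

```python
def matvec(m, x):
    """Multiply matrix m (list of rows) by vector x."""
    return [sum(a * b for a, b in zip(row, x)) for row in m]


def lora_forward(W, A, B, x, alpha=1.0):
    """Compute y = W x + (alpha / r) * B (A x).

    W: frozen weight, shape (d_out, d_in)
    A: trainable,     shape (r, d_in)
    B: trainable,     shape (d_out, r)
    """
    r = len(A)
    base = matvec(W, x)
    update = matvec(B, matvec(A, x))
    scale = alpha / r
    return [b + scale * u for b, u in zip(base, update)]


def param_counts(d_out, d_in, r):
    """Return (full fine-tuning params, LoRA trainable params) for one layer."""
    return d_out * d_in, r * (d_in + d_out)


# Toy example with rank r = 1 (hypothetical values).
W = [[1.0, 0.0], [0.0, 1.0]]   # frozen 2x2 weight
A = [[1.0, 1.0]]               # 1x2
B = [[0.5], [0.0]]             # 2x1
print(lora_forward(W, A, B, [1.0, 2.0]))

# Illustrative sizes only: a 1024x1024 layer with rank 8.
print(param_counts(1024, 1024, 8))
```

With these illustrative sizes, the adapter trains 16,384 parameters for the layer instead of 1,048,576, which is where the parameter efficiency comes from.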

Model Capabilities

Visual Document Retrieval
Multimodal Representation Learning
Cross-Modal Matching

Use Cases

Document Retrieval
Academic Literature Retrieval
Retrieving relevant academic literature from large collections of PDF documents
Enterprise Document Management
Assisting enterprises in managing internal document libraries for quick information retrieval