D

Dse Qwen2 2b Mrl V1

Developed by MrLight
DSE-QWen2-2b-MRL-V1 is a dual-encoder model specifically designed for encoding document screenshots into dense vectors to facilitate document retrieval.
Downloads 4,447
Release Time : 9/11/2024

Model Overview

This model employs the Document Screenshot Embedding (DSE) approach, capturing documents in their original visual format to preserve all information (such as text, images, and layout), avoiding cumbersome parsing and potential information loss. It aims to provide a universal embedding model for retrieving text, PDF documents, web pages, and slides.

Model Features

Original Visual Format Processing
Directly processes document screenshots, preserving original layout, text, and image information.
Flexible Representation Dimensionality
Supports adjusting output embedding dimensions to balance effectiveness and efficiency.
Flexible Input Size
Input image size can be adjusted based on GPU resources.
Multilingual Support
Supports processing documents in English and French.

Model Capabilities

Document Screenshot Embedding
Dense Vector Retrieval
Cross-Modal Document Understanding
Multilingual Document Processing

Use Cases

Document Retrieval
Academic Paper Retrieval
Retrieve relevant literature through paper screenshots.
Achieved 85.8 nDCG@5 on the ViDoRE leaderboard.
Enterprise Document Management
Quickly retrieve corporate documents such as PDFs and PPTs.
Cross-Modal Search
Text-Image Hybrid Retrieval
Simultaneously processes text and visual information in documents for retrieval.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase