C

Colbertv2 Camembert L4 Mmarcofr

Developed by antoinelouis
A lightweight ColBERTv2 model designed specifically for French semantic search, supporting efficient context matching retrieval.
Downloads 533
Release Time : 3/11/2024

Model Overview

This model is based on the ColBERTv2 architecture and optimized for French semantic search. It can encode queries and text paragraphs into token-level embedding matrices and achieve efficient matching through the MaxSim operator.

Model Features

Lightweight design
With only 54M parameters and a model size of 0.2GB, it is suitable for deployment in resource-constrained environments.
Efficient retrieval
Uses the MaxSim operator to achieve efficient context matching retrieval, supporting large-scale corpus search.
French optimization
Specifically trained and optimized for French semantic search, performing excellently on the mMARCO-fr dataset.
Residual compression
Adopts the residual compression mechanism of ColBERTv2, significantly reducing the index storage space requirement.

Model Capabilities

French semantic search
Context matching retrieval
Large-scale corpus indexing

Use Cases

Information retrieval
Document retrieval system
Build a French document retrieval system to quickly find documents that match the query semantically.
Achieved a recall rate of 91.9%@1000 on the mMARCO-fr dataset
Question-answering system
As the retrieval component of a question-answering system, quickly find candidate paragraphs related to the question.
MRR@10 reached 32.3
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase