B

Biencoder Distilcamembert Mmarcofr

Developed by antoinelouis
This is a dense single-vector dual encoder model for French, suitable for semantic search. The model maps queries and passages to 768-dimensional dense vectors and calculates relevance through cosine similarity.
Downloads 160
Release Time : 5/22/2023

Model Overview

This model is a dual encoder based on DistilCamemBERT, specifically optimized for French information retrieval tasks, capable of efficiently computing semantic similarity between queries and passages.

Model Features

French optimization
Semantic retrieval model specifically optimized for French text
Efficient retrieval
Uses 768-dimensional dense vector representation to support fast cosine similarity calculation
Hard negative mining
Utilized hard negative samples mined by 12 different retrievers during training

Model Capabilities

Semantic similarity calculation
Passage retrieval
Information retrieval

Use Cases

Information retrieval
Document retrieval system
Build a French document retrieval system that returns the most relevant documents based on user queries
Achieved Recall@500 of 87.9 on the mMARCO-fr validation set
Question answering system
Serves as the retrieval component in a QA system to find relevant passages from a knowledge base
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase