C

Camembert Base Lleqa

Developed by maastrichtlawtech
A French sentence embedding model based on CamemBERT, specifically optimized for French legal information retrieval tasks, capable of converting text into 768-dimensional vector space representations.
Downloads 25
Release Time : 9/28/2023

Model Overview

This model is a sentence embedding model fine-tuned on the French legal Q&A dataset LLeQA, suitable for tasks such as legal clause retrieval and semantic similarity calculation, effectively handling French legal texts.

Model Features

Legal Domain Optimization
Specifically fine-tuned for French legal texts, excelling in Belgian regulation retrieval tasks.
Efficient Semantic Encoding
Encodes sentences/paragraphs of any length into fixed 768-dimensional dense vectors, suitable for large-scale retrieval.
Contrastive Learning Training
Uses Q&A-clause contrastive learning objectives to enhance the model's ability to distinguish relevant legal clauses.

Model Capabilities

French Sentence Embedding
Semantic Similarity Calculation
Legal Clause Retrieval
Text Feature Extraction

Use Cases

Legal Information Retrieval
Citizen Legal Q&A System
Automatically retrieves relevant legal clauses based on natural language questions.
Achieved a 58.27% R@10 recall rate on the test set.
Regulation Clause Clustering
Performs semantic clustering analysis on legal provisions.
Document Processing
Legal Document Similarity Comparison
Calculates semantic similarity between different legal documents.
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
Š 2025AIbase