D

Distilcamembert Lleqa

Developed by maastrichtlawtech
Sentence embedding model trained on the French legal information retrieval dataset LLeQA, suitable for semantic search and clustering tasks in legal texts
Downloads 22
Release Time : 9/28/2023

Model Overview

This model maps sentences and paragraphs to a 768-dimensional dense vector space, specifically optimized for French legal texts, and can be used in applications such as legal Q&A systems and clause retrieval

Model Features

Legal Domain Optimization
Fine-tuned on the professional legal Q&A dataset LLeQA, demonstrating excellent performance in legal text comprehension
Efficient Inference
Adopts the DistilCamemBERT-base architecture, improving inference efficiency while maintaining performance
Semantic Retrieval Capability
Accurately calculates semantic similarity between legal questions and clauses, supporting efficient retrieval

Model Capabilities

Sentence vectorization
Semantic similarity calculation
Legal text retrieval
Legal Q&A support

Use Cases

Legal Information Retrieval
Legal Clause Matching
Automatically retrieves relevant legal clauses based on user-submitted legal questions
Achieves 52.95% R@10 recall rate on the LLeQA test set
Legal Q&A System
Serves as the retrieval module for legal Q&A systems to improve answer accuracy
Legal Text Analysis
Legal Document Clustering
Automatically classifies and organizes large volumes of legal documents
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase