Bert Large Portuguese Cased Legal Tsdae Gpl Nli Sts MetaKD V0

Developed by stjiris
This is a large Portuguese legal domain sentence transformer model based on BERTimbau, specifically designed for semantic similarity tasks in legal texts.
Downloads 63
Release Time: 3/3/2023

Model Overview

The model maps sentences and paragraphs into a 1024-dimensional dense vector space, making it suitable for tasks such as clustering and semantic search. It is a legal domain variant of the BERTimbau large model, trained with TSDAE (Transformer-based Sequential Denoising Auto-Encoder) and then fine-tuned on NLI and STS tasks.

Model Features

Legal domain optimization
Optimized for Portuguese legal texts and used in the semantic search system for the Portuguese Supreme Court
Metadata knowledge distillation
Uses metadata knowledge distillation (MetaKD) to improve dense-vector information retrieval
Multi-stage training
First trained without supervision using TSDAE, then fine-tuned on NLI and STS tasks

Model Capabilities

Sentence embedding generation
Semantic similarity calculation
Legal text analysis
Information retrieval
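
As an illustration of the semantic similarity calculation listed above, the following is a minimal sketch of cosine similarity between two sentence embeddings. The 4-dimensional vectors are made-up toy values chosen for readability; real embeddings from this model would have 1024 dimensions, and the function name is an assumption for this example, not part of any library.

```python
import math


def cosine_similarity(a, b):
    """Return the cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same dimension")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        # A zero vector has no direction; treat it as dissimilar to everything.
        return 0.0
    return dot / (norm_a * norm_b)


# Toy stand-ins for sentence embeddings (hypothetical values, not model output).
emb_a = [0.20, 0.80, 0.10, 0.00]
emb_b = [0.25, 0.75, 0.05, 0.00]  # points in nearly the same direction as emb_a
emb_c = [0.90, 0.00, 0.10, 0.30]  # points in a different direction

sim_ab = cosine_similarity(emb_a, emb_b)
sim_ac = cosine_similarity(emb_a, emb_c)
print(f"a vs b: {sim_ab:.3f}, a vs c: {sim_ac:.3f}")
```

Scores close to 1 indicate vectors pointing in nearly the same direction, which is how sentence-embedding models typically express semantic closeness.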

Use Cases

Legal information retrieval
Supreme Court case search
Used to build a semantic search system for the Portuguese Supreme Court
Compared with the BM25 baseline, the discovery metric for the first query result improved by 335%
Legal text analysis
Legal document similarity analysis
Calculates semantic similarity between different legal documents or judgments
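
The retrieval use cases above can be sketched as a ranking of document embeddings by similarity to a query embedding. This is a hedged illustration only: the document IDs and 3-dimensional vectors are hypothetical, the function names are invented for this example, and it does not describe how the Supreme Court search system is actually implemented. In practice the vectors would be 1024-dimensional embeddings produced by the model.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length; a zero vector is returned unchanged."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0.0:
        return list(vec)
    return [x / norm for x in vec]


def semantic_search(query_vec, doc_vecs, top_k=3):
    """Rank documents by cosine similarity to the query, highest first."""
    q = l2_normalize(query_vec)
    scored = []
    for doc_id, vec in doc_vecs.items():
        d = l2_normalize(vec)
        # For unit vectors, the dot product equals the cosine similarity.
        score = sum(a * b for a, b in zip(q, d))
        scored.append((doc_id, score))
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]


# Hypothetical index of judgment embeddings (toy values, not model output).
toy_index = {
    "judgment_A": [0.9, 0.1, 0.0],
    "judgment_B": [0.1, 0.9, 0.0],
    "judgment_C": [0.7, 0.3, 0.1],
}

results = semantic_search([1.0, 0.0, 0.0], toy_index, top_k=2)
for doc_id, score in results:
    print(doc_id, round(score, 3))
```

A brute-force scan like this is fine for small collections; larger corpora usually rely on an approximate nearest-neighbour index instead.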