B

Bert Large Portuguese Cased Legal Tsdae Gpl Nli Sts MetaKD V1

Developed by stjiris
A Portuguese sentence transformer specialized for the legal domain based on the BERTimbau large model, suitable for semantic similarity calculation and clustering tasks
Downloads 74
Release Time : 3/3/2023

Model Overview

This is a sentence transformer model optimized for Portuguese legal texts, capable of mapping sentences to a 1024-dimensional dense vector space, particularly suitable for semantic search and similarity calculation tasks involving legal documents.

Model Features

Legal Domain Optimization
Trained on approximately 30,000 legal documents, excelling in legal text processing
Advanced Training Techniques
Utilizes TSDAE technology and metadata knowledge distillation to enhance semantic representation capabilities
Multi-Dataset Fine-Tuning
Optimized on multiple Portuguese datasets including assin, assin2, and stsb_multi_mt
High-Dimensional Vector Space
Maps text to a 1024-dimensional dense vector space, suitable for complex semantic analysis

Model Capabilities

Semantic similarity calculation
Legal text clustering
Information retrieval
Sentence vectorization

Use Cases

Legal Document Processing
Legal Document Similarity Analysis
Calculate semantic similarity between different legal documents
Performs exceptionally well on STJ judicial documents
Legal Semantic Search System
Build a semantic-based legal document retrieval system
Applied in Supreme Court document retrieval
Text Analysis
Legal Text Clustering
Automatically classify and cluster large volumes of legal documents
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase