B

Bert Large Portuguese Cased Legal Mlm

Developed by stjiris
Legal domain-specific Portuguese model trained on BERTimbau large version, supports semantic search and text embedding
Downloads 109
Release Time : 1/4/2023

Model Overview

This is a BERT model optimized for Portuguese legal texts, capable of mapping legal sentences to a 1024-dimensional vector space, suitable for tasks like legal document clustering and semantic search.

Model Features

Legal Domain Optimization
Specially trained on approximately 30,000 Portuguese legal documents, suitable for handling legal terminology and expressions
High-Quality Embeddings
Generates 1024-dimensional dense vectors, effectively capturing semantic features of legal texts
Large-Scale Pretraining
Further trained based on BERTimbau large version (by neuralmind)

Model Capabilities

Legal text semantic understanding
Document similarity calculation
Legal information retrieval
Text embedding generation

Use Cases

Judicial System
Case Law Semantic Search
Quickly find historical case laws similar to current cases
Performs excellently in STJ Supreme Court semantic search system
Legal Document Classification
Automatically categorize court documents and litigation materials
Legal Tech
Intelligent Legal Consultation
Serves as the semantic understanding component for legal Q&A systems
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase