L

Legal Roberta Large

Developed by lexlms
A legal domain language model continuously pre-trained on the LeXFiles legal corpus based on the RoBERTa large model
Downloads 367
Release Time : 11/11/2022

Model Overview

LexLM is a series of RoBERTa models specifically optimized for the legal domain, enhancing legal text comprehension through continuous pre-training and supporting legal document analysis and processing tasks

Model Features

Legal Domain Optimization
Continuously pre-trained on the diverse LeXFiles legal corpus, specifically optimized for legal text processing capabilities
Mixed Case Support
Consistent with mainstream large language models, supports mixed-case text processing
Balanced Training Strategy
Uses an exponential smoothing sentence sampler to balance token ratios across sub-corpora, preventing overfitting
Efficient Tokenizer
Trained with a new 50K BPE tokenizer, reusing embeddings of overlapping tokens from the original vocabulary

Model Capabilities

Legal Text Comprehension
Legal Document Analysis
Legal Terminology Recognition
Legal Text Fill-Mask Prediction

Use Cases

Legal Document Processing
Legal Agreement Analysis
Analyzing key clauses and terms in legal agreements
Legal Case Analysis
Understanding key facts and legal issues in case descriptions
Legal Text Generation
Legal Document Completion
Automatically completing missing content in legal documents
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase