Legalbert
BERT-based pre-trained model specialized for legal texts with optimizations for legal text characteristics
Downloads 467
Release Time : 3/2/2022
Model Overview
This model is a BERT variant further pre-trained on large-scale legal judgment texts, specifically designed for natural language processing tasks in the legal domain, such as legal text classification and case analysis.
Model Features
Legal domain specialization
Further pre-trained on 37GB of legal judgment texts, optimized for legal terminology and text structure
Large-scale training data
Training corpus includes 3,446,187 legal judgments, far exceeding the scale of original BERT training data
Multi-task support
Supports masked language modeling, next sentence prediction, and legal-specific tasks like CaseHOLD multiple-choice questions
Model Capabilities
Legal text understanding
Legal text classification
Legal multiple-choice question answering
Legal text generation
Legal semantic analysis
Use Cases
Legal text analysis
Precedent overturning prediction
Analyze legal judgment texts to predict the likelihood of overturning precedents
Terms of service classification
Automatically classify legal contracts and terms of service
Legal education
CaseHOLD multiple-choice question answering
Assist in answering case-based multiple-choice questions in legal education
Featured Recommended AI Models
Qwen2.5 VL 7B Abliterated Caption It I1 GGUF
Apache-2.0
Quantized version of Qwen2.5-VL-7B-Abliterated-Caption-it, supporting multilingual image description tasks.
Image-to-Text
Transformers Supports Multiple Languages

Q
mradermacher
167
1
Nunchaku Flux.1 Dev Colossus
Other
The Nunchaku quantized version of the Colossus Project Flux, designed to generate high-quality images based on text prompts. This model minimizes performance loss while optimizing inference efficiency.
Image Generation English
N
nunchaku-tech
235
3
Qwen2.5 VL 7B Abliterated Caption It GGUF
Apache-2.0
This is a static quantized version based on the Qwen2.5-VL-7B model, focusing on image captioning generation tasks and supporting multiple languages.
Image-to-Text
Transformers Supports Multiple Languages

Q
mradermacher
133
1
Olmocr 7B 0725 FP8
Apache-2.0
olmOCR-7B-0725-FP8 is a document OCR model based on the Qwen2.5-VL-7B-Instruct model. It is fine-tuned using the olmOCR-mix-0225 dataset and then quantized to the FP8 version.
Image-to-Text
Transformers English

O
allenai
881
3
Lucy 128k GGUF
Apache-2.0
Lucy-128k is a model developed based on Qwen3-1.7B, focusing on proxy-based web search and lightweight browsing, and can run efficiently on mobile devices.
Large Language Model
Transformers English

L
Mungert
263
2