R

Robertalex

Developed by PlanTL-GOB-ES
A RoBERTa base model trained on Spanish legal domain corpus, specializing in Spanish legal text processing
Downloads 379
Release Time : 3/2/2022

Model Overview

This model is a Spanish masked language model based on Transformer architecture, specifically optimized for legal domain texts, suitable for masked language modeling tasks or as a pre-training foundation for downstream tasks

Model Features

Legal domain specialization
Pre-trained on an 8.9GB Spanish legal domain corpus, demonstrating excellent performance in legal text processing
High-quality preprocessing
Training data underwent rigorous preprocessing including sentence segmentation, language detection, abnormal sentence filtering, and content deduplication
Multi-task adaptability
Can be directly used for masked language modeling tasks or fine-tuned as a base model for downstream tasks

Model Capabilities

Legal text understanding
Masked language modeling
Text feature extraction
Legal text classification
Legal named entity recognition

Use Cases

Legal text processing
Legal text completion
Automatically completing missing content in legal documents
Examples show accurate prediction of professional terminology in legal texts
Legal Q&A systems
Serving as a base model for legal question answering systems
Legal document classification
Automatic classification of legal documents
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase