Roberta Large Ner English
R
Roberta Large Ner English
Developed by Jean-Baptiste
An English named entity recognition model fine-tuned on RoBERTa-large, trained on the conll2003 dataset, specifically optimized for entity recognition in email/chat data.
Downloads 236.85k
Release Time : 3/2/2022
Model Overview
This model focuses on English named entity recognition tasks, excelling particularly in recognizing entities in informal texts (such as emails/chats), with improved performance for non-capitalized entities.
Model Features
Optimized Informal Text Processing
Validated on email and chat data, outperforming other models, especially suitable for processing informal texts.
Non-Capitalized Entity Recognition
Significantly better at recognizing non-capitalized entities compared to other models.
Simplified Tagging System
Removes B- and I- prefixes, using a simplified five-category tagging system: PER/ORG/LOC/MISC/O.
Model Capabilities
English Named Entity Recognition
Informal Text Processing
Multi-category Entity Classification
Use Cases
Text Analysis
Email Signature Detection
Identify signature sections in emails and extract information such as names and job titles.
Can be used to train LSTM models for signature detection (refer to the provided Medium article).
Chat Log Analysis
Extract names, organizations, and locations from instant messaging or chat logs.
Achieved a PER entity F1 score of 0.8967 on a private dataset.
Information Extraction
News Text Analysis
Extract key information such as names, organizations, and locations from news articles.
Achieved an overall F1 score of 0.9753 on the conll2003 validation set.
Featured Recommended AI Models