Es Anonimization Core Lg
A multilingual (Catalan and Spanish) anonymization model based on spaCy that identifies sensitive data so it can be anonymized.
Downloads 9,735
Release Time: 1/11/2023
Model Overview
The model identifies sensitive data in plain text written by users in Latin American Spanish and Catalan; the detected data is then anonymized.
Model Features
Multilingual support
Supports the anonymization of sensitive information in both Catalan and Spanish.
Multiple entity detection
Can detect multiple types of entities, including emails, financial information, identity information, etc.
Seamless integration
Integrates with BSC's AnonymizationPipeline, which makes it convenient to use in real-world projects.
Model Capabilities
Sensitive information detection
Text anonymization
Multilingual processing
Use Cases
Data privacy protection
User data anonymization
Perform sensitive information detection and anonymization on text data generated by users.
Protect user privacy and comply with data protection regulations.
ca_anonimization_core_lg
This is a multilingual (Catalan and Spanish) spaCy anonymization model. It is designed to work with BSC's AnonymizationPipeline, which identifies and anonymizes sensitive data in user-generated plain text in Spanish and Catalan.
Quick Start
This model is not meant to be used on its own; it is intended to run inside BSC's AnonymizationPipeline, which is available on GitHub.
Installation
You can install the model using the following command:
pip install https://huggingface.co/PlanTL-GOB-ES/es_anonimization_core_lg/resolve/main/es_anonimization_core_lg-any-py3-none-any.whl
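Once the wheel is installed, the model can presumably be loaded like any packaged spaCy pipeline. The sketch below is illustrative only: the package name `es_anonimization_core_lg` is inferred from the wheel filename rather than confirmed by the card, and `load_model` and `entity_spans` are hypothetical helper names. For actual anonymization, the card recommends running the model inside the AnonymizationPipeline.

```python
from typing import List, Tuple


def load_model(name: str = "es_anonimization_core_lg"):
    """Load the packaged pipeline. The package name is an assumption
    taken from the wheel filename."""
    import spacy  # imported lazily so entity_spans works without spaCy

    return spacy.load(name)


def entity_spans(doc) -> List[Tuple[int, int, str]]:
    """Return (start_char, end_char, label) for every entity in a spaCy Doc."""
    return [(ent.start_char, ent.end_char, ent.label_) for ent in doc.ents]


# Usage (requires spaCy and the installed wheel):
#   nlp = load_model()
#   doc = nlp("Me llamo Ana y mi correo es ana@example.com")
#   for start, end, label in entity_spans(doc):
#       print(label, doc.text[start:end])
```

Returning plain character offsets keeps the detection step separate from whatever replacement strategy is applied afterwards.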
⨠Features
- Multilingual Support: Works with both Catalan and Spanish.
- Entity Detection: Detects entities labeled EMAIL, FINANCIAL, ID, LOC, MISC, ORG, PER, TELEPHONE, VEHICLE, and ZIP.
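To make the detect-then-anonymize idea concrete, here is a minimal sketch of one possible masking step: each detected span is replaced by a placeholder carrying its label. The spans are written by hand as stand-ins for model output, and `mask_entities` is a hypothetical helper; the strategies actually offered by the AnonymizationPipeline may differ.

```python
from typing import Iterable, Tuple

Span = Tuple[int, int, str]  # (start_char, end_char, label)


def mask_entities(text: str, spans: Iterable[Span]) -> str:
    """Replace each labeled character span with a <LABEL> placeholder."""
    pieces = []
    cursor = 0
    for start, end, label in sorted(spans):
        if start < cursor:  # skip spans that overlap one already masked
            continue
        pieces.append(text[cursor:start])  # untouched text before the span
        pieces.append(f"<{label}>")
        cursor = end
    pieces.append(text[cursor:])  # remaining text after the last span
    return "".join(pieces)


text = "Escribe a ana@example.com o llama al 600123456."
email = "ana@example.com"
phone = "600123456"
# Hand-written spans standing in for what the NER component would detect.
spans = [
    (text.index(email), text.index(email) + len(email), "EMAIL"),
    (text.index(phone), text.index(phone) + len(phone), "TELEPHONE"),
]
print(mask_entities(text, spans))
```

Processing the spans in sorted order and tracking a cursor avoids the offset drift that would occur if replacements were applied to the string one at a time.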
Documentation
Model Information
Property | Details |
---|---|
Model Type | ca_anonimization_core_lg |
Version | 1.0.0 |
spaCy | >=3.2.3,<4.0.0 |
Default Pipeline | tok2vec , morphologizer , parser , attribute_ruler , lemmatizer , ner |
Components | tok2vec , morphologizer , parser , attribute_ruler , lemmatizer , ner |
Vectors | 500000 keys, 500000 unique vectors (300 dimensions) |
Sources | n/a |
License | MIT |
Author | Joaquin Silveira |
Label Scheme
View label scheme (322 labels for 3 components)
Component | Labels |
---|---|
morphologizer |
Definite=Def|Gender=Masc|Number=Sing|POS=DET|PronType=Art , POS=PROPN , POS=PUNCT|PunctSide=Ini|PunctType=Brck , POS=PUNCT|PunctSide=Fin|PunctType=Brck , Mood=Ind|Number=Sing|POS=AUX|Person=3|Tense=Pres|VerbForm=Fin , Gender=Masc|Number=Sing|POS=VERB|Tense=Past|VerbForm=Part , Definite=Def|Gender=Fem|Number=Sing|POS=DET|PronType=Art , Gender=Fem|Number=Sing|POS=NOUN , POS=ADP , NumType=Card|Number=Plur|POS=NUM , Gender=Masc|Number=Plur|POS=NOUN , Number=Sing|POS=ADJ , POS=CCONJ , Gender=Fem|Number=Sing|POS=DET|PronType=Ind , NumForm=Digit|NumType=Card|POS=NUM , NumForm=Digit|POS=NOUN , Gender=Masc|Number=Plur|POS=ADJ , POS=PUNCT|PunctType=Comm , POS=AUX|VerbForm=Inf , Case=Acc,Dat|POS=PRON|Person=3|PrepCase=Npr|PronType=Prs|Reflex=Yes , Definite=Def|Gender=Masc|Number=Plur|POS=DET|PronType=Art , POS=PRON|PronType=Rel , Mood=Ind|Number=Plur|POS=VERB|Person=3|Tense=Imp|VerbForm=Fin , Gender=Fem|Number=Sing|POS=DET|PronType=Art , Gender=Fem|Number=Sing|POS=DET|Person=3|Poss=Yes|PronType=Prs , Definite=Def|Gender=Fem|Number=Plur|POS=DET|PronType=Art , Gender=Fem|Number=Plur|POS=NOUN , Gender=Fem|Number=Plur|POS=ADJ , POS=VERB|VerbForm=Inf , Case=Acc,Dat|Number=Plur|POS=PRON|Person=3|PronType=Prs , Number=Plur|POS=ADJ , POS=PUNCT|PunctType=Peri , Number=Sing|POS=PRON|PronType=Rel , Gender=Masc|Number=Sing|POS=NOUN , Mood=Imp|Number=Sing|POS=VERB|Person=2|VerbForm=Fin , Gender=Masc|Number=Plur|POS=ADJ|VerbForm=Part , POS=SCONJ , Mood=Ind|Number=Plur|POS=AUX|Person=3|Tense=Pres|VerbForm=Fin , Gender=Masc|Number=Plur|POS=VERB|Tense=Past|VerbForm=Part , Definite=Def|Number=Sing|POS=DET|PronType=Art , Gender=Masc|Number=Sing|POS=DET|PronType=Ind , Gender=Fem|Number=Plur|POS=ADJ|VerbForm=Part , Gender=Masc|Number=Sing|POS=DET|PronType=Dem , POS=VERB|VerbForm=Ger , POS=NOUN , Gender=Fem|NumType=Card|Number=Sing|POS=NUM , Gender=Fem|Number=Sing|POS=ADJ|VerbForm=Part , Gender=Fem|NumType=Ord|Number=Plur|POS=ADJ , POS=SYM , Gender=Masc|Number=Sing|POS=ADJ , 
Gender=Masc|Number=Sing|POS=ADJ|VerbForm=Part , Mood=Ind|Number=Sing|POS=VERB|Person=3|Tense=Pres|VerbForm=Fin , Gender=Fem|Number=Sing|POS=DET|PronType=Dem , POS=ADV|Polarity=Neg , POS=ADV , Number=Sing|POS=PRON|PronType=Dem , Number=Sing|POS=NOUN , Mood=Ind|Number=Plur|POS=VERB|Person=3|Tense=Pres|VerbForm=Fin , Number=Plur|POS=NOUN , Mood=Sub|Number=Plur|POS=VERB|Person=3|Tense=Imp|VerbForm=Fin , Gender=Fem|Number=Sing|POS=ADJ , Mood=Sub|Number=Sing|POS=VERB|Person=1|Tense=Pres|VerbForm=Fin , Gender=Masc|Number=Sing|POS=PRON|PronType=Tot , Case=Loc|POS=PRON|Person=3|PronType=Prs , Gender=Fem|NumType=Ord|Number=Sing|POS=ADJ , Degree=Cmp|POS=ADV , Gender=Fem|Number=Plur|POS=DET|PronType=Art , Gender=Fem|Number=Plur|POS=DET|Person=3|Poss=Yes|PronType=Prs , Mood=Ind|Number=Sing|POS=VERB|Person=3|Tense=Fut|VerbForm=Fin , Gender=Masc|NumType=Ord|Number=Sing|POS=ADJ , Mood=Ind|Number=Sing|POS=AUX|Person=3|Tense=Fut|VerbForm=Fin , NumType=Card|POS=NUM , Mood=Ind|Number=Plur|POS=VERB|Person=3|Tense=Fut|VerbForm=Fin , Number=Sing|POS=PRON|PronType=Ind , Gender=Masc|Number=Sing|POS=DET|PronType=Art , Number=Plur|POS=DET|PronType=Ind , Mood=Sub|Number=Plur|POS=VERB|Person=3|Tense=Pres|VerbForm=Fin , Gender=Masc|Number=Plur|POS=DET|PronType=Dem , Mood=Ind|Number=Plur|POS=AUX|Person=3|Tense=Fut|VerbForm=Fin , Gender=Masc|NumType=Card|Number=Sing|POS=NUM , Mood=Sub|Number=Plur|POS=AUX|Person=3|Tense=Pres|VerbForm=Fin , Case=Acc|Gender=Fem|Number=Sing|POS=PRON|Person=3|PronType=Prs , Number=Sing|POS=DET|PronType=Ind , POS=PUNCT , Number=Sing|POS=DET|PronType=Rel , Case=Gen|POS=PRON|Person=3|PronType=Prs , Gender=Fem|NumType=Card|Number=Plur|POS=NUM , Mood=Ind|Number=Plur|POS=VERB|Person=1|Tense=Pres|VerbForm=Fin , POS=DET|PronType=Ind , POS=AUX , Case=Acc|Gender=Neut|Number=Sing|POS=PRON|Person=3|PronType=Prs , Case=Acc,Dat|Number=Plur|POS=PRON|Person=1|PronType=Prs , Degree=Cmp|Number=Sing|POS=ADJ , Number=Sing|POS=VERB , Gender=Masc|Number=Plur|POS=PRON|PronType=Ind , 
Gender=Fem|Number=Plur|POS=DET|PronType=Dem , Gender=Masc|Number=Plur|POS=DET|PronType=Art , Gender=Masc|Number=Plur|POS=DET|Person=3|Poss=Yes|PronType=Prs , Case=Acc|Gender=Fem,Masc|Number=Sing|POS=PRON|Person=3|PronType=Prs , Gender=Fem|Number=Sing|POS=VERB|Tense=Past|VerbForm=Part , Gender=Masc|Number=Sing|POS=PRON|PronType=Ind , Gender=Fem|Number=Plur|POS=PRON|PronType=Ind , Mood=Sub|Number=Sing|POS=VERB|Person=3|Tense=Pres|VerbForm=Fin , Number=Plur|POS=PRON|PronType=Rel , Gender=Masc|Number=Plur|POS=DET|PronType=Int , Mood=Ind|Number=Plur|POS=AUX|Person=3|Tense=Imp|VerbForm=Fin , AdvType=Tim|POS=NOUN , Gender=Masc|Number=Plur|POS=DET|PronType=Ind , Gender=Fem|Number=Plur|POS=DET|PronType=Ind , Gender=Masc|Number=Sing|POS=DET|PronType=Int , Mood=Cnd|Number=Sing|POS=AUX|Person=3|VerbForm=Fin , Mood=Ind|Number=Sing|POS=VERB|Person=3|Tense=Imp|VerbForm=Fin , Number=Sing|POS=DET|PronType=Art , Gender=Masc|Number=Sing|POS=DET|Person=3|Poss=Yes|PronType=Prs , Case=Acc|Gender=Masc|Number=Sing|POS=PRON|Person=3|PronType=Prs , Gender=Masc|Number=Sing|POS=PRON|PronType=Int , POS=PUNCT|PunctType=Semi , Mood=Cnd|Number=Plur|POS=AUX|Person=3|VerbForm=Fin , Case=Dat|Number=Sing|POS=PRON|Person=3|PronType=Prs , Gender=Masc|NumType=Card|Number=Plur|POS=NUM , Mood=Ind|Number=Sing|POS=AUX|Person=3|Tense=Imp|VerbForm=Fin , Gender=Fem|Number=Sing|POS=PRON|PronType=Ind , Mood=Sub|Number=Sing|POS=AUX|Person=3|Tense=Imp|VerbForm=Fin , NumForm=Digit|POS=SYM , Gender=Masc|Number=Sing|POS=AUX|Tense=Past|VerbForm=Part , Gender=Fem|Number=Sing|POS=PRON|PronType=Int , Gender=Fem|Number=Sing|POS=DET|PronType=Int , POS=PRON|PronType=Int , Gender=Fem|Number=Plur|POS=DET|PronType=Int , Mood=Cnd|Number=Sing|POS=VERB|Person=3|VerbForm=Fin , Mood=Cnd|Number=Plur|POS=VERB|Person=3|VerbForm=Fin , POS=PART , Gender=Fem|Number=Sing|POS=PRON|PronType=Dem , Gender=Masc|Number=Sing|POS=DET|PronType=Tot , Gender=Masc|Number=Plur|POS=PRON|PronType=Dem , POS=ADJ , 
Gender=Masc|Number=Plur|POS=PRON|Person=3|PronType=Prs , Degree=Cmp|Number=Plur|POS=ADJ , POS=PUNCT|PunctType=Dash , Mood=Sub|Number=Sing|POS=AUX|Person=3|Tense=Pres|VerbForm=Fin , Case=Acc|Gender=Fem|Number=Plur|POS=PRON|Person=3|PronType=Prs , Mood=Sub|Number=Sing|POS=VERB|Person=3|Tense=Imp|VerbForm=Fin , Gender=Fem|Number=Plur|POS=VERB|Tense=Past|VerbForm=Part , Gender=Fem|Number=Sing|POS=PRON|Person=3|PronType=Prs , Gender=Masc|POS=NOUN , Mood=Ind|Number=Sing|POS=VERB|Person=3|Tense=Past|VerbForm=Fin , Gender=Fem|Number=Plur|POS=PRON|PronType=Int , Gender=Masc|NumType=Ord|Number=Plur|POS=ADJ , Mood=Ind|Number=Plur|POS=AUX|Person=1|Tense=Fut|VerbForm=Fin , POS=PUNCT|PunctType=Colo , Gender=Masc|NumType=Card|POS=NUM , Gender=Masc|Number=Sing|POS=PRON|Person=3|PronType=Prs , Number=Sing|POS=PRON|PronType=Int , POS=PUNCT|PunctType=Quot , Mood=Imp|Number=Sing|POS=VERB|Person=3|VerbForm=Fin , Gender=Fem|Number=Sing|Number[psor]=Plur|POS=DET|Person=1|Poss=Yes|PronType=Prs , Gender=Masc|Number=Sing|Number[psor]=Plur|POS=DET|Person=1|Poss=Yes|PronType=Prs , Mood=Ind|Number=Plur|POS=VERB|Person=1|Tense=Fut|VerbForm=Fin , POS=AUX|VerbForm=Ger , Gender=Fem|Number=Plur|POS=PRON|Person=3|PronType=Prs , Mood=Imp|Number=Sing|POS=AUX|Person=3|VerbForm=Fin , Number=Plur|POS=PRON|PronType=Ind , Gender=Masc|Number=Sing|POS=PRON|PronType=Dem , Case=Acc,Dat|Number=Sing|POS=PRON|Person=2|Polite=Infm|PrepCase=Npr|PronType=Prs , Gender=Masc|Number=Plur|POS=PRON|PronType=Int , Mood=Ind|Number=Plur|POS=AUX|Person=1|Tense=Pres|VerbForm=Fin , NumForm=Digit|NumType=Frac|POS=NUM , POS=VERB , Gender=Fem|Number=Plur|POS=PRON|PronType=Dem , Gender=Fem|POS=NOUN , Case=Acc,Dat|Number=Sing|POS=PRON|Person=1|PrepCase=Npr|PronType=Prs , Mood=Sub|Number=Plur|POS=VERB|Person=2|Tense=Pres|VerbForm=Fin , Mood=Ind|Number=Plur|POS=AUX|Person=2|Tense=Fut|VerbForm=Fin , Mood=Sub|Number=Plur|POS=AUX|Person=1|Tense=Pres|VerbForm=Fin , Mood=Sub|Number=Plur|POS=AUX|Person=3|Tense=Imp|VerbForm=Fin , 
Number=Plur|POS=PRON|Person=1|PronType=Prs , Mood=Ind|Number=Sing|POS=VERB|Person=1|Tense=Pres|VerbForm=Fin , Case=Nom|Number=Sing|POS=PRON|Person=2|Polite=Infm|PronType=Prs , POS=X , Mood=Cnd|Number=Plur|POS=AUX|Person=1|VerbForm=Fin , Number=Sing|POS=DET|PronType=Dem , POS=DET , Mood=Ind|Number=Sing|POS=VERB|Person=1|Tense=Fut|VerbForm=Fin , Mood=Ind|Number=Sing|POS=AUX|Person=1|Tense=Pres|VerbForm=Fin , POS=DET|PronType=Art , Gender=Masc|Number=Sing|POS=PRON|Person=3|Poss=Yes|PronType=Prs , NumType=Ord|Number=Sing|POS=ADJ , Gender=Fem|Number=Sing|POS=AUX|Tense=Past|VerbForm=Part , Number=Plur|Number[psor]=Plur|POS=DET|Person=1|Poss=Yes|PronType=Prs , Gender=Fem|Number=Plur|POS=AUX|Tense=Past|VerbForm=Part , Gender=Masc|Number=Plur|POS=AUX|Tense=Past|VerbForm=Part , Number=Plur|POS=PRON|PronType=Dem , Mood=Imp|Number=Plur|POS=VERB|Person=1|VerbForm=Fin , POS=PRON|PronType=Ind , Mood=Ind|Number=Sing|POS=VERB|Person=2|Tense=Pres|VerbForm=Fin , Mood=Imp|Number=Plur|POS=VERB|Person=3|VerbForm=Fin , Case=Nom|Number=Sing|POS=PRON|Person=1|PronType=Prs , Case=Acc|Number=Sing|POS=PRON|Person=1|PrepCase=Pre|PronType=Prs , Mood=Ind|Number=Sing|POS=AUX|Person=2|Tense=Pres|VerbForm=Fin , Mood=Ind|Number=Plur|POS=VERB|Person=1|Tense=Imp|VerbForm=Fin , POS=PUNCT|PunctSide=Fin|PunctType=Qest , NumForm=Digit|NumType=Ord|POS=ADJ , Case=Acc|POS=PRON|Person=3|PrepCase=Pre|PronType=Prs|Reflex=Yes , NumForm=Digit|NumType=Frac|POS=SYM , Mood=Ind|Number=Plur|POS=VERB|Person=2|Tense=Pres|VerbForm=Fin , Gender=Masc|Number=Sing|Number[psor]=Sing|POS=DET|Person=2|Poss=Yes|PronType=Prs , Gender=Masc|Number=Plur|POS=PRON|Person=3|Poss=Yes|PronType=Prs , Mood=Sub|Number=Plur|POS=VERB|Person=1|Tense=Pres|VerbForm=Fin , POS=PUNCT|PunctSide=Ini|PunctType=Qest , NumType=Card|Number=Sing|POS=NUM , Foreign=Yes|POS=PRON|PronType=Int , Foreign=Yes|Mood=Ind|POS=VERB|VerbForm=Fin , Foreign=Yes|POS=ADP , Gender=Masc|Number=Sing|POS=PROPN , POS=PUNCT|PunctSide=Ini|PunctType=Excl , 
POS=PUNCT|PunctSide=Fin|PunctType=Excl , Mood=Cnd|Number=Sing|POS=AUX|Person=1|VerbForm=Fin , Number=Plur|POS=PRON|Person=2|Polite=Form|PronType=Prs , Mood=Sub|POS=AUX|Person=1|Tense=Imp|VerbForm=Fin , POS=PUNCT|PunctSide=Ini|PunctType=Comm , POS=PUNCT|PunctSide=Fin|PunctType=Comm , Number=Plur|POS=PRON|Person=2|PronType=Prs , Mood=Ind|Number=Plur|POS=AUX|Person=2|Tense=Pres|VerbForm=Fin , Case=Acc,Dat|Number=Plur|POS=PRON|Person=2|PronType=Prs , Mood=Cnd|Number=Sing|POS=VERB|Person=1|VerbForm=Fin , Mood=Cnd|Number=Plur|POS=VERB|Person=1|VerbForm=Fin , Mood=Ind|Number=Plur|POS=AUX|Person=1|Tense=Imp|VerbForm=Fin , Gender=Masc|Number=Plur|Number[psor]=Sing|POS=DET|Person=1|Poss=Yes|PronType=Prs , Definite=Ind|Gender=Masc|Number=Sing|POS=DET|PronType=Art , Number=Sing|POS=PRON|Person=2|Polite=Form|PronType=Prs , Gender=Masc|Number=Sing|Number[psor]=Sing|POS=DET|Person=1|Poss=Yes|PronType=Prs , Mood=Ind|Number=Sing|POS=VERB|Person=1|Tense=Imp|VerbForm=Fin , POS=VERB|Tense=Past|VerbForm=Part , Mood=Imp|Number=Plur|POS=AUX|Person=3|VerbForm=Fin , Case=Nom|POS=PRON|Person=3|PronType=Prs , Mood=Ind|Number=Sing|POS=AUX|Person=3|Tense=Past|VerbForm=Fin , Gender=Fem|Number=Sing|POS=PRON|Person=3|Poss=Yes|PronType=Prs , Gender=Masc|Number=Sing|POS=PRON|PronType=Rel , Definite=Ind|Number=Sing|POS=DET|PronType=Art , Gender=Masc|Number=Sing|Number[psor]=Plur|POS=PRON|Person=1|Poss=Yes|PronType=Prs , Number=Plur|Number[psor]=Plur|POS=PRON|Person=1|Poss=Yes|PronType=Prs , POS=AUX|Tense=Past|VerbForm=Part , Gender=Fem|NumType=Card|POS=NUM , Mood=Ind|Number=Sing|POS=AUX|Person=1|Tense=Imp|VerbForm=Fin , Mood=Sub|Number=Sing|POS=VERB|Person=1|Tense=Imp|VerbForm=Fin , Gender=Fem|Number=Plur|POS=PRON|Person=3|Poss=Yes|PronType=Prs , Mood=Ind|Number=Sing|POS=AUX|Person=1|Tense=Fut|VerbForm=Fin , Mood=Ind|Number=Plur|POS=AUX|Person=3|Tense=Past|VerbForm=Fin , AdvType=Tim|Degree=Cmp|POS=ADV , Case=Acc|Number=Sing|POS=PRON|Person=2|Polite=Infm|PrepCase=Pre|PronType=Prs , 
POS=DET|PronType=Rel , Definite=Ind|Gender=Fem|Number=Plur|POS=DET|PronType=Art , Mood=Ind|Number=Plur|POS=VERB|Person=2|Tense=Fut|VerbForm=Fin , POS=INTJ , Mood=Sub|Number=Sing|POS=AUX|Person=1|Tense=Pres|VerbForm=Fin , POS=VERB|VerbForm=Fin , Mood=Ind|Number=Plur|POS=VERB|Person=3|Tense=Past|VerbForm=Fin , Definite=Ind|Gender=Fem|Number=Sing|POS=DET|PronType=Art , Mood=Sub|Number=Plur|POS=AUX|Person=1|Tense=Imp|VerbForm=Fin , Gender=Fem|Number=Sing|Number[psor]=Sing|POS=PRON|Person=3|Poss=Yes|PronType=Prs , Mood=Sub|Number=Sing|POS=VERB|Person=2|Tense=Pres|VerbForm=Fin , Case=Acc|POS=PRON|Person=3|PronType=Prs|Reflex=Yes , Foreign=Yes|POS=NOUN , Foreign=Yes|Mood=Ind|Number=Sing|POS=AUX|Person=3|Tense=Pres|VerbForm=Fin , Foreign=Yes|Gender=Masc|Number=Sing|POS=PRON|Person=3|PronType=Prs , Foreign=Yes|POS=SCONJ , Foreign=Yes|Gender=Fem|Number=Sing|POS=DET|PronType=Art , Gender=Masc|POS=SYM , Gender=Fem|Number=Sing|Number[psor]=Sing|POS=DET|Person=2|Poss=Yes|PronType=Prs , Number=Sing|POS=DET|Person=3|Poss=Yes|PronType=Prs , Gender=Masc|Number=Plur|Number[psor]=Sing|POS=DET|Person=2|Poss=Yes|PronType=Prs , Gender=Fem|Number=Sing|POS=PROPN , Mood=Sub|Number=Plur|POS=VERB|Person=1|Tense=Imp|VerbForm=Fin , Definite=Def|Foreign=Yes|Gender=Masc|Number=Sing|POS=DET|PronType=Art , Foreign=Yes|POS=VERB , Foreign=Yes|POS=ADJ , Foreign=Yes|POS=DET , Foreign=Yes|POS=ADV , POS=PUNCT|PunctSide=Fin|Punta d'aignctType=Brck , Degree=Cmp|POS=ADJ , AdvType=Tim|POS=SYM , Number=Plur|POS=DET|PronType=Dem , Mood=Ind|Number=Sing|POS=VERB|Person=2|Tense=Fut|VerbForm=Fin |
parser | ROOT , acl , advcl , advmod , amod , appos , aux , case , cc , ccomp , compound , conj , cop , csubj , dep , det , expl:pass , fixed , flat , iobj , mark , nmod , nsubj , nummod , obj , obl , parataxis , punct , xcomp |
ner | EMAIL , FINANCIAL , ID , LOC , MISC , ORG , PER , TELEPHONE , VEHICLE , ZIP |
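Each morphologizer label in the table is a `|`-separated list of `Key=Value` features, with the part-of-speech tag included as `POS=...`. A small sketch of splitting such a label into a dictionary follows; `parse_morph_label` is a hypothetical helper name, not part of spaCy or the pipeline.

```python
from typing import Dict


def parse_morph_label(label: str) -> Dict[str, str]:
    """Split a 'Key=Value|Key=Value' morphologizer label into a dict."""
    features = {}
    for pair in label.split("|"):
        # partition splits on the first '=' only, so multi-valued
        # features such as 'Case=Acc,Dat' keep their full value.
        key, _, value = pair.partition("=")
        features[key] = value
    return features


example = "Definite=Def|Gender=Masc|Number=Sing|POS=DET|PronType=Art"
parsed = parse_morph_label(example)
print(parsed["POS"], parsed["Gender"], parsed["Number"])
```

Multi-valued features are kept as a single comma-joined string here; splitting them further is left to the caller.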
Accuracy
Type | Score |
---|---|
ENTS_F | 69.12 |
ENTS_P | 74.60 |
ENTS_R | 64.38 |
NER_LOSS | 26573.78 |
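As a quick sanity check on the scores above, and assuming ENTS_F is the usual F1 score (the harmonic mean of precision ENTS_P and recall ENTS_R), the reported value can be recomputed from the other two:

```python
precision = 74.60  # ENTS_P from the table
recall = 64.38     # ENTS_R from the table

# F1 is the harmonic mean of precision and recall.
f1 = 2 * precision * recall / (precision + recall)
print(round(f1, 2))
```

The result agrees with the reported 69.12 to within about 0.01; the small gap is consistent with precision and recall having been rounded before publication.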
License
This project is licensed under the MIT License.