Model Overview
Model Features
Model Capabilities
Use Cases
🚀 DaCy medium
DaCy is a Danish language processing framework that offers state - of - the - art pipelines and functionality for analyzing Danish pipelines. Its largest pipeline has achieved top - notch performance in parts - of - speech tagging and dependency parsing for Danish on the Danish Dependency treebank. It also shows competitive performance in named entity recognition, named entity disambiguation, and coreference resolution. To learn more, visit the [DaCy repository](https://github.com/centre - for - humanities - computing/DaCy) for usage instructions and result reproduction materials. DaCy also includes package usage guides and behavioral tests for biases and robustness of Danish NLP pipelines.
✨ Features
- Linguistic Analysis: Capable of token - classification, POS tagging, morphological analysis, lemmatization, dependency parsing, named entity recognition, coreference resolution, named entity linking, and named entity disambiguation.
- High - performance: Achieves excellent metrics on various tasks and datasets.
📚 Documentation
Model Information
Property | Details |
---|---|
Model Type | da_dacy_medium_trf |
Version | 0.2.0 |
spaCy Compatibility | >=3.5.2,<3.6.0 |
Default Pipeline | transformer , tagger , morphologizer , trainable_lemmatizer , parser , ner , coref , span_resolver , span_cleaner , entity_linker |
Components | transformer , tagger , morphologizer , trainable_lemmatizer , parser , ner , coref , span_resolver , span_cleaner , entity_linker |
Vectors | 0 keys, 0 unique vectors (0 dimensions) |
Sources | [UD Danish DDT v2.11](https://github.com/UniversalDependencies/UD_Danish - DDT) (Johannsen, Anders; Martínez Alonso, Héctor; Plank, Barbara) DaNE (Rasmus Hvingelby, Amalie B. Pauli, Maria Barrett, Christina Rosted, Lasse M. Lidegaard, Anders Søgaard) DaCoref (Buch - Kromann, Matthias) [DaNED](https://danlp - alexandra.readthedocs.io/en/stable/docs/datasets.html#daned) (Barrett, M. J., Lam, H., Wu, M., Lacroix, O., Plank, B., & Søgaard, A.) vesteinn/DanskBERT (Vésteinn Snæbjarnarson) |
License | Apache - 2.0 |
Author | Kenneth Enevoldsen |
Label Scheme
View label scheme (211 labels for 4 components)
Component | Labels |
---|---|
tagger |
ADJ , ADP , ADV , AUX , CCONJ , DET , INTJ , NOUN , NUM , PART , PRON , PROPN , PUNCT , SCONJ , SYM , VERB , X |
morphologizer |
AdpType=Prep|POS=ADP , Definite=Ind|Gender=Com|Number=Sing|POS=NOUN , Mood=Ind|POS=AUX|Tense=Pres|VerbForm=Fin|Voice=Act , POS=PROPN , Definite=Ind|Number=Sing|POS=VERB|Tense=Past|VerbForm=Part , Definite=Def|Gender=Neut|Number=Sing|POS=NOUN , POS=SCONJ , Definite=Def|Gender=Com|Number=Sing|POS=NOUN , Mood=Ind|POS=VERB|Tense=Pres|VerbForm=Fin|Voice=Act , POS=ADV , Number=Plur|POS=DET|PronType=Dem , Degree=Pos|Number=Plur|POS=ADJ , Definite=Ind|Gender=Com|Number=Plur|POS=NOUN , POS=PUNCT , NumType=Ord|POS=ADJ , POS=CCONJ , Definite=Ind|Gender=Neut|Number=Plur|POS=NOUN , POS=VERB|VerbForm=Inf|Voice=Act , Case=Acc|Gender=Neut|Number=Sing|POS=PRON|Person=3|PronType=Prs , Degree=Sup|POS=ADV , Degree=Pos|POS=ADV , Gender=Com|Number=Sing|POS=DET|PronType=Ind , Number=Plur|POS=DET|PronType=Ind , POS=ADP , POS=ADV|PartType=Inf , Case=Nom|Gender=Com|Number=Sing|POS=PRON|Person=3|PronType=Prs , Mood=Ind|POS=AUX|Tense=Past|VerbForm=Fin|Voice=Act , Definite=Def|Degree=Pos|Number=Sing|POS=ADJ , Number[psor]=Sing|POS=DET|Person=3|Poss=Yes|PronType=Prs , Mood=Ind|POS=VERB|Tense=Past|VerbForm=Fin|Voice=Act , POS=ADP|PartType=Inf , Definite=Ind|Degree=Pos|Gender=Com|Number=Sing|POS=ADJ , NumType=Card|POS=NUM , Degree=Pos|POS=ADJ , Definite=Ind|Number=Sing|POS=AUX|Tense=Past|VerbForm=Part , POS=PART|PartType=Inf , Case=Acc|POS=PRON|Person=3|PronType=Prs|Reflex=Yes , Definite=Def|Gender=Com|Number=Plur|POS=NOUN , Definite=Ind|Gender=Neut|Number=Sing|POS=NOUN , Number[psor]=Plur|POS=DET|Person=3|Poss=Yes|PronType=Prs , POS=VERB|Tense=Pres|VerbForm=Part , Case=Nom|Number=Plur|POS=PRON|Person=3|PronType=Prs , Case=Gen|Definite=Def|Gender=Com|Number=Sing|POS=NOUN , Definite=Def|Degree=Sup|Number=Plur|POS=ADJ , Case=Acc|Number=Plur|POS=PRON|Person=3|PronType=Prs , POS=AUX|VerbForm=Inf|Voice=Act , Definite=Ind|Degree=Pos|Gender=Neut|Number=Sing|POS=ADJ , Definite=Ind|Degree=Cmp|Number=Sing|POS=ADJ , Degree=Cmp|POS=ADJ , POS=PRON|PartType=Inf , Definite=Ind|Degree=Pos|Number=Sing|POS=ADJ , Case=Nom|Gender=Com|POS=PRON|PronType=Ind , Number=Plur|POS=PRON|PronType=Ind , POS=INTJ , Gender=Com|Number=Sing|POS=DET|PronType=Dem , Case=Gen|Number=Plur|POS=DET|PronType=Ind , Mood=Ind|POS=VERB|Tense=Pres|VerbForm=Fin|Voice=Pass , Definite=Def|Gender=Neut|Number=Plur|POS=NOUN , Degree=Cmp|POS=ADV , Number=Plur|Number[psor]=Plur|POS=PRON|Person=1|Poss=Yes|PronType=Prs|Style=Form , Case=Acc|Gender=Com|Number=Sing|POS=PRON|Person=3|PronType=Prs , Number=Plur|Number[psor]=Sing|POS=DET|Person=3|Poss=Yes|PronType=Prs|Reflex=Yes , Case=Gen|POS=PROPN , Gender=Neut|Number=Sing|POS=PRON|PronType=Ind , Number=Plur|POS=VERB|Tense=Past|VerbForm=Part , Gender=Neut|Number=Sing|Number[psor]=Sing|POS=DET|Person=3|Poss=Yes|PronType=Prs|Reflex=Yes , Case=Acc|Gender=Com|Number=Sing|POS=PRON|Person=1|PronType=Prs , Definite=Def|Degree=Sup|POS=ADJ , Gender=Neut|Number=Sing|POS=DET|PronType=Ind , Case=Gen|Definite=Ind|Gender=Neut|Number=Sing|POS=NOUN , Gender=Neut|Number=Sing|POS=DET|PronType=Dem , Definite=Def|Number=Sing|POS=VERB|Tense=Past|VerbForm=Part , POS=PRON|PronType=Dem , Degree=Pos|Gender=Com|Number=Sing|POS=ADJ , Number=Plur|POS=NUM , POS=VERB|VerbForm=Inf|Voice=Pass , Definite=Def|Degree=Sup|Number=Sing|POS=ADJ , Number=Sing|POS=PRON|PronType=Int,Rel , Case=Nom|Gender=Com|Number=Sing|POS=PRON|Person=1|PronType=Prs , Gender=Neut|Number=Sing|Number[psor]=Sing|POS=DET|Person=1|Poss=Yes|PronType=Prs , Gender=Com|Number=Sing|Number[psor]=Sing|POS=DET|Person=1|Poss=Yes|PronType=Prs , POS=PRON , Definite=Ind|Number=Sing|POS=NOUN , Definite=Ind|Number=Sing|POS=NUM , Case=Gen|Definite=Ind|Gender=Com|Number=Sing|POS=NOUN , Foreign=Yes|POS=ADV , POS=NOUN , Case=Gen|Definite=Def|Gender=Neut|Number=Sing|POS=NOUN , Gender=Com|Number=Plur|POS=NOUN , Gender=Neut|Number=Sing|POS=PRON|PronType=Int,Rel , Case=Nom|Gender=Com|Number=Plur|POS=PRON|Person=1|PronType=Prs , Number[psor]=Plur|POS=DET|Person=1|Poss=Yes|PronType=Prs , Gender=Com|Number=Sing|POS=PRON|PronType=Ind , Case=Gen|Definite=Ind|Gender=Com|Number=Plur|POS=NOUN , Degree=Pos|Gender=Neut|Number=Sing|POS=ADJ , Degree=Sup|POS=ADJ , Degree=Pos|Number=Sing|POS=ADJ , Mood=Imp|POS=VERB , Case=Nom|Gender=Com|POS=PRON|Person=2|Polite=Form|PronType=Prs , Case=Acc|Gender=Com|POS=PRON|Person=2|Polite=Form|PronType=Prs , POS=X , Case=Gen|Definite=Def|Gender=Com|Number=Plur|POS=NOUN , Number=Plur|POS=PRON|PronType=Dem , Case=Acc|Gender=Com|Number=Plur|POS=PRON|Person=1|PronType=Prs , Number=Plur|POS=PRON|PronType=Int,Rel , Gender=Com|Number=Sing|Number[psor]=Sing|POS=DET|Person=3|Poss=Yes|PronType=Prs|Reflex=Yes , Degree=Cmp|Number=Plur|POS=ADJ , Number=Plur|Number[psor]=Sing|POS=DET|Person=1|Poss=Yes|PronType=Prs , Gender=Com|Number=Sing|Number[psor]=Plur|POS=DET|Person=1|Poss=Yes|PronType=Prs|Style=Form , Case=Nom|Gender=Com|Number=Sing|POS=PRON|Person=2|PronType=Prs , Case=Acc|Gender=Com|Number=Sing|POS=PRON|Person=2|PronType=Prs , Gender=Com|POS=PRON|PronType=Int,Rel , Case=Gen|Degree=Pos|Number=Plur|POS=ADJ , Gender=Neut|Number=Sing|Number[psor]=Sing|POS=PRON|Person=3|Poss=Yes|PronType=Prs|Reflex=Yes , POS=VERB|VerbForm=Ger , Gender=Com|Number=Sing|POS=PRON|PronType=Dem , Case=Gen|POS=PRON|PronType=Int,Rel , Mood=Ind|POS=VERB|Tense=Past|VerbForm=Fin|Voice=Pass , Abbr=Yes|POS=X , Case=Gen|Definite=Ind|Gender=Neut|Number=Plur|POS=NOUN , Gender=Com|Number=Sing|Number[psor]=Sing|POS=DET|Person=2|Poss=Yes|PronType=Prs , Definite=Ind|Number=Plur|POS=NOUN , Foreign=Yes|POS=X , Number=Plur|POS=PRON|PronType=Rcp , Case=Nom|Gender=Com|Number=Plur|POS=PRON|Person=2|PronType=Prs , Case=Gen|Degree=Cmp|POS=ADJ , Case=Gen|Definite=Def|Gender=Neut|Number=Plur|POS=NOUN , Case=Acc|Gender=Com|Number=Plur|POS=PRON|Person=2|PronType=Prs , Gender=Neut|Number=Sing|POS=PRON|PronType=Dem , Number=Plur|Number[psor]=Plur|POS=DET|Person=1|Poss=Yes|PronType=Prs|Style=Form , Gender=Neut|Number=Sing|Number[psor]=Plur|POS=DET|Person=1|Poss=Yes|PronType=Prs|Style=Form , Number=Plur|Number[psor]=Sing|POS=PRON|Person=3|Poss=Yes|PronType=Prs|Reflex=Yes , Number[psor]=Sing|POS=PRON|Person=3|Poss=Yes|PronType=Prs , Case=Gen|Number=Plur|POS=PRON|PronType=Rcp , POS=DET|Person=2|Polite=Form|Poss=Yes|PronType=Prs , POS=SYM , POS=DET|PronType=Dem , Gender=Com|Number=Sing|POS=NUM , Number[psor]=Plur|POS=DET|Person=2|Poss=Yes|PronType=Prs , Case=Gen|Number=Plur|POS=VERB|Tense=Past|VerbForm=Part , Definite=Def|Degree=Abs|POS=ADJ , POS=VERB|Tense=Pres , Definite=Ind|Gender=Neut|Number=Sing|POS=NUM , Degree=Abs|POS=ADV , Case=Gen|Definite=Def|Degree=Pos|Number=Sing|POS=ADJ , Gender |
Performance Metrics
Task | Metric | Value | Dataset |
---|---|---|---|
NER | NER Precision | 0.8708487085 | DaNE (test split) |
NER | NER Recall | 0.8458781362 | DaNE (test split) |
NER | NER F Score | 0.8581818182 | DaNE (test split) |
TAG | TAG (XPOS) Accuracy | 0.9847290149 | UD Danish DDT (test split) |
POS | POS (UPOS) Accuracy | 0.985677928 | UD Danish DDT (test split) |
MORPH | Morph (UFeats) Accuracy | 0.9814371257 | UD Danish DDT (test split) |
LEMMA | Lemma Accuracy | 0.9419805438 | UD Danish DDT (test split) |
UNLABELED_DEPENDENCIES | Unlabeled Attachment Score (UAS) | 0.9083920564 | UD Danish DDT (test split) |
LABELED_DEPENDENCIES | Labeled Attachment Score (LAS) | 0.883349834 | UD Danish DDT (test split) |
SENTS | Sentences F - Score | 0.9885462555 | UD Danish DDT (test split) |
coreference - resolution | LEA | 0.4118366346 | DaCoref (custom split) |
coreference - resolution | Named entity Linking Precision | 0.9923076923 | DaNED (custom split) |
coreference - resolution | Named entity Linking Recall | 0.671875 | DaNED (custom split) |
coreference - resolution | Named entity Linking F Score | 0.801242236 | DaNED (custom split) |
📄 License
This project is licensed under the Apache - 2.0
license.






