Anglicisms Spanish Flair Cs

Developed by lirondos
A pre-trained model for detecting unassimilated English lexical borrowings in Spanish news, capable of identifying foreign words such as 'fake news' and 'machine learning'.
Downloads 8,115
Release date: 3/29/2022

Model Overview

This is a BiLSTM-CRF model designed to detect unassimilated lexical borrowings (mainly from English) in Spanish text, such as *fake news* and *machine learning*.
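A minimal usage sketch with the Flair library follows. It assumes that Flair is installed and that the model is published under the identifier `lirondos/anglicisms-spanish-flair-cs`. That identifier is inferred from the developer and model names on this page, not stated on it. The helper names `load_tagger` and `find_borrowings` are hypothetical.

```python
# Assumed model identifier, inferred from the developer and model names.
MODEL_ID = "lirondos/anglicisms-spanish-flair-cs"


def load_tagger(model_id=MODEL_ID):
    """Load the sequence tagger (downloads the weights on first use)."""
    # Imported lazily so this module can be read without Flair installed.
    from flair.models import SequenceTagger

    return SequenceTagger.load(model_id)


def find_borrowings(tagger, text):
    """Return (span text, tag) pairs for the borrowings predicted in `text`."""
    from flair.data import Sentence

    sentence = Sentence(text)
    tagger.predict(sentence)
    # Span attribute names may differ slightly across Flair versions.
    return [(span.text, span.tag) for span in sentence.get_spans()]
```

Calling `find_borrowings(load_tagger(), "Las fake news se emiten en prime time.")` should return the detected spans with their ENG or OTHER tags. The exact output depends on the model and the Flair version.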

Model Features

Multilingual lexical borrowing detection
Capable of identifying unassimilated English lexical borrowings (ENG tag) and borrowings from other languages (OTHER tag) in Spanish.
Pre-trained on code-switching data
The model is fed Transformer-based embeddings pre-trained on code-switched data, which improves its handling of mixed-language text.
Highly challenging test set
The test set covers sources and dates not seen in the training set and contains a large number of out-of-vocabulary (OOV) words: 92% of the borrowed words are OOV.
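The OOV figure above can be read as the share of test-set borrowings that never appear in the training set. The sketch below illustrates that calculation under this assumed definition. The function name `oov_rate` and the toy word lists are invented for illustration and are not taken from the actual dataset.

```python
def oov_rate(train_borrowings, test_borrowings):
    """Fraction of test borrowings not seen (case-insensitively) in training."""
    if not test_borrowings:
        return 0.0
    seen = {b.lower() for b in train_borrowings}
    unseen = [b for b in test_borrowings if b.lower() not in seen]
    return len(unseen) / len(test_borrowings)


# Toy data, made up for illustration only.
train = ["fake news", "streaming", "look"]
test = ["prime time", "machine learning", "look", "celebrity"]

# Three of the four test borrowings are unseen in the toy training list.
print(f"{oov_rate(train, test):.0%}")
```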

Model Capabilities

Identifying English loanwords in Spanish
Identifying loanwords from other languages in Spanish
Recognizing multi-word borrowings
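Multi-word borrowings such as "fake news" are typically recovered by grouping per-token tags into spans. The sketch below assumes a BIO tagging scheme (B-ENG, I-ENG, B-OTHER, I-OTHER, O), which is a common convention but is not confirmed on this page. The function `decode_bio` is a hypothetical helper, not part of the model.

```python
def decode_bio(tokens, tags):
    """Group BIO-tagged tokens into (text, label) spans."""
    spans = []
    current_tokens, current_label = [], None

    def flush():
        # Close the open span, if any.
        if current_tokens:
            spans.append((" ".join(current_tokens), current_label))

    for token, tag in zip(tokens, tags):
        prefix, _, label = tag.partition("-")
        if prefix == "B" or (prefix == "I" and label != current_label):
            # A B- tag (or an I- tag with no matching open span) starts a span.
            flush()
            current_tokens, current_label = [token], label
        elif prefix == "I":
            current_tokens.append(token)
        else:
            # An O tag ends any open span.
            flush()
            current_tokens, current_label = [], None
    flush()
    return spans


tokens = ["Las", "fake", "news", "se", "emiten", "en", "prime", "time"]
tags = ["O", "B-ENG", "I-ENG", "O", "O", "O", "B-ENG", "I-ENG"]
print(decode_bio(tokens, tags))
# [('fake news', 'ENG'), ('prime time', 'ENG')]
```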

Use Cases

News media analysis
Detecting English loanwords in news
Analyzing English words used in Spanish news, such as 'fake news' and 'prime time'.
Precision 90.16%, recall 84.34%, F1-score 87.16% (ENG tag)
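As a quick consistency check, the F1-score is the harmonic mean of precision and recall. The sketch below recomputes it from the two rounded figures above; the function name `f1_score` is illustrative.

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall (same units as the inputs)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


f1 = f1_score(90.16, 84.34)
print(f"{f1:.2f}")
```

The result comes out at roughly 87.15, within rounding distance of the reported 87.16%. The small gap is consistent with precision and recall having been rounded before publication.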
Linguistic research
Lexical borrowing research
Used to study the distribution and trends of unassimilated lexical borrowings in Spanish.