BERTIS Open-source Text Classification Model - Freely and Accurately Classify Texts into 14 Image Schema Categories

BERTIS

Developed by mireillfares

BERTIS is a text classification model based on the BERT architecture, specifically designed to categorize input texts into 14 predefined image schema categories.

Text Classification

Transformers

English#Image Schema Classification #Cognitive Pattern Recognition #Metaphorical Text Analysis

Downloads 52

Release Time : 6/28/2023

Model Overview

The BERTIS model generates semantically aligned metaphorical gestures through self-supervised text and speech representations, primarily used for text classification tasks to categorize texts into specific image schema categories.

Model Features

Image Schema Classification

Capable of automatically categorizing input texts into 14 predefined image schema categories.

BERT-based Architecture

Utilizes the BERT Base Cased model for fine-tuning, equipped with robust text comprehension capabilities.

High Performance

Achieves an overall accuracy of 0.93, with F1 scores ranging between 0.77 and 1 across categories.

Model Capabilities

Text Classification

Image Schema Recognition

Metaphorical Gesture Generation

Use Cases

Cognitive Linguistics

Metaphor Analysis

Analyzes metaphorical expressions in texts and categorizes them into specific image schema categories.

Highly accurate classification results, aiding in understanding cognitive patterns in language.

Human-Computer Interaction

Semantically Aligned Gesture Generation

Generates semantically aligned metaphorical gestures based on text content.

Enhances the naturalness and expressiveness of human-computer interaction.

🚀 Model Card for BERTIS Model

This README offers an overview of BERTIS, including its purpose and usage. BERTIS is introduced in the paper "META4: Semantically-Aligned Generation of metaphoric gestures using self-supervised text and speech representations". If you find this work useful, please consider citing our paper.

Bibtex:

@article{fares2023META4,
  title={META4: Semantically-Aligned Generation of metaphoric gestures using self-supervised text and speech representations},
  author={Fares, Mireille and Pelachaud, Catherine and Obin, Nicolas},
  journal={arXiv preprint arXiv:2311.05481},
  year={2023}
}

✨ Features

BERTIS Model Description

BERTIS (BERT-based Image Schema) is a computational model designed to classify input texts into specific Image Schema classes. Image Schemas are cognitive patterns that play a fundamental role in shaping the way humans conceptualize and reason about various concepts present in language. BERTIS is built upon the BERT architecture and fine-tuned using a specialized corpus created by Wachowiak et al. (2022).

BERTIS Image Schema Classes

The main purpose of BERTIS is to automatically categorize input texts into predefined Image Schema classes. BERTIS considers 14 distinct Image Schema classes, each capturing a specific cognitive pattern. These classes and their corresponding examples (taken from Wachowiak et al. (2022)) are as follows:

CENTER-PERIPHERY: She brushed the thought away.
CONTACT: That blew me away.
CONTAINMENT: Keep it in the back of your mind.
COVERING: His judgement is clouded.
FORCE: They are attracted to each other.
LINK: Breaking social ties.
OBJECT: Seize the opportunity.
PART-WHOLE: They assembled a theory.
SCALE: This class is bigger than that one.
SOURCE_PATH_GOAL: The time for action has arrived.
SPLITTING: What separates the men from the boys?
SUBSTANCE: Emotions are tinged with suffuse.
VERTICALITY: No known spoken language uses the lateral axis for time.
SUPPORT: The poor in our country need a boost up.

📦 Installation

No installation steps are provided in the original document.

💻 Usage Examples

No code examples are provided in the original document.

📚 Documentation

Training and Testing Details

The data used for training, validating, and testing BERTIS can be found at: https://github.com/lwachowiak/Systematic-Analysis-of-Image-Schemas-through-Explainable-Multilingual-Language-Models/blob/main/Data/Image%20Schemas%20English%20and%20German.csv.

80% of the data was used for training BERTIS, 10% for validation, and 10% for testing.

Evaluation

Metrics

Precision: measures the proportion of correctly predicted positive instances out of all instances predicted as positive. It focuses on the accuracy of positive predictions.
Recall: measures the proportion of correctly predicted positive instances out of all actual positive instances. It focuses on the ability to capture positive instances.
F1-score: the harmonic mean of precision and recall, providing a balanced measure that combines both metrics.

Results

Class	Precision	Recall	F1-score
"CENTER-PERIPHERY"	0.98	0.94	0.96
"CONTACT"	1	1	1
"CONTAINMENT"	0.74	0.67	0.7
"COVERING"	1	1	1
"FORCE"	0.8	0.91	0.85
"LINK"	1	1	1
"OBJECT"	0.81	0.83	0.82
"PART-WHOLE"	1	1	1
"SCALE"	1	1	1
"SOURCE_PATH_GOAL"	0.81	0.74	0.77
"SPLITTING"	1	1	1
"SUBSTANCE"	1	1	1
"SUPPORT"	1	1	1
"VERTICALITY"	0.88	0.93	0.90

Overall Accuracy: 0.93

For all Image Schema classes, the F1-score ranges from 0.77 to 1, indicating that BERTIS performs well in terms of both accurately predicting the correct Image Schema classes (precision) and capturing as many correct Image Schema classes as possible (recall). These results are also reflected in the precision and recall scores for all Image Schema classes. We observe a high precision (between 0.74 and 1) for all classes, which indicates the number of correctly predicted Image Schema classes. The recall scores are above 0.74 for most classes, with one Image Schema class having a recall score of 0.67. The overall accuracy of BERTIS with respect to all Image Schema classes is 0.93, indicating high accuracy and good performance in classifying the input text into the correct Image Schema classes.

🔧 Technical Details

The model is based on the BERT architecture and fine-tuned using a specialized corpus. The data used for training, validation, and testing is publicly available, and the distribution of the data for different purposes is clearly defined.

📄 License

The model is released under the Academic Free License (AFL-3.0).

Information Table

Property	Details
Model Type	BERT-based Image Schema (BERTIS)
Training Data	https://github.com/lwachowiak/Systematic-Analysis-of-Image-Schemas-through-Explainable-Multilingual-Language-Models/blob/main/Data/Image%20Schemas%20English%20and%20German.csv
License	Academic Free License (AFL-3.0)
Finetuned from model	BERT Base Cased model

Developer Information

Developed by: Mireille Fares Language(s) (NLP): English, can generalize to other languages such as German

Featured Recommended AI Models

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご