EconoBert
This model, EconoBert, is a fine-tuned version of the pre-trained bert-base-uncased model. It is specifically fine-tuned on an economics-related dataset, aiming to provide better performance for NLP tasks in the fields of economics, politics, and finance.
🚀 Quick Start
This model can be used as a backbone for NLP tasks in the domains of economics, politics, and finance. You can load it using relevant NLP libraries and start fine-tuning or making predictions.
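A minimal loading sketch follows. The repository id samchain/EconoBert is an assumption (the author's Hub handle appears to be samchain, based on the dataset id below); substitute the model's actual id:

```python
from transformers import AutoTokenizer, TFAutoModel

model_id = "samchain/EconoBert"  # hypothetical id - replace with the real one

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = TFAutoModel.from_pretrained(model_id)  # BERT backbone with TF weights

# Encode a domain sentence and take the [CLS] embedding as a feature vector.
inputs = tokenizer("Monetary policy remains accommodative.", return_tensors="tf")
outputs = model(**inputs)
cls_embedding = outputs.last_hidden_state[:, 0, :]  # shape (1, 768)
```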
✨ Features
- Fine-tuned on Economics Data: It is fine-tuned on a dataset specific to the economics domain, making it more suitable for related NLP tasks.
- Good Performance: Achieves an accuracy of 73% on the MLM task and 95% on the NSP task on the test set.
📦 Installation
No specific installation steps are provided in the original document.
💻 Usage Examples
No code examples are provided in the original document; the sketch below illustrates one possible way to query the model.
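The following masked-language-modelling sketch assumes the same hypothetical repository id as above:

```python
import tensorflow as tf
from transformers import AutoTokenizer, TFAutoModelForMaskedLM

model_id = "samchain/EconoBert"  # hypothetical id - substitute the real one

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = TFAutoModelForMaskedLM.from_pretrained(model_id)

text = "The central bank raised interest [MASK] to curb inflation."
inputs = tokenizer(text, return_tensors="tf")
logits = model(**inputs).logits

# Locate the [MASK] position and decode the highest-scoring token.
mask_pos = int(tf.where(inputs["input_ids"][0] == tokenizer.mask_token_id)[0, 0])
predicted_id = int(tf.argmax(logits[0, mask_pos]))
print(tokenizer.decode([predicted_id]))  # e.g. "rates"
```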
📚 Documentation
Model description
The model is a simple fine-tuning of a base BERT model on a dataset specific to the domain of economics. It follows the same architecture as the base model, and no resize_token_embeddings operation was required.
Intended uses & limitations
This model should be used as a backbone for NLP tasks applied to the domains of economics, politics, and finance.
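As a sketch of the backbone use case, a task-specific head can be stacked on top for downstream fine-tuning. The repository id and the 3-label task are assumptions for illustration:

```python
from transformers import TFAutoModelForSequenceClassification

# Attach a fresh classification head on top of the EconoBert backbone,
# e.g. for a hypothetical 3-way sentiment task on financial text.
model = TFAutoModelForSequenceClassification.from_pretrained(
    "samchain/EconoBert",  # hypothetical id
    num_labels=3,
)
```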
Training and evaluation data
The dataset used for fine-tuning is samchain/BIS_Speeches_97_23. It consists of 773k pairs of sentences, with half being negative pairs (sequence B is unrelated to sequence A) and the other half being positive pairs (sequence B follows sequence A). The test set is made up of 136k pairs.
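For illustration only, this is how such a sentence pair is typically packed for BERT's next-sentence-prediction objective; the sentences below are invented, not drawn from the dataset:

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")

# Positive pair: sentence B actually follows sentence A.
sent_a = "Inflation accelerated in the second quarter."
sent_b = "The central bank responded by tightening policy."

# Both sentences are packed into one input; token_type_ids marks the segments.
encoded = tokenizer(sent_a, sent_b, return_tensors="tf")
print(encoded["token_type_ids"])  # 0s for sentence A, 1s for sentence B
```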
Training procedure
The model was fine-tuned for 2 epochs, with a batch size of 64 and a sequence length of 128. The Adam optimizer was used with a learning rate of 1e-5.
Training hyperparameters
| Property | Details |
|----------|---------|
| Optimizer | {'name': 'Adam', 'weight_decay': None, 'clipnorm': None, 'global_clipnorm': None, 'clipvalue': None, 'use_ema': False, 'ema_momentum': 0.99, 'ema_overwrite_frequency': None, 'jit_compile': True, 'is_legacy_optimizer': False, 'learning_rate': 1e-05, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-07, 'amsgrad': False} |
| Training Precision | float32 |
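The serialized optimizer above corresponds roughly to the following Keras construction under the TensorFlow 2.12 API listed in Framework versions (a sketch, not the author's training script):

```python
import tensorflow as tf

# Mirrors the serialized optimizer config from the table above.
optimizer = tf.keras.optimizers.Adam(
    learning_rate=1e-5,
    beta_1=0.9,
    beta_2=0.999,
    epsilon=1e-7,
    amsgrad=False,
    weight_decay=None,
    use_ema=False,
    ema_momentum=0.99,
    jit_compile=True,
)
```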
Training results
The model reaches a loss of 1.6046 on the training set and 1.47 on the test set.
Framework versions
| Property | Details |
|----------|---------|
| Transformers | 4.31.0 |
| TensorFlow | 2.12.0 |
| Datasets | 2.13.1 |
| Tokenizers | 0.13.3 |
🔧 Technical Details
The model is a fine-tuned version of bert-base-uncased. It uses the same architecture as the base model and is trained on a specific economics-related dataset. The training process involves fine-tuning for 2 epochs with the hyperparameters listed above: a batch size of 64, a sequence length of 128, and Adam with a 1e-5 learning rate.
📄 License
This model is released under the Apache 2.0 license.
Citing & Authors
Samuel Chaineau