roberta-large-financial-news-sentiment-en open-source model: free sentiment analysis for Canadian financial news

Roberta Large Financial News Sentiment En

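Developed by Jean-Baptiste

This model is fine-tuned to classify the sentiment of financial news. It was trained on a mixed dataset that includes Canadian articles, so it is best suited to Canadian financial news.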

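Text classification

Transformers

Language: English. License: MIT. Tags: #financial-sentiment-analysis #canadian-market-optimized #high-accuracy-F1-93%

Downloads: 969

Released: 12/28/2022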

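Model overview

A sentiment classifier for financial news, fine-tuned from the RoBERTa-large architecture. It labels financial news text as negative, neutral, or positive and reaches an F1 score of 83.6% on Canadian financial news.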

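Model features

Optimized for Canadian financial news

About 2,000 manually verified Canadian financial news articles were added to the training data; the model reaches an F1 score of 83.6% on Canadian news.

High-quality labeled data

Only sentences on which at least 75% of annotators agreed are kept, which makes the labels more reliable.

Three-class output

Distinguishes negative, neutral, and positive sentiment rather than a simple binary split.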

Model capabilities

Financial text sentiment analysis

News sentiment classification

Dedicated analysis of Canadian market news

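Use cases

Financial market analysis

Earnings sentiment monitoring for listed companies

Analyze the sentiment of news about corporate earnings reports.

For example, "revenue up 17%" is classified as positive and "net income down 3%" as negative.

Market risk alerts

Detect negative news events such as bankruptcy announcements.

Classifies "filed for bankruptcy protection" as negative (confidence > 93%).

Investment decision support

Mining company production report analysis

Assess the sentiment of mining company production announcements.

Classifies "solid production results" as positive.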
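
The risk-alert use case above amounts to filtering classifier outputs by label and score. The following is a minimal sketch, assuming results shaped like the dictionaries a transformers text-classification pipeline returns (with `label` and `score` keys). The helper name `flag_negative` and the 0.9 default threshold are illustrative choices, not part of the model.

```python
from typing import Dict, List, Tuple


def flag_negative(
    texts: List[str],
    results: List[Dict[str, object]],
    threshold: float = 0.9,  # illustrative cut-off, not a value from the model card
) -> List[Tuple[str, float]]:
    """Return (text, score) pairs predicted 'negative' with score >= threshold."""
    flagged = []
    for text, result in zip(texts, results):
        score = float(result["score"])
        if result["label"] == "negative" and score >= threshold:
            flagged.append((text, score))
    # Highest-confidence alerts first.
    return sorted(flagged, key=lambda pair: pair[1], reverse=True)
```

A stricter threshold reduces false alarms at the cost of missing weaker negative signals; the right value depends on the application and should be tuned on held-out data.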

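🚀 A roberta-large model fine-tuned for financial news sentiment classification (with a focus on Canadian news)

This model is fine-tuned from roberta-large for sentiment classification of financial news, with a particular focus on Canadian news. It labels the sentiment of a financial news passage as negative, neutral, or positive.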

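🚀 Quick start

Model introduction

The model was trained on the financial_news_sentiment_mixte_with_phrasebank_75 dataset. This is a customized version of the phrasebank dataset that keeps only sentences validated by at least 75% of annotators. About 2,000 manually validated Canadian financial news articles were then added, so the model is more specifically trained for Canadian news. The final results are an overall F1 score of 93.25% and an F1 score of 83.6% on Canadian news.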
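
The 75% agreement rule described above can be sketched in a few lines. This is only an illustration of the idea, not the dataset author's actual preprocessing: the function name `filter_by_agreement`, the input layout (a mapping from sentence to the list of labels given by its annotators), and the sample sentences are all assumptions.

```python
from collections import Counter
from typing import Dict, List, Tuple


def filter_by_agreement(
    annotations: Dict[str, List[str]],
    min_agreement: float = 0.75,
) -> List[Tuple[str, str]]:
    """Keep sentences whose most common label was chosen by at least
    `min_agreement` of the annotators; return (sentence, label) pairs."""
    kept = []
    for sentence, labels in annotations.items():
        if not labels:
            continue  # no annotations, nothing to agree on
        label, votes = Counter(labels).most_common(1)[0]
        if votes / len(labels) >= min_agreement:
            kept.append((sentence, label))
    return kept


# Hypothetical annotations, for illustration only.
sample = {
    "Revenue rose 17% year over year.": ["positive"] * 4,
    "Results were mixed across segments.": ["neutral", "neutral", "positive", "negative"],
}
kept_sentences = filter_by_agreement(sample)
```

Here the first sentence is kept (4 of 4 annotators agree) and the second is dropped (only 2 of 4 agree on the most common label).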

📦 Installation guide

Loading the model with HuggingFace

The following example loads the roberta-large-financial-news-sentiment-en model and its subword tokenizer:

from transformers import AutoTokenizer, AutoModelForSequenceClassification

tokenizer = AutoTokenizer.from_pretrained("Jean-Baptiste/roberta-large-financial-news-sentiment-en")
model = AutoModelForSequenceClassification.from_pretrained("Jean-Baptiste/roberta-large-financial-news-sentiment-en")

Processing a text sample

The following example runs a text sample through the loaded model:

from transformers import pipeline

pipe = pipeline("text-classification", model=model, tokenizer=tokenizer)
pipe("Melcor REIT (TSX: MR.UN) today announced results for the third quarter ended September 30, 2022. Revenue was stable in the quarter and year-to-date. Net operating income was down 3% in the quarter at $11.61 million due to the timing of operating expenses and inflated costs including utilities like gas/heat and power")

[{'label': 'negative', 'score': 0.9399105906486511}]
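
The call above returns only the top label for one text. When several articles need to be scored, or the full score distribution over the three classes is of interest, a small wrapper can help. The sketch below assumes a recent transformers release in which the text-classification pipeline accepts a list of texts, `top_k=None` (return scores for every label), and `truncation=True` (cut over-long inputs). The helper names `load_pipeline` and `classify_batch` are hypothetical.

```python
from typing import Callable, Dict, List


def load_pipeline():
    """Build the text-classification pipeline (downloads the model on first use)."""
    # Imported lazily so classify_batch can also be used with any compatible callable.
    from transformers import pipeline

    return pipeline(
        "text-classification",
        model="Jean-Baptiste/roberta-large-financial-news-sentiment-en",
    )


def classify_batch(pipe: Callable, texts: List[str]) -> List[Dict[str, object]]:
    """Classify several texts and keep the score of every label for each one."""
    # top_k=None asks for all label scores instead of only the best one;
    # truncation=True shortens inputs that exceed the model's maximum length.
    all_scores = pipe(texts, top_k=None, truncation=True)
    rows = []
    for text, scores in zip(texts, all_scores):
        by_label = {item["label"]: item["score"] for item in scores}
        best = max(by_label, key=by_label.get)
        rows.append({"text": text, "label": best, "scores": by_label})
    return rows
```

With the real model, this would be called as `classify_batch(load_pipeline(), list_of_articles)`. Keeping the per-label scores makes it possible to spot borderline cases, for example a text where the negative and neutral scores are close.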
