roberta-large-financial-news-sentiment-en開源模型 - 免費分析加拿大金融新聞情緒

首頁

Roberta Large Financial News Sentiment En

由Jean-Baptiste開發

該模型是針對金融新聞（尤其是加拿大新聞）進行情緒分類的微調模型，在混合數據集上訓練完成，特別適用於加拿大金融新聞分析。

文本分類

Transformers

英語開源協議:MIT #金融情緒分析 #加拿大市場優化 #高精度F1-93%

下載量 969

發布時間 : 12/28/2022

模型概述

基於RoBERTa-large架構微調的金融新聞情緒分類模型，專門用於分析金融新聞文本的情緒傾向（負面/中性/正面），在加拿大金融新聞上表現優異。

模型特點

加拿大金融新聞專項優化

額外標註2000篇加拿大金融新聞進行訓練，在該領域F1達83.6%

高質量標註數據

僅保留至少75%標註者達成一致的句子，確保標籤可靠性

三分類精細劃分

區分負面/中性/正面三種情緒狀態，而非簡單二元分類

模型能力

金融文本情緒分析

新聞情緒分類

加拿大市場新聞專項分析

使用案例

金融市場分析

上市公司財報情緒監測

分析企業財報新聞的情緒傾向

可識別'收入增長17%'為正面，'淨收入下降3%'為負面

市場風險預警

檢測破產公告等負面新聞事件

準確識別'申請破產保護'為負面情緒（置信度>93%）

投資決策支持

礦業公司生產報告分析

評估礦業公司生產公告的情緒傾向

正確分類'穩健生產業績'為正面情緒

🚀 用於金融新聞情感分類的roberta - large微調模型（側重加拿大新聞）

本模型基於roberta - large進行微調，用於金融新聞的情感分類，尤其側重於加拿大新聞。它能有效識別金融新聞中的情感傾向，為金融領域的信息分析提供有力支持。

🚀 快速開始

模型介紹

此模型在financial_news_sentiment_mixte_with_phrasebank_75數據集上進行訓練。這是phrasebank數據集的定製版本，其中僅保留了至少75%標註者驗證過的句子。此外，還添加了約2000篇手動驗證的加拿大金融新聞文章。因此，該模型更專門針對加拿大新聞進行了訓練。最終結果顯示，整體F1分數為93.25%，在加拿大新聞上的F1分數為83.6%。

📦 安裝指南

使用HuggingFace加載模型

以下是加載roberta-large-financial-news-sentiment-en模型及其子詞分詞器的代碼示例：

from transformers import AutoTokenizer, AutoModelForSequenceClassification

tokenizer = AutoTokenizer.from_pretrained("Jean-Baptiste/roberta-large-financial-news-sentiment-en")
model = AutoModelForSequenceClassification.from_pretrained("Jean-Baptiste/roberta-large-financial-news-sentiment-en")

處理文本樣本

以下是使用加載好的模型處理文本樣本的代碼示例：

from transformers import pipeline

pipe = pipeline("text-classification", model=model, tokenizer=tokenizer)
pipe("Melcor REIT (TSX: MR.UN) today announced results for the third quarter ended September 30, 2022. Revenue was stable in the quarter and year-to-date. Net operating income was down 3% in the quarter at $11.61 million due to the timing of operating expenses and inflated costs including utilities like gas/heat and power")

[{'label': 'negative', 'score': 0.9399105906486511}]

💻 使用示例

基礎用法

from transformers import AutoTokenizer, AutoModelForSequenceClassification

tokenizer = AutoTokenizer.from_pretrained("Jean-Baptiste/roberta-large-financial-news-sentiment-en")
model = AutoModelForSequenceClassification.from_pretrained("Jean-Baptiste/roberta-large-financial-news-sentiment-en")

from transformers import pipeline

pipe = pipeline("text-classification", model=model, tokenizer=tokenizer)
pipe("Melcor REIT (TSX: MR.UN) today announced results for the third quarter ended September 30, 2022. Revenue was stable in the quarter and year-to-date. Net operating income was down 3% in the quarter at $11.61 million due to the timing of operating expenses and inflated costs including utilities like gas/heat and power")

[{'label': 'negative', 'score': 0.9399105906486511}]