开源autogluon-chronos-t5-large模型 - 高效进行时间序列预测

首页

Autogluon Chronos T5 Large

由 anchovy 开发

Chronos是基于语言模型架构的预训练时间序列预测模型家族，通过量化和缩放将时间序列转化为标记序列进行训练。

气候模型

Safetensors

开源协议:Apache-2.0 #时间序列预测 #语言模型架构 #概率预测

下载量 25

发布时间 : 8/30/2024

模型简介

Chronos-T5是一个基于T5架构的时间序列预测模型，通过将时间序列数据转换为标记序列并使用交叉熵损失进行训练，能够生成概率预测。

模型特点

语言模型架构

采用类似语言模型的架构处理时间序列数据，将数值序列转化为标记序列进行训练和预测。

概率预测

通过多次采样生成多条未来轨迹，提供概率预测而非单点预测。

大规模预训练

在大量公开时间序列数据及合成数据上进行预训练，具有广泛适用性。

多尺寸选择

提供从800万到7.1亿参数的不同规模模型，适应不同计算需求。

模型能力

时间序列预测

概率预测

多步预测

不确定性量化

使用案例

商业预测

销售预测

预测未来商品销售趋势

可提供未来销售的中位数预测及置信区间

库存管理

预测产品需求变化

帮助优化库存水平，减少缺货或过剩

经济金融

股票价格预测

预测金融时间序列走势

提供未来价格的概率分布

工业应用

设备维护预测

预测设备故障时间点

提前安排维护，减少停机时间

🚀 Chronos-T5 (Large)

Chronos是一系列基于语言模型架构的预训练时间序列预测模型。通过缩放和量化将时间序列转换为一系列标记，然后使用交叉熵损失在这些标记上训练语言模型。训练完成后，在给定历史上下文的情况下，通过对多个未来轨迹进行采样来获得概率预测。Chronos模型已经在大量公开可用的时间序列数据以及使用高斯过程生成的合成数据上进行了训练。

如需了解Chronos模型、训练数据和流程以及实验结果的详细信息，请参考论文Chronos: Learning the Language of Time Series。

图1：Chronos的高层描述。(左) 对输入的时间序列进行缩放和量化，以获得一系列标记。(中) 将标记输入到语言模型中，该模型可以是编码器 - 解码器模型，也可以是仅解码器模型。使用交叉熵损失对模型进行训练。(右) 在推理过程中，我们自回归地从模型中采样标记，并将它们映射回数值。对多个轨迹进行采样以获得预测分布。

🚀 快速开始

Chronos是基于语言模型架构的预训练时间序列预测模型家族，能将时间序列转换为标记序列进行训练，并通过采样获得概率预测。

✨ 主要特性

基于语言模型架构，将时间序列转换为标记序列进行训练。
在大量公开时间序列数据和合成数据上训练。
通过采样多个未来轨迹获得概率预测。

📦 安装指南

要使用Chronos模型进行推理，需安装相关包，可通过以下命令安装：

pip install git+https://github.com/amazon-science/chronos-forecasting.git

💻 使用示例

基础用法

以下是一个使用Chronos模型进行推理的最小示例：

import matplotlib.pyplot as plt
import numpy as np
import pandas as pd
import torch
from chronos import ChronosPipeline

pipeline = ChronosPipeline.from_pretrained(
  "amazon/chronos-t5-large",
  device_map="cuda",
  torch_dtype=torch.bfloat16,
)

df = pd.read_csv("https://raw.githubusercontent.com/AileenNielsen/TimeSeriesAnalysisWithPython/master/data/AirPassengers.csv")

# context must be either a 1D tensor, a list of 1D tensors,
# or a left-padded 2D tensor with batch as the first dimension
context = torch.tensor(df["#Passengers"])
prediction_length = 12
forecast = pipeline.predict(context, prediction_length)  # shape [num_series, num_samples, prediction_length]

# visualize the forecast
forecast_index = range(len(df), len(df) + prediction_length)
low, median, high = np.quantile(forecast[0].numpy(), [0.1, 0.5, 0.9], axis=0)

plt.figure(figsize=(8, 4))
plt.plot(df["#Passengers"], color="royalblue", label="historical data")
plt.plot(forecast_index, median, color="tomato", label="median forecast")
plt.fill_between(forecast_index, low, high, color="tomato", alpha=0.3, label="80% prediction interval")
plt.legend()
plt.grid()
plt.show()

📚 详细文档

架构

本仓库中的模型基于T5架构。唯一的区别在于词汇量大小：Chronos-T5模型使用4096个不同的标记，而原始T5模型使用32128个标记，这导致Chronos-T5模型的参数更少。

模型	参数数量	基于的模型
chronos-t5-tiny	8M	t5-efficient-tiny
chronos-t5-mini	20M	t5-efficient-mini
chronos-t5-small	46M	t5-efficient-small
chronos-t5-base	200M	t5-efficient-base
chronos-t5-large	710M	t5-efficient-large

📄 许可证

本项目采用Apache-2.0许可证。

📚 引用

如果您发现Chronos模型对您的研究有用，请考虑引用相关论文：

@article{ansari2024chronos,
  author  = {Ansari, Abdul Fatir and Stella, Lorenzo and Turkmen, Caner and Zhang, Xiyuan, and Mercado, Pedro and Shen, Huibin and Shchur, Oleksandr and Rangapuram, Syama Syndar and Pineda Arango, Sebastian and Kapoor, Shubham and Zschiegner, Jasper and Maddix, Danielle C. and Mahoney, Michael W. and Torkkola, Kari and Gordon Wilson, Andrew and Bohlke-Schneider, Michael and Wang, Yuyang},
  title   = {Chronos: Learning the Language of Time Series},
  journal = {arXiv preprint arXiv:2403.07815},
  year    = {2024}
}