Exploring the Role of Token in Transformer-based Time Series Forecasting

Zhang, Jianqi; Wang, Jingyao; Sun, Chuxiong; Shen, Xingchen; Xu, Fanjiang; Zheng, Changwen; Qiang, Wenwen

Computer Science > Artificial Intelligence

arXiv:2404.10337 (cs)

[Submitted on 16 Apr 2024 (v1), last revised 30 Oct 2024 (this version, v3)]

Title:Exploring the Role of Token in Transformer-based Time Series Forecasting

Authors:Jianqi Zhang, Jingyao Wang, Chuxiong Sun, Xingchen Shen, Fanjiang Xu, Changwen Zheng, Wenwen Qiang

View PDF HTML (experimental)

Abstract:Transformer-based methods are a mainstream approach for solving time series forecasting (TSF). These methods use temporal or variable tokens from observable data to make predictions. However, most focus on optimizing the model structure, with few studies paying attention to the role of tokens for predictions. The role is crucial since a model that distinguishes useful tokens from useless ones will predict more effectively. In this paper, we explore this issue. Through theoretical analyses, we find that the gradients mainly depend on tokens that contribute to the predicted series, called positive tokens. Based on this finding, we explore what helps models select these positive tokens. Through a series of experiments, we obtain three observations: i) positional encoding (PE) helps the model identify positive tokens; ii) as the network depth increases, the PE information gradually weakens, affecting the model's ability to identify positive tokens in deeper layers; iii) both enhancing PE in the deeper layers and using semantic-based PE can improve the model's ability to identify positive tokens, thus boosting performance. Inspired by these findings, we design temporal positional encoding (T-PE) for temporal tokens and variable positional encoding (V-PE) for variable tokens. To utilize T-PE and V-PE, we propose T2B-PE, a Transformer-based dual-branch framework. Extensive experiments demonstrate that T2B-PE has superior robustness and effectiveness.

Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:2404.10337 [cs.AI]
	(or arXiv:2404.10337v3 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2404.10337

Submission history

From: Jianqi Zhang [view email]
[v1] Tue, 16 Apr 2024 07:21:39 UTC (6,957 KB)
[v2] Tue, 29 Oct 2024 07:29:39 UTC (4,852 KB)
[v3] Wed, 30 Oct 2024 01:49:45 UTC (4,852 KB)

Computer Science > Artificial Intelligence

Title:Exploring the Role of Token in Transformer-based Time Series Forecasting

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Exploring the Role of Token in Transformer-based Time Series Forecasting

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators