What type of study is this?

This is an experimental study.

September 24, 2025 | Open Access

Filter then Attend: Improving attention-based Time Series Forecasting with Spectral Filtering

Key Points

Adding spectral filters yields, in multiple instances, a 5-10% relative improvement in long time-series forecasting performance.
Learnable filters enhance transformer-based models while adding only around 1000 parameters.
With filters added, the embedding dimension can be reduced, giving transformer architectures that are both smaller and more effective than their non-filtering base models.
Synthetic experiments analyze how the filters help transformer-based models use the full spectrum for forecasting.

Abstract

Transformer-based models are at the forefront of long time-series forecasting (LTSF). While these models achieve state-of-the-art results in many cases, they suffer from a bias toward low frequencies in the data and from high computational and memory requirements. Recent work has established that learnable frequency filters can be an integral part of a deep forecasting model by enhancing the model's spectral utilization. These works use a multilayer perceptron to process their filtered signals and thus do not solve the issues found with transformer-based models. In this paper, we establish that adding a filter to the beginning of transformer-based models enhances their performance in long time-series forecasting. We add learnable filters, which contribute only about 1000 additional parameters, to several transformer-based models and observe in multiple instances a 5-10% relative improvement in forecasting performance. Additionally, we find that with filters added, we are able to decrease the embedding dimension of our models, resulting in transformer-based architectures that are both smaller and more effective than their non-filtering base models. We also conduct synthetic experiments to analyze how the filters enable transformer-based models to better utilize the full spectrum for forecasting.
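The abstract does not spell out how the filter is parameterized, so the following is only a minimal sketch of one common form of frequency filter: a per-frequency complex gain applied between a real FFT and its inverse, placed before whatever transformer follows. The function names `init_filter` and `spectral_filter`, the identity initialization, and the tensor layout `(batch, seq_len, channels)` are assumptions made for illustration, not the authors' implementation. Only the forward pass is shown; in a trained model the weights would be optimized together with the rest of the network.

```python
import numpy as np


def init_filter(seq_len):
    """Hypothetical init: one complex weight per rFFT bin, set to 1 (identity)."""
    return np.ones(seq_len // 2 + 1, dtype=np.complex128)


def spectral_filter(x, weights):
    """Apply a per-frequency complex gain to a batch of real-valued series.

    x:       real array of shape (batch, seq_len, channels)
    weights: complex array of shape (seq_len // 2 + 1,)
    """
    seq_len = x.shape[1]
    # Move to the frequency domain along the time axis.
    spectrum = np.fft.rfft(x, axis=1)
    # Scale each frequency bin; the same weights are shared across channels.
    filtered = spectrum * weights[None, :, None]
    # Return to the time domain with the original length.
    return np.fft.irfft(filtered, n=seq_len, axis=1)


rng = np.random.default_rng(0)
x = rng.standard_normal((4, 96, 7))  # toy batch: 4 windows, 96 steps, 7 channels

w = init_filter(96)
y = spectral_filter(x, w)  # identity filter: output matches input

# Keeping only the zero-frequency bin reduces each series to its mean.
w_dc = np.zeros_like(w)
w_dc[0] = 1.0
y_dc = spectral_filter(x, w_dc)

# The filtered window y would then be embedded and passed to the transformer.
n_real_params = 2 * w.size  # real and imaginary part per frequency bin
```

In this sketch the number of filter parameters depends only on the lookback length, not on the embedding dimension, which is consistent with the abstract's point that the filter is a small addition relative to the transformer itself.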

Cite this study

Dayag et al. (2025) studied this question.

www.synapsesocial.com/papers/68d6d8978b2b6861e4c3eb88 — DOI: https://doi.org/10.48550/arxiv.2508.20206

Authors

Elisha Dayag

Nhat Thanh Tran

Jack Xin
