Fidel-TS: A High-Fidelity Benchmark for Multimodal Time Series Forecasting
2509.24789v1
cs.LG, stat.ML
2025-10-01
Authors:
Zhijian Xu, Wanxu Cai, Xilin Dai, Zhaorong Deng, Qiang Xu
Abstract
The evaluation of time series forecasting models is hindered by a critical
lack of high-quality benchmarks, leading to a potential illusion of progress.
Existing datasets suffer from issues ranging from pre-training data
contamination in the age of LLMs to the causal and description leakage
prevalent in early multimodal designs. To address this, we formalize the core
principles of high-fidelity benchmarking, focusing on data sourcing integrity,
strict causal soundness, and structural clarity. We introduce Fidel-TS, a new
large-scale benchmark built from the ground up on these principles by sourcing
data from live APIs. Our extensive experiments validate this approach by
exposing the critical biases and design limitations of prior benchmarks.
Furthermore, we conclusively demonstrate that the causal relevance of textual
information is the key factor in unlocking genuine performance gains in
multimodal forecasting.