Temporal Generalization: A Reality Check

2509.23487v1 cs.LG, cs.CL, cs.CV 2025-10-01

Авторы:

Divyam Madaan, Sumit Chopra, Kyunghyun Cho

Abstract

Machine learning (ML) models often struggle to maintain performance under distribution shifts, leading to inaccurate predictions on unseen future data. In this work, we investigate whether and under what conditions models can achieve such a generalization when relying solely on past data. We explore two primary approaches: convex combinations of past model parameters (\emph{parameter interpolation}) and explicit extrapolation beyond the convex hull of past parameters (\emph{parameter extrapolation}). We benchmark several methods within these categories on a diverse set of temporal tasks, including language modeling, news summarization, news tag prediction, academic paper categorization, satellite image-based land use classification over time, and historical yearbook photo gender prediction. Our empirical findings show that none of the evaluated methods consistently outperforms the simple baseline of using the latest available model parameters in all scenarios. In the absence of access to future data or robust assumptions about the underlying data-generating process, these results underscore the inherent difficulties of generalizing and extrapolating to future data and warrant caution when evaluating claims of such generalization.

Ссылки и действия

Читать на arXiv Скачать PDF

Дополнительные ресурсы:

Temporal Generalization: A Reality Check

Авторы:

Abstract

Ссылки и действия

Связанные статьи

MemLoRA: Distilling Expert Adapters for On-Device Memory Systems

DeepCoT: Deep Continual Transformers for Real-Time Inference on Data Streams

BayesQ: Uncertainty-Guided Bayesian Quantization

A U-Net and Transformer Pipeline for Multilingual Image Translation

FairImagen: Post-Processing for Bias Mitigation in Text-to-Image Models

Навигация