Diagnosing Shortcut-Induced Rigidity in Continual Learning: The Einstellung Rigidity Index (ERI)
2510.00475v1
cs.LG, cs.CV
2025-10-04
Авторы:
Kai Gu, Weishi Shi
Abstract
Deep neural networks frequently exploit shortcut features, defined as
incidental correlations between inputs and labels without causal meaning.
Shortcut features undermine robustness and reduce reliability under
distribution shifts. In continual learning (CL), the consequences of shortcut
exploitation can persist and intensify: weights inherited from earlier tasks
bias representation reuse toward whatever features most easily satisfied prior
labels, mirroring the cognitive Einstellung effect, a phenomenon where past
habits block optimal solutions. Whereas catastrophic forgetting erodes past
skills, shortcut-induced rigidity throttles the acquisition of new ones. We
introduce the Einstellung Rigidity Index (ERI), a compact diagnostic that
disentangles genuine transfer from cue-inflated performance using three
interpretable facets: (i) Adaptation Delay (AD), (ii) Performance Deficit (PD),
and (iii) Relative Suboptimal Feature Reliance (SFR_rel). On a two-phase
CIFAR-100 CL benchmark with a deliberately spurious magenta patch in Phase 2,
we evaluate Naive fine-tuning (SGD), online Elastic Weight Consolidation
(EWC_on), Dark Experience Replay (DER++), Gradient Projection Memory (GPM), and
Deep Generative Replay (DGR). Across these continual learning methods, we
observe that CL methods reach accuracy thresholds earlier than a Scratch-T2
baseline (negative AD) but achieve slightly lower final accuracy on patched
shortcut classes (positive PD). Masking the patch improves accuracy for CL
methods while slightly reducing Scratch-T2, yielding negative SFR_rel. This
pattern indicates the patch acted as a distractor for CL models in this setting
rather than a helpful shortcut.
Ссылки и действия
Дополнительные ресурсы: