SliceFine: The Universal Winning-Slice Hypothesis for Pretrained Networks
2510.08513v1
cs.CV, cs.CL
2025-10-11
Авторы:
Md Kowsher, Ali O. Polat, Ehsan Mohammady Ardehaly, Mehrdad Salehi, Zia Ghiasi, Prasanth Murali, Chen Chen
Abstract
This paper presents a theoretical framework explaining why fine tuning small,
randomly selected subnetworks (slices) within pre trained models can be
sufficient for downstream adaptation. We prove that pretrained networks exhibit
a universal winning slice property arising from two phenomena: (1) spectral
balance the eigenspectra of different weight matrix slices are remarkably
similar; and (2) high task energy their backbone representations retain rich,
task relevant features. This leads to the Universal Winning Slice Hypothesis,
which provides a theoretical foundation for parameter efficient fine tuning
(PEFT) in large scale models. Inspired by this, we propose SliceFine, a PEFT
method that exploits this inherent redundancy by updating only selected slices
of the original weights introducing zero new parameters, unlike adapter-based
approaches. Empirically, SliceFine matches the performance of state of the art
PEFT methods across language and vision tasks, while significantly improving
training speed, memory efficiency, and model compactness. Our work bridges
theory and practice, offering a theoretically grounded alternative to existing
PEFT techniques.
Ссылки и действия
Дополнительные ресурсы: