S2AP: Score-space Sharpness Minimization for Adversarial Pruning
2510.18381v1
cs.CV, cs.AI, cs.LG
2025-10-23
Авторы:
Giorgio Piras, Qi Zhao, Fabio Brau, Maura Pintor, Christian Wressnegger, Battista Biggio
Abstract
Adversarial pruning methods have emerged as a powerful tool for compressing
neural networks while preserving robustness against adversarial attacks. These
methods typically follow a three-step pipeline: (i) pretrain a robust model,
(ii) select a binary mask for weight pruning, and (iii) finetune the pruned
model. To select the binary mask, these methods minimize a robust loss by
assigning an importance score to each weight, and then keep the weights with
the highest scores. However, this score-space optimization can lead to sharp
local minima in the robust loss landscape and, in turn, to an unstable mask
selection, reducing the robustness of adversarial pruning methods. To overcome
this issue, we propose a novel plug-in method for adversarial pruning, termed
Score-space Sharpness-aware Adversarial Pruning (S2AP). Through our method, we
introduce the concept of score-space sharpness minimization, which operates
during the mask search by perturbing importance scores and minimizing the
corresponding robust loss. Extensive experiments across various datasets,
models, and sparsity levels demonstrate that S2AP effectively minimizes
sharpness in score space, stabilizing the mask selection, and ultimately
improving the robustness of adversarial pruning methods.
Ссылки и действия
Дополнительные ресурсы: