A New Type of Adversarial Examples
2510.19347v1
cs.LG, cs.AI, cs.GR
2025-10-24
Авторы:
Xingyang Nie, Guojie Xiao, Su Pan, Biao Wang, Huilin Ge, Tao Fang
Abstract
Most machine learning models are vulnerable to adversarial examples, which
poses security concerns on these models. Adversarial examples are crafted by
applying subtle but intentionally worst-case modifications to examples from the
dataset, leading the model to output a different answer from the original
example. In this paper, adversarial examples are formed in an exactly opposite
manner, which are significantly different from the original examples but result
in the same answer. We propose a novel set of algorithms to produce such
adversarial examples, including the negative iterative fast gradient sign
method (NI-FGSM) and the negative iterative fast gradient method (NI-FGM),
along with their momentum variants: the negative momentum iterative fast
gradient sign method (NMI-FGSM) and the negative momentum iterative fast
gradient method (NMI-FGM). Adversarial examples constructed by these methods
could be used to perform an attack on machine learning systems in certain
occasions. Moreover, our results show that the adversarial examples are not
merely distributed in the neighbourhood of the examples from the dataset;
instead, they are distributed extensively in the sample space.
Ссылки и действия
Дополнительные ресурсы: