Rotation Control Unlearning: Quantifying and Controlling Continuous Unlearning for LLM with The Cognitive Rotation Space
2509.25743v1
cs.LG, cs.CL
2025-10-02
Authors:
Xiang Zhang, Kun Wei, Xu Yang, Chenghao Xu, Su Yan, Cheng Deng
Abstract
As Large Language Models (LLMs) become increasingly prevalent, their security
vulnerabilities have drawn growing attention. Machine unlearning aims to
mitigate these risks by removing the influence of undesirable data.
However, existing methods not only rely on a retained dataset to preserve
model utility, but also suffer from cumulative catastrophic utility loss under
continuous unlearning requests. To resolve this dilemma, we propose a novel
method, called Rotation Control Unlearning (RCU), which uses a rotational
salience weight to quantify and control the degree of unlearning throughout
the continuous unlearning process. A skew-symmetric loss is designed to
construct the cognitive rotation space, in which changes in the rotation
angle simulate the continuous unlearning process. Furthermore,
we design an orthogonal rotation axes regularization to enforce mutually
perpendicular rotation directions for continuous unlearning requests,
effectively minimizing interference and addressing cumulative catastrophic
utility loss. Experiments on multiple datasets confirm that our method achieves
state-of-the-art (SOTA) performance without using a retained dataset.
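The abstract does not give the exact form of the skew-symmetric loss or of the orthogonal rotation axes regularization, so the following is only a minimal NumPy sketch of the linear algebra these ideas rest on, not the authors' implementation. It assumes that a rotation is generated by a skew-symmetric matrix, that the rotation angle acts as the control knob, and that orthogonality of axes can be penalized through a Gram matrix. The function names `skew_penalty`, `plane_rotation`, and `axis_orthogonality_penalty` are hypothetical and chosen for illustration.

```python
import numpy as np


def skew_penalty(M):
    """Squared Frobenius norm of M + M^T; zero exactly when M is skew-symmetric.

    Assumption: a stand-in for a skew-symmetric loss, not the paper's definition.
    """
    return float(np.sum((M + M.T) ** 2))


def plane_rotation(u, v, theta):
    """Rotation by angle theta in the plane spanned by orthonormal vectors u, v.

    A = u v^T - v u^T is skew-symmetric and satisfies A^3 = -A, so the matrix
    exponential exp(theta * A) has the closed form used below.
    """
    A = np.outer(u, v) - np.outer(v, u)
    I = np.eye(len(u))
    return I + np.sin(theta) * A + (1.0 - np.cos(theta)) * (A @ A)


def axis_orthogonality_penalty(axes):
    """||W W^T - I||_F^2 for row-stacked axes W; zero when rows are orthonormal.

    Assumption: a generic orthogonality regularizer used here for illustration.
    """
    W = np.asarray(axes, dtype=float)
    gram = W @ W.T
    return float(np.sum((gram - np.eye(W.shape[0])) ** 2))


if __name__ == "__main__":
    e = np.eye(4)
    # Two "unlearning requests" rotating in mutually orthogonal planes.
    R1 = plane_rotation(e[0], e[1], np.pi / 3)
    R2 = plane_rotation(e[2], e[3], 0.7)
    print("R1 is orthogonal:", np.allclose(R1 @ R1.T, np.eye(4)))
    print("rotations commute:", np.allclose(R1 @ R2, R2 @ R1))
    print("axis penalty:", axis_orthogonality_penalty([e[0], e[2]]))
```

In this toy setting, a larger `theta` moves a vector further from its original direction, which is one way to read "quantifying the unlearning degree" as an angle. Rotations in mutually orthogonal planes commute, which loosely illustrates why perpendicular rotation directions would limit interference between successive requests.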