Towards Strong Certified Defense with Universal Asymmetric Randomization

2510.19977v1 cs.LG, cs.CR 2025-10-25

Authors:

Hanbin Hong, Ashish Kundu, Ali Payani, Binghui Wang, Yuan Hong

Abstract

Randomized smoothing has become essential for achieving certified adversarial robustness in machine learning models. However, current methods primarily use isotropic noise distributions that are uniform across all data dimensions, such as image pixels. Ignoring the heterogeneity of inputs and data dimensions in this way limits the effectiveness of robustness certification. To address this limitation, we propose UCAN, a novel technique that Universally Certifies adversarial robustness with Anisotropic Noise. UCAN is designed to enhance any existing randomized smoothing method by transforming it from symmetric (isotropic) to asymmetric (anisotropic) noise distributions, thereby offering a more tailored defense against adversarial attacks. Our theoretical framework is versatile: it supports a wide array of noise distributions for certified robustness in different $\ell_p$-norms, and it applies to any classifier by guaranteeing the classifier's prediction over perturbed inputs with provable robustness bounds through tailored noise injection. Additionally, we develop a novel framework equipped with three exemplary noise parameter generators (NPGs) to optimally fine-tune the anisotropic noise parameters for different data dimensions, allowing different levels of robustness enhancement to be pursued in practice. Empirical evaluations show that UCAN significantly outperforms existing state-of-the-art methods, with up to $182.6\%$ improvement in certified accuracy at large certified radii on the MNIST, CIFAR10, and ImageNet datasets. Code is available at https://github.com/youbin2014/UCAN/.
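To make the isotropic versus anisotropic distinction concrete, the following is a minimal sketch of the prediction step of randomized smoothing with per-dimension Gaussian noise. It is an illustration under stated assumptions, not the UCAN method: the function `smoothed_predict`, the `toy_classifier`, the Gaussian noise choice, and the hand-picked `sigma` values are all hypothetical, and the sketch computes no certified radius and uses no noise parameter generator.

```python
import numpy as np

def smoothed_predict(base_classifier, x, sigma, num_classes, n_samples=1000, seed=0):
    """Monte Carlo majority vote of base_classifier under Gaussian noise.

    sigma may be a scalar (isotropic noise) or an array shaped like x
    (anisotropic noise, one scale per input dimension). Hypothetical helper.
    """
    rng = np.random.default_rng(seed)
    # Broadcast so that a scalar sigma and a per-dimension sigma share one code path.
    sigma = np.broadcast_to(np.asarray(sigma, dtype=float), x.shape)
    # Dimension i of every noise sample is scaled by its own sigma[i].
    noise = rng.standard_normal((n_samples,) + x.shape) * sigma
    preds = np.array([base_classifier(x + eps) for eps in noise])
    counts = np.bincount(preds, minlength=num_classes)
    return int(np.argmax(counts)), counts

def toy_classifier(z):
    # Hypothetical two-class base classifier: class 1 if the coordinates sum above zero.
    return int(z.sum() > 0)

x = np.array([2.0, 2.0, 2.0, 2.0])

# Isotropic: the same noise scale for every dimension.
label_iso, counts_iso = smoothed_predict(toy_classifier, x, 0.5, num_classes=2)

# Anisotropic: hand-picked (assumed) scales that differ across dimensions.
sigma_aniso = np.array([0.1, 0.1, 1.0, 1.0])
label_aniso, counts_aniso = smoothed_predict(toy_classifier, x, sigma_aniso, num_classes=2)
```

The only difference between the two calls is whether `sigma` is a scalar or a vector. In the setting the abstract describes, the per-dimension scales would be produced by a noise parameter generator rather than chosen by hand, and a certification step would then bound the perturbation under which the majority class cannot change.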
