Unsupervised Backdoor Detection and Mitigation for Spiking Neural Networks

2510.06629v1 cs.CR, cs.CV, cs.LG 2025-10-10

Авторы:

Jiachen Li, Bang Wu, Xiaoyu Xia, Xiaoning Liu, Xun Yi, Xiuzhen Zhang

Abstract

Spiking Neural Networks (SNNs) have gained increasing attention for their superior energy efficiency compared to Artificial Neural Networks (ANNs). However, their security aspects, particularly under backdoor attacks, have received limited attention. Existing defense methods developed for ANNs perform poorly or can be easily bypassed in SNNs due to their event-driven and temporal dependencies. This paper identifies the key blockers that hinder traditional backdoor defenses in SNNs and proposes an unsupervised post-training detection framework, Temporal Membrane Potential Backdoor Detection (TMPBD), to overcome these challenges. TMPBD leverages the maximum margin statistics of temporal membrane potential (TMP) in the final spiking layer to detect target labels without any attack knowledge or data access. We further introduce a robust mitigation mechanism, Neural Dendrites Suppression Backdoor Mitigation (NDSBM), which clamps dendritic connections between early convolutional layers to suppress malicious neurons while preserving benign behaviors, guided by TMP extracted from a small, clean, unlabeled dataset. Extensive experiments on multiple neuromorphic benchmarks and state-of-the-art input-aware dynamic trigger attacks demonstrate that TMPBD achieves 100% detection accuracy, while NDSBM reduces the attack success rate from 100% to 8.44%, and to 2.81% when combined with detection, without degrading clean accuracy.

Ссылки и действия

Читать на arXiv Скачать PDF

Дополнительные ресурсы:

Unsupervised Backdoor Detection and Mitigation for Spiking Neural Networks

Авторы:

Abstract

Ссылки и действия

Связанные статьи

PhishSnap: Image-Based Phishing Detection Using Perceptual Hashing

Class-feature Watermark: A Resilient Black-box Watermark Against Model Extractio...

Privacy-Aware Federated nnU-Net for ECG Page Digitization

From See to Shield: ML-Assisted Fine-Grained Access Control for Visual Data

Goal-oriented Backdoor Attack against Vision-Language-Action Models via Physical...

Навигация