Stealing AI Model Weights Through Covert Communication Channels
2510.00151v1
cs.CR, cs.AI
2025-10-05
Авторы:
Valentin Barbaza, Alan Rodrigo Diaz-Rizo, Hassan Aboushady, Spyridon Raptis, Haralampos-G. Stratigopoulos
Abstract
AI models are often regarded as valuable intellectual property due to the
high cost of their development, the competitive advantage they provide, and the
proprietary techniques involved in their creation. As a result, AI model
stealing attacks pose a serious concern for AI model providers. In this work,
we present a novel attack targeting wireless devices equipped with AI hardware
accelerators. The attack unfolds in two phases. In the first phase, the
victim's device is compromised with a hardware Trojan (HT) designed to covertly
leak model weights through a hidden communication channel, without the victim
realizing it. In the second phase, the adversary uses a nearby wireless device
to intercept the victim's transmission frames during normal operation and
incrementally reconstruct the complete weight matrix. The proposed attack is
agnostic to both the AI model architecture and the hardware accelerator used.
We validate our approach through a hardware-based demonstration involving four
diverse AI models of varying types and sizes. We detail the design of the HT
and the covert channel, highlighting their stealthy nature. Additionally, we
analyze the impact of bit error rates on the reception and propose an error
mitigation technique. The effectiveness of the attack is evaluated based on the
accuracy of the reconstructed models with stolen weights and the time required
to extract them. Finally, we explore potential defense mechanisms.
Ссылки и действия
Дополнительные ресурсы: