Hierarchical Dual-Strategy Unlearning for Biomedical and Healthcare Intelligence Using Imperfect and Privacy-Sensitive Medical Data

2511.19498v1 cs.LG, cs.AI, cs.CR 2025-11-26

Authors:

Yi Zhang, Tianxiang Xu, Zijian Li, Chao Zhang, Kunyu Zhang, Zhan Gao, Meinuo Li, Xiaohan Zhang, Qichao Qi, Bing Chen

Abstract

Large language models (LLMs) exhibit exceptional performance but pose substantial privacy risks due to training data memorization, particularly within healthcare contexts involving imperfect or privacy-sensitive patient information. We present a hierarchical dual-strategy framework for selective knowledge unlearning that precisely removes specialized knowledge while preserving fundamental medical competencies. Our approach synergistically integrates geometric-constrained gradient updates to selectively modulate target parameters with concept-aware token-level interventions that distinguish between preservation-critical and unlearning-targeted tokens via a unified four-level medical concept hierarchy. Comprehensive evaluations on the MedMCQA (surgical) and MHQA (anxiety, depression, trauma) datasets demonstrate superior performance, achieving an 82.7% forgetting rate and 88.5% knowledge preservation. Notably, our framework maintains robust privacy guarantees while requiring modification of only 0.1% of parameters, addressing critical needs for regulatory compliance, auditability, and ethical standards in clinical research.


