EvoMail: Self-Evolving Cognitive Agents for Adaptive Spam and Phishing Email Defense
2509.21129v1
cs.LG, cs.CR
2025-09-27
Авторы:
Wei Huang, De-Tian Chu, Lin-Yuan Bai, Wei Kang, Hai-Tao Zhang, Bo Li, Zhi-Mo Han, Jing Ge, Hai-Feng Lin
Резюме на русском
#### Контекст
Modern email spam and phishing attacks have significantly evolved, surpassing traditional methods like keyword blacklists or simple heuristics. Adversaries now employ multi-modal tactics, combining natural-language text with obfuscated URLs, forged headers, and malicious attachments. These strategies are dynamically adapted to bypass existing filters within days. Existing spam detection systems, reliant on static rules or single-modality models, struggle to integrate heterogeneous data sources and fail to consistently adapt, leading to rapid performance degradation.
This growing threat highlights the need for innovative approaches capable of robustly identifying and mitigating evolving spam and phishing campaigns.
#### Метод
EvoMail is a self-evolving cognitive agent framework designed to address these challenges. Its architecture integrates a heterogeneous email graph that unifies textual content, metadata (headers, senders, domains), and embedded resources (URLs, attachments). A Cognitive Graph Neural Network (Cognitive-GNN), enhanced by a Large Language Model (LLM), performs context-aware reasoning across these sources to detect coordinated spam campaigns.
A unique adversarial self-evolution loop is central to EvoMail's design. A "red-team" agent generates novel evasion tactics, such as character obfuscation or AI-generated phishing text, while the "blue-team" detector learns from failures. Failures are compressed into a memory module, enabling the system to reuse these experiences for future reasoning, ensuring continuous adaptation to new threats.
#### Результаты
Extensive experiments were conducted using real-world datasets, including Enron-Spam, Ling-Spam, SpamAssassin, and TREC, as well as synthetic adversarial variants. EvoMail consistently outperformed state-of-the-art baselines across multiple metrics: detection accuracy, adaptability to evolving spam tactics, and interpretability of reasoning traces. These results underscore EvoMail's resilience and explainability as a defense framework against next-generation spam and phishing threats.
#### Значимость
EvoMail's adaptive and self-evolving design offers significant advantages across various domains. Its ability to integrate heterogeneous data sources and continuously learn from adversarial tactics makes it highly effective against modern spam and phishing threats. Potential applications include enterprise email security, personal email protection, and anti-phishing initiatives.
By providing a robust and interpretable defense mechanism, EvoMail has the potential to transform the landscape of email security, reducing the impact of evolving cyber threats and enhancing trust in digital communication.
#### Выводы
EvoMail represents a breakthrough in spam and phishing defense, combining cognitive graph neural networks, adversarial self-evolution, and Large Language Models. Its ability to adapt to novel threats and provide transparent reasoning traces positions it as a resilient solution for modern email security challenges. Future research will focus on scaling EvoMail for real-time deployment, exploring its integration with other security frameworks, and addressing emerging multi-modal threats.
Abstract
Modern email spam and phishing attacks have evolved far beyond keyword
blacklists or simple heuristics. Adversaries now craft multi-modal campaigns
that combine natural-language text with obfuscated URLs, forged headers, and
malicious attachments, adapting their strategies within days to bypass filters.
Traditional spam detection systems, which rely on static rules or
single-modality models, struggle to integrate heterogeneous signals or to
continuously adapt, leading to rapid performance degradation.
We propose EvoMail, a self-evolving cognitive agent framework for robust
detection of spam and phishing. EvoMail first constructs a unified
heterogeneous email graph that fuses textual content, metadata (headers,
senders, domains), and embedded resources (URLs, attachments). A Cognitive
Graph Neural Network enhanced by a Large Language Model (LLM) performs
context-aware reasoning across these sources to identify coordinated spam
campaigns. Most critically, EvoMail engages in an adversarial self-evolution
loop: a ''red-team'' agent generates novel evasion tactics -- such as character
obfuscation or AI-generated phishing text -- while the ''blue-team'' detector
learns from failures, compresses experiences into a memory module, and reuses
them for future reasoning.
Extensive experiments on real-world datasets (Enron-Spam, Ling-Spam,
SpamAssassin, and TREC) and synthetic adversarial variants demonstrate that
EvoMail consistently outperforms state-of-the-art baselines in detection
accuracy, adaptability to evolving spam tactics, and interpretability of
reasoning traces. These results highlight EvoMail's potential as a resilient
and explainable defense framework against next-generation spam and phishing
threats.
Ссылки и действия
Дополнительные ресурсы: