BrowseSafe: Understanding and Preventing Prompt Injection Within AI Browser Agents

2511.20597v1 cs.LG, cs.AI, cs.CR 2025-11-27

Authors:

Kaiyuan Zhang, Mark Tenenholtz, Kyle Polley, Jerry Ma, Denis Yarats, Ninghui Li

Abstract

The integration of artificial intelligence (AI) agents into web browsers introduces security challenges that go beyond traditional web application threat models. Prior work has identified prompt injection as a new attack vector for web agents, yet the resulting impact within real-world environments remains insufficiently understood. In this work, we examine the landscape of prompt injection attacks and synthesize a benchmark of attacks embedded in realistic HTML payloads. Our benchmark goes beyond prior work by emphasizing injections that can influence real-world actions rather than mere text outputs, and by presenting attack payloads with complexity and distractor frequency similar to what real-world agents encounter. We leverage this benchmark to conduct a comprehensive empirical evaluation of existing defenses, assessing their effectiveness across a suite of frontier AI models. We propose a multi-layered defense strategy comprising both architectural and model-based defenses to protect against evolving prompt injection attacks. Our work offers a blueprint for designing practical, secure web agents through a defense-in-depth approach.


