Advanced Academy Reader

Benchmarking Agent Safety in Browsers

Analyzing the security risks of agentic browsing, specifically prompt injection via HTML, and exploring benchmarks like BrowseSafe.

advanced•4 / 5

Benchmarking with BrowseSafe

In this section

BrowseSafe is a benchmark suite designed to evaluate these defenses. It tests agents against a dataset of diverse injection attacks embedded in realistic web pages.

Key Metrics#

Attack Success Rate (ASR)#

The percentage of attacks that successfully manipulate the agent.

False Positive Rate#

How often legitimate content is flagged as malicious.

Latency Overhead#

The time added to the browsing session by the defense mechanism.

← Previous

Section 4 of 5•