AmpleGCG-Plus: A Strong Generative Model of Adversarial Suffixes to Jailbreak LLMs with Higher Success Rates in Fewer Attempts Paper • 2410.22143 • Published Oct 29, 2024 • 1
RedTeamCUA: Realistic Adversarial Testing of Computer-Use Agents in Hybrid Web-OS Environments Paper • 2505.21936 • Published May 28, 2025 • 1
When Actions Go Off-Task: Detecting and Correcting Misaligned Actions in Computer-Use Agents Paper • 2602.08995 • Published 3 days ago • 1
When Benign Inputs Lead to Severe Harms: Eliciting Unsafe Unintended Behaviors of Computer-Use Agents Paper • 2602.08235 • Published 3 days ago
AutoElicit Collection When Benign Inputs Lead to Severe Harms: Eliciting Unsafe Unintended Behaviors of Computer-Use Agents • 4 items • Updated 2 days ago
Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge Paper • 2506.21506 • Published Jun 26, 2025 • 52