What Users Leave Unsaid: Under-Specified Queries Limit Vision-Language Models
Paper
• 2601.06165 • Published
• 16
AI Safety & AI Security
COMPASS: A Framework for Evaluating Organization-Specific Policy Alignment in LLMs
X-Teaming Evolutionary M2S: Automated Discovery of Multi-turn to Single-turn Jailbreak Templates