SHADE Monitor: Before vs After 🛡
Qwen 1.5B baseline vs GRPO-trained LoRA monitor.