Whom to Query for What: Adaptive Group Elicitation via Multi-Turn LLM Interactions Paper • 2602.14279 • Published 10 days ago • 1
Whom to Query for What: Adaptive Group Elicitation via Multi-Turn LLM Interactions Paper • 2602.14279 • Published 10 days ago • 1
Rubrics as an Attack Surface: Stealthy Preference Drift in LLM Judges Paper • 2602.13576 • Published 12 days ago • 2
Rubrics as an Attack Surface (RIPD) Collection This collection releases the official artifacts accompanying “Rubrics as an Attack Surface: Stealthy Preference Drift in LLM Judges.” • 18 items • Updated 6 days ago
ZDCSlab/ripd-anthropic-saferlhf-dolphin3-llama31-8b-biased-bt Text Generation • Updated 4 days ago • 6
ZDCSlab/ripd-anthropic-saferlhf-dolphin3-llama31-8b-biased-bt Text Generation • Updated 4 days ago • 6
ZDCSlab/ripd-anthropic-saferlhf-gemma-2b-uncensored-v1-biased-bt Text Generation • 3B • Updated 5 days ago • 5
ZDCSlab/ripd-anthropic-saferlhf-gemma-2b-uncensored-v1-biased-bt Text Generation • 3B • Updated 5 days ago • 5
ZDCSlab/ripd-anthropic-saferlhf-gemma-2b-uncensored-v1-seed-bt Text Generation • 3B • Updated 5 days ago • 7
ZDCSlab/ripd-anthropic-saferlhf-gemma-2b-uncensored-v1-seed-bt Text Generation • 3B • Updated 5 days ago • 7