SafeSwitch: Steering Unsafe LLM Behavior via Internal Activation Signals Paper • 2502.01042 • Published Feb 3 • 1
The Law of Knowledge Overshadowing: Towards Understanding, Predicting, and Preventing LLM Hallucination Paper • 2502.16143 • Published Feb 22 • 6
Energy-Based Transformers are Scalable Learners and Thinkers Paper • 2507.02092 • Published Jul 2 • 69
Thinking with Images for Multimodal Reasoning: Foundations, Methods, and Future Frontiers Paper • 2506.23918 • Published Jun 30 • 89
ToMAP: Training Opponent-Aware LLM Persuaders with Theory of Mind Paper • 2505.22961 • Published May 29 • 8
SafeScientist: Toward Risk-Aware Scientific Discoveries by LLM Agents Paper • 2505.23559 • Published May 29 • 11
SafeScientist: Toward Risk-Aware Scientific Discoveries by LLM Agents Paper • 2505.23559 • Published May 29 • 11
ToMAP: Training Opponent-Aware LLM Persuaders with Theory of Mind Paper • 2505.22961 • Published May 29 • 8
ToMAP Collection Models related to paper "ToMAP: Training Opponent-Aware LLM Persuaders with Theory of Mind" • 3 items • Updated May 19