Locate, Steer, and Improve: A Practical Survey of Actionable Mechanistic Interpretability in Large Language Models • Paper • 2601.14004 • Published Jan 20, 2026
Graph-Guided Textual Explanation Generation Framework • Paper • 2412.12318 • Published Dec 16, 2024