Assessing the Sensitivity and Alignment of FOL Closeness Metrics Paper • 2501.08613 • Published Jan 15
Logical Reasoning with Outcome Reward Models for Test-Time Scaling Paper • 2508.19903 • Published Aug 27
Strategies for Improving NL-to-FOL Translation with LLMs: Data Generation, Incremental Fine-Tuning, and Verification Paper • 2409.16461 • Published Sep 24, 2024