AntiPaSTO: Self-Supervised Steering of Moral Reasoning Paper • 2601.07473 • Published 11 days ago • 1
Foundation Text-Generation Models Below 360M Parameters Collection Great candidates for fine-tuning targeting Wllama and Transformers.js for mobile devices, ordered by number of parameters. • 41 items • Updated Oct 4, 2025 • 37
RewardBench: Evaluating Reward Models for Language Modeling Paper • 2403.13787 • Published Mar 20, 2024 • 22