Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled Text Generation • 28B • Updated 5 days ago • 40.7k • 455
PRISM: Pushing the Frontier of Deep Think via Process Reward Model-Guided Inference Paper • 2603.02479 • Published 9 days ago • 18