Nanbeige4.1-3B: A Small General Model that Reasons, Aligns, and Acts Paper • 2602.13367 • Published Feb 13 • 33
Measuring and Mitigating Post-hoc Rationalization in Reverse Chain-of-Thought Generation Paper • 2602.14469 • Published Feb 16 • 3
Measuring and Mitigating Post-hoc Rationalization in Reverse Chain-of-Thought Generation Paper • 2602.14469 • Published Feb 16 • 3
Nemotron-Post-Training-v3 Collection Collection of datasets used in the post-training phase of Nemotron Nano and Super v3. • 28 items • Updated about 20 hours ago • 100