CDE: Curiosity-Driven Exploration for Efficient Reinforcement Learning in Large Language Models Paper • 2509.09675 • Published Sep 11 • 28
NousResearch/DeepHermes-ToolCalling-Specialist-Atropos Reinforcement Learning • 8B • Updated Apr 28 • 52 • 14
NousResearch/DeepHermes-Financial-Fundamentals-Prediction-Specialist-Atropos Text Generation • 8B • Updated Apr 28 • 158 • 14
NousResearch/DeepHermes-Egregore-v1-RLAIF-8b-Atropos Reinforcement Learning • 8B • Updated Apr 29 • 59 • 3
NousResearch/DeepHermes-Egregore-v2-RLAIF-8b-Atropos Reinforcement Learning • 8B • Updated Apr 29 • 88 • 6
NousResearch/DeepHermes-AscensionMaze-RLAIF-8b-Atropos Reinforcement Learning • 8B • Updated Apr 29 • 90 • 7
MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources Paper • 2509.21268 • Published 26 days ago • 100