-
THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning
Paper • 2509.13761 • Published • 16 -
Knapsack RL: Unlocking Exploration of LLMs via Optimizing Budget Allocation
Paper • 2509.25849 • Published • 46 -
Reactive Transformer (RxT) -- Stateful Real-Time Processing for Event-Driven Reactive Language Models
Paper • 2510.03561 • Published • 23 -
Less is More: Recursive Reasoning with Tiny Networks
Paper • 2510.04871 • Published • 396
Daniel Kloimwieder
dkkloimwieder
·
AI & ML interests
None yet
Recent Activity
updated
a collection
3 days ago
Paper
updated
a collection
5 days ago
Paper
updated
a collection
7 days ago
Paper
Organizations
None yet