andthattoo
AI & ML interests
RL, Inference, Evolutionary Compute
mem-agent: Equipping LLM Agents with Memory Using RL
mem-agent: Persistent, Human Readable Memory Agent Trained with Online RL
Datasets (14)
andthattoo/rr_math_judged • Viewer • Updated • 3.23k • 3
Viewer • Updated • 3.23k • 6
Viewer • Updated • 3.23k • 5
andthattoo/router-r1-1.5b-5k • Viewer • Updated • 3.33k • 4
andthattoo/router-r1-32b-5k • Viewer • Updated • 3.33k • 4
andthattoo/router-r1-14b-5k • Viewer • Updated • 3.33k • 4
andthattoo/router-r1-7b-5k • Viewer • Updated • 3.33k • 8
andthattoo/router_math_subset_below_8k • Viewer • Updated • 3.33k • 2
andthattoo/router_math_subset_below_4k • Viewer • Updated • 1.11k • 3
andthattoo/router_math_subset • Viewer • Updated • 1.6k • 2