2 4 1

Federico Bianchi

federicotogether

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

DSGym: A Holistic Framework for Evaluating and Training Data Science Agents

upvoted a paper 8 days ago

Learning to Discover at Test Time

new activity 3 months ago

togethercomputer/FutureBench:Leaderboard is broken

View all activity

Organizations

upvoted a paper 4 days ago

DSGym: A Holistic Framework for Evaluating and Training Data Science Agents

Paper • 2601.16344 • Published 8 days ago • 10

upvoted a paper 8 days ago

Learning to Discover at Test Time

Paper • 2601.16175 • Published 9 days ago • 41

New activity in togethercomputer/FutureBench 3 months ago

Leaderboard is broken

#1 opened 3 months ago by

dgallegos

updated 2 datasets 3 months ago

futurebench/results

Viewer • Updated Nov 12, 2025 • 5 • 11

futurebench/data

Viewer • Updated Nov 12, 2025 • 2.34k • 6

upvoted a paper 3 months ago

ReasonIF: Large Reasoning Models Fail to Follow Instructions During Reasoning

Paper • 2510.15211 • Published Oct 17, 2025 • 2

published 2 datasets 6 months ago

futurebench/results

Viewer • Updated Nov 12, 2025 • 5 • 11

futurebench/data

Viewer • Updated Nov 12, 2025 • 2.34k • 6

updated a dataset 6 months ago

federicotogether/frames-test

Viewer • Updated Aug 1, 2025 • 324 • 3

published a dataset 6 months ago

federicotogether/frames-test

Viewer • Updated Aug 1, 2025 • 324 • 3

updated a dataset 6 months ago

federicotogether/frames-train

Viewer • Updated Aug 1, 2025 • 500 • 4

published a dataset 6 months ago

federicotogether/frames-train

Viewer • Updated Aug 1, 2025 • 500 • 4

updated 2 datasets 6 months ago

federicotogether/math-search-o1-v1

Viewer • Updated Jul 28, 2025 • 1.63k • 6

federicotogether/composite-math-search-coding-v1

Viewer • Updated Jul 28, 2025 • 599 • 4

published a dataset 6 months ago

federicotogether/composite-math-search-coding-v1

Viewer • Updated Jul 28, 2025 • 599 • 4

upvoted an article 7 months ago

Article

Back to The Future: Evaluating AI Agents on Predicting Future Events

Jul 17, 2025

•

liked a Space 7 months ago

FutureBench Leaderboard

🔮

Display and analyze prediction leaderboard data

updated a Space 7 months ago

FutureBench Leaderboard

🔮

Display and analyze prediction leaderboard data

New activity in huggingface/documentation-images 7 months ago

Upload 3 files

#521 opened 7 months ago by

federicotogether

updated a model 8 months ago

the-real-gabagool/qwen-s1-fede-7b-dynamic-cheatsheet-shuffled-v2-checkpoint-142

8B • Updated May 30, 2025 • 1

Federico Bianchi

AI & ML interests

Recent Activity

Organizations

federicotogether's activity

Leaderboard is broken

Back to The Future: Evaluating AI Agents on Predicting Future Events

FutureBench Leaderboard

FutureBench Leaderboard

Upload 3 files