Xiaoyu Tan

WIlliam1900

https://scholar.google.com/citations?user=ftq5rBYAAAAJ&hl=en

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

SmartSnap: Proactive Evidence Seeking for Self-Verifying Agents

liked a Space about 2 months ago

HuggingFaceTB/smol-training-playbook

upvoted an article about 2 months ago

Aligning to What? Rethinking Agent Generalization in MiniMax M2

View all activity

Organizations

upvoted a paper 3 days ago

SmartSnap: Proactive Evidence Seeking for Self-Verifying Agents

Paper • 2512.22322 • Published 7 days ago • 34

liked a Space about 2 months ago

The Smol Training Playbook

📚

2.76k

The secrets to building world-class LLMs

upvoted an article about 2 months ago

Article

Aligning to What? Rethinking Agent Generalization in MiniMax M2

Oct 30, 2025

•

upvoted an article 3 months ago

Article

Gaia2 and ARE: Empowering the community to study agents

Sep 22, 2025

•

125

upvoted 2 papers 3 months ago

Training-Free Group Relative Policy Optimization

Paper • 2510.08191 • Published Oct 9, 2025 • 44

Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning

Paper • 2509.22601 • Published Sep 26, 2025 • 29

liked a Space 7 months ago

Reward Bench Leaderboard

📐

417

Display and analyze reward model evaluation results

liked a model 7 months ago

infly/INF-AZ-7B-0524

Image-to-Text • 8B • Updated May 25, 2025 • 31 • 3

liked a model 8 months ago

infly/inf-o1-pi0

33B • Updated Apr 30, 2025 • 76 • 8

liked a Space 8 months ago

Open LLM Leaderboard

🏆

13.8k

Track, rank and evaluate open LLMs and chatbots

liked a dataset 8 months ago

Post-training-Data-Flywheel/AutoIF-instruct-61k-with-funcs

Viewer • Updated Oct 3, 2024 • 61.5k • 189 • 6

liked a model 8 months ago

Goedel-LM/Goedel-Prover-DPO

7B • Updated Apr 22, 2025 • 7 • 4

liked a model 10 months ago

Goedel-LM/Goedel-Prover-SFT

7B • Updated Apr 18, 2025 • 48 • 28

liked 3 datasets about 1 year ago

upvoted a paper almost 2 years ago

TinyGSM: achieving >80% on GSM8k with small language models

Paper • 2312.09241 • Published Dec 14, 2023 • 39

liked 2 datasets over 2 years ago

GAIR/lima

Viewer • Updated Jun 8, 2023 • 1.33k • 831 • 452

anon8231489123/ShareGPT_Vicuna_unfiltered

Updated Apr 12, 2023 • 61.4k • 835

Xiaoyu Tan

AI & ML interests

Recent Activity

Organizations

WIlliam1900's activity

The Smol Training Playbook

Aligning to What? Rethinking Agent Generalization in MiniMax M2

Gaia2 and ARE: Empowering the community to study agents

Reward Bench Leaderboard

Open LLM Leaderboard