Open to Collab

6 17 9

Yu Zeng

YuZeng260

https://scholar.google.com/citations?user=XJmAr8EAAAAJ&hl=en&oi=sra

yuzeng0-0

AI & ML interests

VLMs, LLMs, RL, Agent, Reasoning

Recent Activity

authored a paper 3 days ago

SkillFlow:Benchmarking Lifelong Skill Discovery and Evolution for Autonomous Agents

submitted a paper 3 days ago

SkillFlow:Benchmarking Lifelong Skill Discovery and Evolution for Autonomous Agents

upvoted a paper 3 days ago

SkillFlow:Benchmarking Lifelong Skill Discovery and Evolution for Autonomous Agents

View all activity

Organizations

authored a paper 3 days ago

SkillFlow:Benchmarking Lifelong Skill Discovery and Evolution for Autonomous Agents

Paper • 2604.17308 • Published 5 days ago • 21

submitted a paper to Daily Papers 3 days ago

SkillFlow:Benchmarking Lifelong Skill Discovery and Evolution for Autonomous Agents

Paper • 2604.17308 • Published 5 days ago • 21

upvoted a paper 3 days ago

SkillFlow:Benchmarking Lifelong Skill Discovery and Evolution for Autonomous Agents

Paper • 2604.17308 • Published 5 days ago • 21

upvoted a paper 16 days ago

Beyond Accuracy: Unveiling Inefficiency Patterns in Tool-Integrated Reasoning

Paper • 2604.05404 • Published 17 days ago • 42

liked a Space 25 days ago

Unlocking On-Policy Distillation for Any Model Family

📝

Visualize on-policy distillation for any model family

upvoted a paper 29 days ago

Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?

Paper • 2603.24472 • Published 30 days ago • 54

authored a paper 2 months ago

VimRAG: Navigating Massive Visual Context in Retrieval-Augmented Generation via Multimodal Memory Graph

Paper • 2602.12735 • Published Feb 13 • 8

upvoted 2 papers 2 months ago

VimRAG: Navigating Massive Visual Context in Retrieval-Augmented Generation via Multimodal Memory Graph

Paper • 2602.12735 • Published Feb 13 • 8

Secure Code Generation via Online Reinforcement Learning with Vulnerability Reward Model

Paper • 2602.07422 • Published Feb 7 • 22

authored a paper 2 months ago

Internalizing Meta-Experience into Memory for Guided Reinforcement Learning in Large Language Models

Paper • 2602.10224 • Published Feb 10 • 19

upvoted a paper 2 months ago

Internalizing Meta-Experience into Memory for Guided Reinforcement Learning in Large Language Models

Paper • 2602.10224 • Published Feb 10 • 19

submitted a paper to Daily Papers 2 months ago

Internalizing Meta-Experience into Memory for Guided Reinforcement Learning in Large Language Models

Paper • 2602.10224 • Published Feb 10 • 19

authored 2 papers 3 months ago

Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models

Paper • 2601.22060 • Published Jan 29 • 155

Vision-DeepResearch Benchmark: Rethinking Visual and Textual Search for Multimodal Large Language Models

Paper • 2602.02185 • Published Feb 2 • 118

commented a paper 3 months ago

Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models

Paper • 2601.22060 • Published Jan 29 • 155 •

upvoted a paper 3 months ago

Vision-DeepResearch Benchmark: Rethinking Visual and Textual Search for Multimodal Large Language Models

Paper • 2602.02185 • Published Feb 2 • 118

submitted a paper to Daily Papers 3 months ago

Vision-DeepResearch Benchmark: Rethinking Visual and Textual Search for Multimodal Large Language Models

Paper • 2602.02185 • Published Feb 2 • 118

upvoted a paper 3 months ago

Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models

Paper • 2601.22060 • Published Jan 29 • 155

submitted a paper to Daily Papers 3 months ago

Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models

Paper • 2601.22060 • Published Jan 29 • 155

upvoted a collection 3 months ago

Vision-DeepResearch

Collection

7 items • Updated Feb 3 • 3

Yu Zeng

AI & ML interests

Recent Activity

Organizations

YuZeng260's activity

Unlocking On-Policy Distillation for Any Model Family