yongmao's picture

5

yongmao

yyong119

AI & ML interests

None yet

Recent Activity

upvoted a paper 12 days ago

Training-Free Group Relative Policy Optimization

upvoted a paper 12 days ago

Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning

upvoted a paper 12 days ago

Less is More: Recursive Reasoning with Tiny Networks

View all activity

Organizations

None yet

upvoted 3 papers 12 days ago

Training-Free Group Relative Policy Optimization

Paper • 2510.08191 • Published 12 days ago • 41

Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning

Paper • 2509.22601 • Published 25 days ago • 29

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published 15 days ago • 421

upvoted a paper 28 days ago

Qwen3-Omni Technical Report

Paper • 2509.17765 • Published 29 days ago • 129

upvoted a paper about 1 month ago

WebWeaver: Structuring Web-Scale Evidence with Dynamic Outlines for Open-Ended Deep Research

Paper • 2509.13312 • Published Sep 16 • 104