11

Haolin Liu

lhl616

AI & ML interests

None yet

Organizations

None yet

upvoted 2 papers 16 days ago

VOGUE: Guiding Exploration with Visual Uncertainty Improves Multimodal Reasoning

Paper • 2510.01444 • Published 17 days ago • 19

CLUE: Non-parametric Verification from Experience via Hidden-State Clustering

Paper • 2510.01591 • Published 17 days ago • 26

upvoted a paper 18 days ago

TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning

Paper • 2509.25760 • Published 19 days ago • 52

upvoted a paper 30 days ago

Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation

Paper • 2509.15194 • Published about 1 month ago • 33

upvoted 3 papers about 1 month ago

CDE: Curiosity-Driven Exploration for Efficient Reinforcement Learning in Large Language Models

Paper • 2509.09675 • Published Sep 11 • 28

Parallel-R1: Towards Parallel Thinking via Reinforcement Learning

Paper • 2509.07980 • Published Sep 9 • 98

Beyond Correctness: Harmonizing Process and Outcome Rewards through RL Training

Paper • 2509.03403 • Published Sep 3 • 21

upvoted a paper about 2 months ago

Self-Rewarding Vision-Language Model via Reasoning Decomposition

Paper • 2508.19652 • Published Aug 27 • 84

upvoted 2 papers 2 months ago

Towards Optimal Regret in Adversarial Linear MDPs with Bandit Feedback

Paper • 2310.11550 • Published Oct 17, 2023 • 1

R-Zero: Self-Evolving Reasoning LLM from Zero Data

Paper • 2508.05004 • Published Aug 7 • 126

authored a paper 3 months ago

One Token to Fool LLM-as-a-Judge

Paper • 2507.08794 • Published Jul 11 • 31

upvoted a paper 3 months ago

One Token to Fool LLM-as-a-Judge

Paper • 2507.08794 • Published Jul 11 • 31