xylcbd (xylcbd)

upvoted 2 papers 3 months ago

Video models are zero-shot learners and reasoners

Paper • 2509.20328 • Published Sep 24, 2025 • 99

SIM-CoT: Supervised Implicit Chain-of-Thought

Paper • 2509.20317 • Published Sep 24, 2025 • 41

upvoted a paper 4 months ago

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

Paper • 2508.05629 • Published Aug 7, 2025 • 180

upvoted 2 papers 5 months ago

DINOv3

Paper • 2508.10104 • Published Aug 13, 2025 • 291

Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4, 2025 • 267

upvoted a collection 5 months ago

DINOv3

Collection

DINOv3: foundation models producing excellent dense features, outperforming SotA w/o fine-tuning - https://arxiv.org/abs/2508.10104 • 13 items • Updated Aug 21, 2025 • 435

upvoted a paper 5 months ago

ForCenNet: Foreground-Centric Network for Document Image Rectification

Paper • 2507.19804 • Published Jul 26, 2025 • 11

upvoted an article 7 months ago

Article

GRPO for GUI Grounding Done Right

Jun 11, 2025

•

35

upvoted a paper 12 months ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8, 2025 • 287

upvoted 3 collections over 1 year ago

xylcbd

AI & ML interests

Organizations

Video models are zero-shot learners and reasoners

SIM-CoT: Supervised Implicit Chain-of-Thought

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

DINOv3

Qwen-Image Technical Report

DINOv3

ForCenNet: Foreground-Centric Network for Document Image Rectification

GRPO for GUI Grounding Done Right

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

InternVL2.0

Qwen2

MGM

xylcbd

AI & ML interests

Organizations

xylcbd's activity

GRPO for GUI Grounding Done Right