tongxiao

tongxiao2002

https://tongxiao2002.github.io

tongxiao2002

AI & ML interests

None yet

Recent Activity

upvoted a paper 9 days ago

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

upvoted a paper 9 days ago

Easy Dataset: A Unified and Extensible Framework for Synthesizing LLM Fine-Tuning Data from Unstructured Documents

upvoted a paper 19 days ago

Thinking-Free Policy Initialization Makes Distilled Reasoning Models More Effective and Efficient Reasoners

View all activity

Organizations

upvoted 2 papers 9 days ago

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

Paper • 2508.02317 • Published Aug 4 • 18

Easy Dataset: A Unified and Extensible Framework for Synthesizing LLM Fine-Tuning Data from Unstructured Documents

Paper • 2507.04009 • Published Jul 5 • 48

upvoted a paper 19 days ago

Thinking-Free Policy Initialization Makes Distilled Reasoning Models More Effective and Efficient Reasoners

Paper • 2509.26226 • Published 20 days ago • 31

upvoted a collection 26 days ago

Qwen3-VL

Collection

17 items • Updated 5 days ago • 272

liked a dataset about 1 month ago

HuggingFaceM4/FineVision

Viewer • Updated 9 days ago • 24.1M • 209k • 369

upvoted a paper about 1 month ago

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4 • 187

upvoted 4 papers about 2 months ago

LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model

Paper • 2509.00676 • Published Aug 31 • 83

VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use

Paper • 2509.01055 • Published Sep 1 • 71

R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning

Paper • 2508.21113 • Published Aug 28 • 109

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Paper • 2508.18265 • Published Aug 25 • 201

updated a model about 2 months ago

tongxiao2002/Perception-R1-7B

8B • Updated Aug 27 • 2 • 1

published a model 2 months ago

tongxiao2002/Perception-R1-7B

8B • Updated Aug 27 • 2 • 1

upvoted 4 papers 3 months ago

The Invisible Leash: Why RLVR May Not Escape Its Origin

Paper • 2507.14843 • Published Jul 20 • 84

Open Vision Reasoner: Transferring Linguistic Cognitive Behavior for Visual Reasoning

Paper • 2507.05255 • Published Jul 7 • 74

Perception-Aware Policy Optimization for Multimodal Reasoning

Paper • 2507.06448 • Published Jul 8 • 47

Skywork-R1V3 Technical Report

Paper • 2507.06167 • Published Jul 8 • 71

updated a dataset 3 months ago

tongxiao2002/Perception-R1-Dataset

Viewer • Updated Jul 11 • 500 • 83

upvoted a paper 3 months ago

Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology

Paper • 2507.07999 • Published Jul 10 • 48

published a dataset 3 months ago

tongxiao2002/Perception-R1-Dataset

Viewer • Updated Jul 11 • 500 • 83

upvoted a paper 4 months ago

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14 • 302

tongxiao

AI & ML interests

Recent Activity

Organizations

tongxiao2002's activity