Jingcheng Hu's picture

Jingcheng Hu

reign12

·

AI & ML interests

Foundation models and alignment

Recent Activity

liked a model 1 day ago

stepfun-ai/GELab-Zero-4B-preview

upvoted a paper 8 days ago

Step-Audio-R1 Technical Report

upvoted a paper about 1 month ago

Game-TARS: Pretrained Foundation Models for Scalable Generalist Multimodal Game Agents

View all activity

Organizations

upvoted a paper 8 days ago

Step-Audio-R1 Technical Report

Paper • 2511.15848 • Published 17 days ago • 51

upvoted 4 papers about 1 month ago

Game-TARS: Pretrained Foundation Models for Scalable Generalist Multimodal Game Agents

Paper • 2510.23691 • Published Oct 27 • 51

Tongyi DeepResearch Technical Report

Paper • 2510.24701 • Published Oct 28 • 96

The Principles of Diffusion Models

Paper • 2510.21890 • Published Oct 24 • 58

AMO-Bench: Large Language Models Still Struggle in High School Math Competitions

Paper • 2510.26768 • Published Oct 30 • 33

upvoted a paper 3 months ago

Video-MTR: Reinforced Multi-Turn Reasoning for Long Video Understanding

Paper • 2508.20478 • Published Aug 28 • 17

upvoted 3 papers 4 months ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24 • 314

Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance

Paper • 2507.22448 • Published Jul 30 • 66

Step-3 is Large yet Affordable: Model-system Co-design for Cost-effective Decoding

Paper • 2507.19427 • Published Jul 25 • 18

upvoted 3 papers 5 months ago

SWE-Perf: Can Language Models Optimize Code Performance on Real-World Repositories?

Paper • 2507.12415 • Published Jul 16 • 42

Vision Foundation Models as Effective Visual Tokenizers for Autoregressive Image Generation

Paper • 2507.08441 • Published Jul 11 • 61

Open Vision Reasoner: Transferring Linguistic Cognitive Behavior for Visual Reasoning

Paper • 2507.05255 • Published Jul 7 • 74

upvoted 2 papers 7 months ago

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published Apr 18 • 138

TTRL: Test-Time Reinforcement Learning

Paper • 2504.16084 • Published Apr 22 • 120

upvoted 6 papers 8 months ago

Step1X-Edit: A Practical Framework for General Image Editing

Paper • 2504.17761 • Published Apr 24 • 92

Kimi-VL Technical Report

Paper • 2504.07491 • Published Apr 10 • 132

Rethinking Reflection in Pre-Training

Paper • 2504.04022 • Published Apr 5 • 79

MegaMath: Pushing the Limits of Open Math Corpora

Paper • 2504.02807 • Published Apr 3 • 34

Expanding RL with Verifiable Rewards Across Diverse Domains

Paper • 2503.23829 • Published Mar 31 • 23

What, How, Where, and How Well? A Survey on Test-Time Scaling in Large Language Models

Paper • 2503.24235 • Published Mar 31 • 54