Liu Hengyu's picture

Liu Hengyu

Piang

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 13 days ago

Track4World: Feedforward World-centric Dense 3D Tracking of All Pixels

submitted a paper 13 days ago

Track4World: Feedforward World-centric Dense 3D Tracking of All Pixels

upvoted a paper 18 days ago

GeoWorld: Geometric World Models

View all activity

Organizations

None yet

upvoted a paper 13 days ago

Track4World: Feedforward World-centric Dense 3D Tracking of All Pixels

Paper • 2603.02573 • Published 14 days ago • 11

upvoted a paper 18 days ago

GeoWorld: Geometric World Models

Paper • 2602.23058 • Published 19 days ago • 8

upvoted 4 papers about 1 month ago

MemFly: On-the-Fly Memory Optimization via Information Bottleneck

Paper • 2602.07885 • Published Feb 8 • 7

Olaf-World: Orienting Latent Actions for Video World Modeling

Paper • 2602.10104 • Published Feb 10 • 27

Context Forcing: Consistent Autoregressive Video Generation with Long Context

Paper • 2602.06028 • Published Feb 5 • 36

3D-Aware Implicit Motion Control for View-Adaptive Human Video Generation

Paper • 2602.03796 • Published Feb 3 • 62

upvoted 2 papers 2 months ago

Goal Force: Teaching Video Models To Accomplish Physics-Conditioned Goals

Paper • 2601.05848 • Published Jan 9 • 16

VerseCrafter: Dynamic Realistic Video World Model with 4D Geometric Control

Paper • 2601.05138 • Published Jan 8 • 18

upvoted 3 papers 5 months ago

Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations

Paper • 2510.23607 • Published Oct 27, 2025 • 179

GigaBrain-0: A World Model-Powered Vision-Language-Action Model

Paper • 2510.19430 • Published Oct 22, 2025 • 52

Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset

Paper • 2510.15742 • Published Oct 17, 2025 • 51

upvoted 2 papers 6 months ago

Seedream 4.0: Toward Next-generation Multimodal Image Generation

Paper • 2509.20427 • Published Sep 24, 2025 • 82

VolSplat: Rethinking Feed-Forward 3D Gaussian Splatting with Voxel-Aligned Prediction

Paper • 2509.19297 • Published Sep 23, 2025 • 25

upvoted 3 papers 7 months ago

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Paper • 2508.06471 • Published Aug 8, 2025 • 206

Visual-RFT: Visual Reinforcement Fine-Tuning

Paper • 2503.01785 • Published Mar 3, 2025 • 86

Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4, 2025 • 272

upvoted 2 papers 9 months ago

IR3D-Bench: Evaluating Vision-Language Model Scene Understanding as Agentic Inverse Rendering

Paper • 2506.23329 • Published Jun 29, 2025 • 8

JarvisArt: Liberating Human Artistic Creativity via an Intelligent Photo Retouching Agent

Paper • 2506.17612 • Published Jun 21, 2025 • 65

upvoted 2 papers 10 months ago

QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning

Paper • 2505.17667 • Published May 23, 2025 • 88

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14, 2025 • 338