ARC Lab, Tencent PCG
company
Verified
AI & ML interests
ARC mainly focuses on areas of computer vision, speech, and natural language processing, including speech/video generation, enhancement, retrieval, understanding, AutoML, etc. Considering research developments and industry trends, ARC consistently pursues exploration, innovation, and breakthroughs in technologies.
Recent Activity
View all activity
Papers
MotionCrafter: Dense Geometry and Motion Reconstruction with a 4D VAE
VerseCrafter: Dynamic Realistic Video World Model with 4D Geometric Control
Dense Geometry and Motion Reconstruction with a 4D VAE
[CVPR 2026] TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs
-
TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs
Paper β’ 2512.14698 β’ Published β’ 21 -
TencentARC/TimeLens-Bench
Viewer β’ Updated β’ 2.32k β’ 319 β’ 2 -
TencentARC/TimeLens-100K
Viewer β’ Updated β’ 19.2k β’ 273 β’ 5 -
TencentARC/TimeLens-8B
Video-Text-to-Text β’ 9B β’ Updated β’ 642 β’ 9
A customization method
Crafter series models for 3D reconstruction and generation
-
GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors
Paper β’ 2504.01016 β’ Published β’ 29 -
TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models
Paper β’ 2503.05638 β’ Published β’ 20 -
StereoCrafter: Diffusion-based Generation of Long and High-fidelity Stereoscopic 3D from Monocular Videos
Paper β’ 2409.07447 β’ Published β’ 1 -
DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos
Paper β’ 2409.02095 β’ Published β’ 37
Any-length Video Inpainting and Editing with Plug-and-Play Context Control
Retrieval-based manga sequence colorization
Boosting Generative Novel View Synthesis with Multiple Sparse and Unposed Images
The smallest and most efficient control models for SDXL!
Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models
Autoregressive Long Video Diffusion in Real Time
Streamlining Cartoon Production with Generative Post-Keyframing
BlobCtrl: A Unified and Flexible Framework for Element-level Image Generation and Editing
Inpainting-based image insturction editing
A 3.2 B text-to-image model distilled from flux
A Series of Powerful Visual Tokenizers
Let us create photos/paintings/avatars for anyone in any style within seconds.
Dense Geometry and Motion Reconstruction with a 4D VAE
Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models
[CVPR 2026] TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs
-
TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs
Paper β’ 2512.14698 β’ Published β’ 21 -
TencentARC/TimeLens-Bench
Viewer β’ Updated β’ 2.32k β’ 319 β’ 2 -
TencentARC/TimeLens-100K
Viewer β’ Updated β’ 19.2k β’ 273 β’ 5 -
TencentARC/TimeLens-8B
Video-Text-to-Text β’ 9B β’ Updated β’ 642 β’ 9
Autoregressive Long Video Diffusion in Real Time
A customization method
Streamlining Cartoon Production with Generative Post-Keyframing
Crafter series models for 3D reconstruction and generation
-
GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors
Paper β’ 2504.01016 β’ Published β’ 29 -
TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models
Paper β’ 2503.05638 β’ Published β’ 20 -
StereoCrafter: Diffusion-based Generation of Long and High-fidelity Stereoscopic 3D from Monocular Videos
Paper β’ 2409.07447 β’ Published β’ 1 -
DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos
Paper β’ 2409.02095 β’ Published β’ 37
BlobCtrl: A Unified and Flexible Framework for Element-level Image Generation and Editing
Any-length Video Inpainting and Editing with Plug-and-Play Context Control
Inpainting-based image insturction editing
Retrieval-based manga sequence colorization
A 3.2 B text-to-image model distilled from flux
Boosting Generative Novel View Synthesis with Multiple Sparse and Unposed Images
A Series of Powerful Visual Tokenizers
The smallest and most efficient control models for SDXL!
Let us create photos/paintings/avatars for anyone in any style within seconds.