quangdq

kaidduong

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper 3 days ago

Skin Tokens: A Learned Compact Representation for Unified Autoregressive Rigging

Paper • 2602.04805 • Published 5 days ago • 5

upvoted 4 papers 4 days ago

Residual Context Diffusion Language Models

Paper • 2601.22954 • Published 10 days ago • 30

Kimi K2.5: Visual Agentic Intelligence

Paper • 2602.02276 • Published 7 days ago • 209

Advancing Open-source World Models

Paper • 2601.20540 • Published 12 days ago • 120

PaperBanana: Automating Academic Illustration for AI Scientists

Paper • 2601.23265 • Published 10 days ago • 155

liked a model 14 days ago

lightx2v/Qwen-Image-Edit-2511-Lightning

Image-to-Image • Updated 25 days ago • 362k • 362

upvoted a paper about 1 month ago

Xmodel-2 Technical Report

Paper • 2412.19638 • Published Dec 27, 2024 • 27

upvoted 2 papers about 2 months ago

MoCapAnything: Unified 3D Motion Capture for Arbitrary Skeletons from Monocular Videos

Paper • 2512.10881 • Published Dec 11, 2025 • 30

Towards Interactive Intelligence for Digital Humans

Paper • 2512.13674 • Published Dec 15, 2025 • 12

liked a model about 2 months ago

py-feat/mp_blendshapes

Image Feature Extraction • Updated Sep 2, 2024 • 6

upvoted a paper 2 months ago

Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer

Paper • 2511.22699 • Published Nov 27, 2025 • 237

upvoted 2 papers 3 months ago

Kimi Linear: An Expressive, Efficient Attention Architecture

Paper • 2510.26692 • Published Oct 30, 2025 • 123

Emu3.5: Native Multimodal Models are World Learners

Paper • 2510.26583 • Published Oct 30, 2025 • 110

upvoted 2 papers 4 months ago

Trace Anything: Representing Any Video in 4D via Trajectory Fields

Paper • 2510.13802 • Published Oct 15, 2025 • 31

TTT3R: 3D Reconstruction as Test-Time Training

Paper • 2509.26645 • Published Sep 30, 2025 • 15

upvoted 5 papers 5 months ago

Kling-Avatar: Grounding Multimodal Instructions for Cascaded Long-Duration Avatar Animation Synthesis

Paper • 2509.09595 • Published Sep 11, 2025 • 48

InfGen: A Resolution-Agnostic Paradigm for Scalable Image Synthesis

Paper • 2509.10441 • Published Sep 12, 2025 • 31

Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4, 2025 • 271

DINOv3

Paper • 2508.10104 • Published Aug 13, 2025 • 295

HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning

Paper • 2509.08519 • Published Sep 10, 2025 • 128