JihoPark

jiho31

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper 1 day ago

WorldCam: Interactive Autoregressive 3D Gaming Worlds with Camera Pose as a Unifying Geometric Representation

Paper • 2603.16871 • Published 2 days ago • 51

upvoted 2 papers 3 days ago

Grounding World Simulation Models in a Real-World Metropolis

Paper • 2603.15583 • Published 3 days ago • 127

Causal-JEPA: Learning World Models through Object-Level Latent Interventions

Paper • 2602.11389 • Published Feb 11 • 7

liked a dataset about 1 month ago

junwann/CSFM-ImageNet1K-Caption

Viewer • Updated Feb 6 • 1.33M • 15 • 3

upvoted 3 papers 5 months ago

Exploring Conditions for Diffusion models in Robotic Control

Paper • 2510.15510 • Published Oct 17, 2025 • 40

Lookahead Anchoring: Preserving Character Identity in Audio-Driven Human Animation

Paper • 2510.23581 • Published Oct 27, 2025 • 42

MATRIX: Mask Track Alignment for Interaction-aware Video Generation

Paper • 2510.07310 • Published Oct 8, 2025 • 36

published a dataset 6 months ago

jiho31/re10k_pixelsplat

Updated Sep 10, 2025 • 7

upvoted a paper 6 months ago

Visual Representation Alignment for Multimodal Large Language Models

Paper • 2509.07979 • Published Sep 9, 2025 • 84

upvoted 4 papers 9 months ago

Aligned Novel View Image and Geometry Synthesis via Cross-modal Attention Instillation

Paper • 2506.11924 • Published Jun 13, 2025 • 35

Fine-Grained Perturbation Guidance via Attention Head Selection

Paper • 2506.10978 • Published Jun 12, 2025 • 25

Text-Aware Image Restoration with Diffusion Models

Paper • 2506.09993 • Published Jun 11, 2025 • 45

Revisit What You See: Disclose Language Prior in Vision Tokens for Efficient Guided Decoding of LVLMs

Paper • 2506.09522 • Published Jun 11, 2025 • 21

upvoted a paper 12 months ago

URECA: Unique Region Caption Anything

Paper • 2504.05305 • Published Apr 7, 2025 • 35