HanSaem Kim's picture

HanSaem Kim

kensaem

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 5 hours ago

RAVEN: Real-time Autoregressive Video Extrapolation with Consistency-model GRPO

upvoted a paper about 5 hours ago

SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer

upvoted a paper about 8 hours ago

Causal Forcing++: Scalable Few-Step Autoregressive Diffusion Distillation for Real-Time Interactive Video Generation

View all activity

Organizations

None yet

upvoted 2 papers about 5 hours ago

RAVEN: Real-time Autoregressive Video Extrapolation with Consistency-model GRPO

Paper • 2605.15190 • Published 6 days ago • 12

SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer

Paper • 2605.15178 • Published 6 days ago • 75

upvoted 4 papers about 8 hours ago

Causal Forcing++: Scalable Few-Step Autoregressive Diffusion Distillation for Real-Time Interactive Video Generation

Paper • 2605.15141 • Published 6 days ago • 88

LiteFrame: Efficient Vision Encoders Unlock Frame Scaling in Video LLMs

Paper • 2605.17260 • Published 3 days ago • 18

LongLive-2.0: An NVFP4 Parallel Infrastructure for Long Video Generation

Paper • 2605.18739 • Published 1 day ago • 84

AI for Auto-Research: Roadmap & User Guide

Paper • 2605.18661 • Published 1 day ago • 43

upvoted a paper 5 days ago

Qwen-Image-VAE-2.0 Technical Report

Paper • 2605.13565 • Published 7 days ago • 56

upvoted 3 papers 7 days ago

Mean Mode Screaming: Mean--Variance Split Residuals for 1000-Layer Diffusion Transformers

Paper • 2605.06169 • Published 13 days ago • 184

Model Merging Scaling Laws in Large Language Models

Paper • 2509.24244 • Published 9 days ago • 44

Soohak: A Mathematician-Curated Benchmark for Evaluating Research-level Math Capabilities of LLMs

Paper • 2605.09063 • Published 11 days ago • 77

upvoted a collection 19 days ago

SenseNova-U1

SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-Unify Architecture • 8 items • Updated 4 days ago • 64

upvoted a paper 19 days ago

Sapiens2

Paper • 2604.21681 • Published 27 days ago • 19

upvoted 6 papers 21 days ago

HP-Edit: A Human-Preference Post-Training Framework for Image Editing

Paper • 2604.19406 • Published 29 days ago • 7

LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model

Paper • 2604.20796 • Published 28 days ago • 240

Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, Challenges

Paper • 2604.13602 • Published Apr 15 • 32

Seeing Fast and Slow: Learning the Flow of Time in Videos

Paper • 2604.21931 • Published 27 days ago • 19

Tuna-2: Pixel Embeddings Beat Vision Encoders for Multimodal Understanding and Generation

Paper • 2604.24763 • Published 23 days ago • 70

FlowAnchor: Stabilizing the Editing Signal for Inversion-Free Video Editing

Paper • 2604.22586 • Published 26 days ago • 16

upvoted a collection 22 days ago

Qwen-Image

14 items • Updated Dec 31, 2025 • 106

liked a model 26 days ago

prithivMLmods/QIE-2511-Zoom-Master

Image-to-Image • Updated Jan 24 • 279 • • 21