oh sehun's picture

oh sehun

sehun

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 15 hours ago

Nested Browser-Use Learning for Agentic Information Seeking

upvoted a paper about 15 hours ago

mHC: Manifold-Constrained Hyper-Connections

upvoted a paper about 15 hours ago

Evaluating Parameter Efficient Methods for RLVR

View all activity

Organizations

upvoted 3 papers about 15 hours ago

Nested Browser-Use Learning for Agentic Information Seeking

Paper • 2512.23647 • Published 4 days ago • 17

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published 2 days ago • 119

Evaluating Parameter Efficient Methods for RLVR

Paper • 2512.23165 • Published 5 days ago • 20

upvoted 2 papers 2 days ago

Diffusion Knows Transparency: Repurposing Video Diffusion for Transparent Object Depth and Normal Estimation

Paper • 2512.23705 • Published 4 days ago • 39

GRAN-TED: Generating Robust, Aligned, and Nuanced Text Embedding for Diffusion Models

Paper • 2512.15560 • Published 16 days ago • 24

upvoted a paper 3 days ago

Dream-VL & Dream-VLA: Open Vision-Language and Vision-Language-Action Models with Diffusion Language Model Backbone

Paper • 2512.22615 • Published 6 days ago • 38

upvoted a paper 4 days ago

TimeBill: Time-Budgeted Inference for Large Language Models

Paper • 2512.21859 • Published 8 days ago • 18

upvoted a paper 7 days ago

Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning

Paper • 2512.20605 • Published 10 days ago • 59

upvoted 3 papers 8 days ago

Latent Implicit Visual Reasoning

Paper • 2512.21218 • Published 9 days ago • 63

Beyond Memorization: A Multi-Modal Ordinal Regression Benchmark to Expose Popularity Bias in Vision-Language Models

Paper • 2512.21337 • Published 9 days ago • 26

Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models

Paper • 2512.20557 • Published 10 days ago • 48

upvoted 5 papers 9 days ago

LongVideoAgent: Multi-Agent Reasoning with Long Videos

Paper • 2512.20618 • Published 10 days ago • 52

Reasoning Palette: Modulating Reasoning via Latent Contextualization for Controllable Exploration for (V)LMs

Paper • 2512.17206 • Published 15 days ago • 19

Reinforcement Learning for Self-Improving Agent with Skill Library

Paper • 2512.17102 • Published 15 days ago • 30

CASA: Cross-Attention via Self-Attention for Efficient Vision-Language Fusion

Paper • 2512.19535 • Published 11 days ago • 10

Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies

Paper • 2512.19673 • Published 11 days ago • 60

upvoted 2 papers 10 days ago

QuCo-RAG: Quantifying Uncertainty from the Pre-training Corpus for Dynamic Retrieval-Augmented Generation

Paper • 2512.19134 • Published 12 days ago • 31

WorldWarp: Propagating 3D Geometry with Asynchronous Video Diffusion

Paper • 2512.19678 • Published 11 days ago • 29

upvoted 2 papers 11 days ago

Physics of Language Models: Part 4.1, Architecture Design and the Magic of Canon Layers

Paper • 2512.17351 • Published 15 days ago • 24

Next-Embedding Prediction Makes Strong Vision Learners

Paper • 2512.16922 • Published 15 days ago • 82