Jaehyun Jun

btjhjeon

https://btjhjeon.github.io/

AI & ML interests

Multimodal

Recent Activity

updated a collection 2 days ago

Multimodal Agent

updated a collection 2 days ago

Multimodal Reasoning

updated a collection 2 days ago

upvoted 2 papers 3 days ago

Qwen3-VL Technical Report

Paper • 2511.21631 • Published 15 days ago • 119

SIMA 2: A Generalist Embodied Agent for Virtual Worlds

Paper • 2512.04797 • Published 7 days ago • 19

upvoted 2 papers 8 days ago

Skywork-R1V4: Toward Agentic Multimodal Intelligence through Interleaved Thinking with Images and DeepResearch

Paper • 2512.02395 • Published 9 days ago • 46

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published 9 days ago • 197

upvoted 2 papers 13 days ago

UniGame: Turning a Unified Multimodal Model Into Its Own Adversary

Paper • 2511.19413 • Published 17 days ago • 20

NVIDIA Nemotron Parse 1.1

Paper • 2511.20478 • Published 16 days ago • 20

upvoted 4 papers about 1 month ago

GUI-360: A Comprehensive Dataset and Benchmark for Computer-Using Agents

Paper • 2511.04307 • Published Nov 6 • 14

NVIDIA Nemotron Nano V2 VL

Paper • 2511.03929 • Published Nov 6 • 26

JanusCoder: Towards a Foundational Visual-Programmatic Interface for Code Intelligence

Paper • 2510.23538 • Published Oct 27 • 96

Emu3.5: Native Multimodal Models are World Learners

Paper • 2510.26583 • Published Oct 30 • 107

upvoted a collection 2 months ago

Qwen3

84 items • Updated Aug 6 • 1.48k

upvoted 2 papers 3 months ago

MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe

Paper • 2509.18154 • Published Sep 16 • 51

Qwen3-Omni Technical Report

Paper • 2509.17765 • Published Sep 22 • 139

upvoted 2 collections 3 months ago

Qwen3-VL

37 items • Updated Nov 1 • 507

Qwen3-Omni

6 items • Updated Oct 9 • 171

upvoted 2 papers 3 months ago

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4 • 193

Robix: A Unified Model for Robot Interaction, Reasoning and Planning

Paper • 2509.01106 • Published Sep 1 • 49

upvoted 3 papers 4 months ago

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21 • 256

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Paper • 2508.18265 • Published Aug 25 • 208

Beyond the Trade-off: Self-Supervised Reinforcement Learning for Reasoning Models' Instruction Following

Paper • 2508.02150 • Published Aug 4 • 36