Young-Jun Lee PRO

passing2961

https://sites.google.com/view/passing2961/home

AI & ML interests

Social Dialogue System, Multi-Modal Dialogue

Recent Activity

upvoted a paper 6 days ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published 6 days ago • 176

upvoted 2 papers 7 days ago

SciEvalKit: An Open-source Evaluation Toolkit for Scientific General Intelligence

Paper • 2512.22334 • Published 19 days ago • 34

Agentic Rubrics as Contextual Verifiers for SWE Agents

Paper • 2601.04171 • Published 7 days ago • 10

upvoted a paper 8 days ago

NitroGen: An Open Foundation Model for Generalist Gaming Agents

Paper • 2601.02427 • Published 10 days ago • 38

upvoted a paper 9 days ago

K-EXAONE Technical Report

Paper • 2601.01739 • Published 10 days ago • 81

upvoted a paper 16 days ago

Training AI Co-Scientists Using Rubric Rewards

Paper • 2512.23707 • Published 16 days ago • 19

upvoted an article 16 days ago

Bringing Fusion Down to Earth: ML for Stellarator Optimization

Article • Published Jul 2, 2025

upvoted a paper 16 days ago

Masking Teacher and Reinforcing Student for Distilling Vision-Language Models

Paper • 2512.22238 • Published 22 days ago • 19

upvoted a paper 17 days ago

UniPercept: Towards Unified Perceptual-Level Image Understanding across Aesthetics, Quality, Structure, and Texture

Paper • 2512.21675 • Published 20 days ago • 24

upvoted a paper 20 days ago

SWE-EVO: Benchmarking Coding Agents in Long-Horizon Software Evolution Scenarios

Paper • 2512.18470 • Published 25 days ago • 10

upvoted 4 papers 22 days ago

Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows

Paper • 2512.16969 • Published 28 days ago • 112

MobileWorld: Benchmarking Autonomous Mobile Agents in Agent-User Interactive, and MCP-Augmented Environments

Paper • 2512.19432 • Published 23 days ago • 12

QuantiPhy: A Quantitative Benchmark Evaluating Physical Reasoning Abilities of Vision-Language Models

Paper • 2512.19526 • Published 23 days ago • 11

Reinforcement Learning for Self-Improving Agent with Skill Library

Paper • 2512.17102 • Published 27 days ago • 32

upvoted 3 papers 25 days ago

Nemotron-Math: Efficient Long-Context Distillation of Mathematical Reasoning from Multi-Mode Supervision

Paper • 2512.15489 • Published 28 days ago • 6

Adaptation of Agentic AI

Paper • 2512.16301 • Published 28 days ago • 101

Kling-Omni Technical Report

Paper • 2512.16776 • Published 27 days ago • 166

upvoted 2 papers 29 days ago

Finch: Benchmarking Finance & Accounting across Spreadsheet-Centric Enterprise Workflows

Paper • 2512.13168 • Published about 1 month ago • 49

Olmo 3

Paper • 2512.13961 • Published about 1 month ago • 23

upvoted a paper about 1 month ago

The FACTS Leaderboard: A Comprehensive Benchmark for Large Language Model Factuality

Paper • 2512.10791 • Published Dec 11, 2025 • 7
