Jeongjae Park

jjp97

AI & ML interests

I’m interested in the latest NLP and AI technologies, such as uncertainty, retrieval, agentic approaches, and long-context models!

Recent Activity

upvoted a paper about 3 hours ago

Your Group-Relative Advantage Is Biased

upvoted a paper about 3 hours ago

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

upvoted a paper about 3 hours ago

mHC: Manifold-Constrained Hyper-Connections

Organizations

None yet

upvoted a paper 2 days ago

Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge

Paper • 2601.08808 • Published 9 days ago • 34

upvoted a paper 4 days ago

Can LLMs Predict Their Own Failures? Self-Awareness via Internal Circuits

Paper • 2512.20578 • Published 30 days ago • 80

upvoted a paper 6 days ago

Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models

Paper • 2601.07372 • Published 10 days ago • 35

upvoted 3 papers 7 days ago

Qwen3-VL-Embedding and Qwen3-VL-Reranker: A Unified Framework for State-of-the-Art Multimodal Retrieval and Ranking

Paper • 2601.04720 • Published 14 days ago • 46

EpiCaR: Knowing What You Don't Know Matters for Better Reasoning in LLMs

Paper • 2601.06786 • Published 11 days ago • 5

The Confidence Dichotomy: Analyzing and Mitigating Miscalibration in Tool-Use Agents

Paper • 2601.07264 • Published 10 days ago • 23

upvoted 3 papers 8 days ago

Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization

Paper • 2512.24615 • Published 22 days ago • 114

PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning

Paper • 2601.05593 • Published 13 days ago • 78

Solar Open Technical Report

Paper • 2601.07022 • Published 11 days ago • 61

upvoted a paper 11 days ago

Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models

Paper • 2512.24618 • Published 22 days ago • 138

upvoted 4 papers 13 days ago

RelayLLM: Efficient Reasoning via Collaborative Decoding

Paper • 2601.05167 • Published 14 days ago • 28

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published 14 days ago • 202

Recursive Language Models

Paper • 2512.24601 • Published 22 days ago • 72

K-EXAONE Technical Report

Paper • 2601.01739 • Published 17 days ago • 84
