Doohyuk Jang's picture

5 8

Doohyuk Jang

jadohu

·

https://jadohu.github.io

AI & ML interests

None yet

Recent Activity

updated a model about 1 month ago

jadohu/Qwen2.5-32B-GRPO

updated a model about 1 month ago

jadohu/Qwen3-8B-GRPO

updated a model about 1 month ago

jadohu/Qwen3-8B-MASA-efficient

View all activity

Organizations

upvoted a paper about 1 month ago

Instruction-Guided Lesion Segmentation for Chest X-rays with Automatically Generated Large-Scale Dataset

Paper • 2511.15186 • Published Nov 19, 2025 • 25

upvoted a paper 2 months ago

When to Ensemble: Identifying Token-Level Points for Stable and Fast LLM Ensembling

Paper • 2510.15346 • Published Oct 17, 2025 • 33

upvoted 4 papers 3 months ago

HAMLET: Switch your Vision-Language-Action Model into a History-Aware Policy

Paper • 2510.00695 • Published Oct 1, 2025 • 5

Meta-Awareness Enhances Reasoning Models: Self-Alignment Reinforcement Learning

Paper • 2510.03259 • Published Sep 26, 2025 • 57

No Prompt Left Behind: Exploiting Zero-Variance Prompts in LLM Reinforcement Learning via Entropy-Guided Advantage Shaping

Paper • 2509.21880 • Published Sep 26, 2025 • 52

ReviewScore: Misinformed Peer Review Detection with Large Language Models

Paper • 2509.21679 • Published Sep 25, 2025 • 63

upvoted 2 papers 7 months ago

Revisiting Multi-Agent Debate as Test-Time Scaling: A Systematic Study of Conditional Effectiveness

Paper • 2505.22960 • Published May 29, 2025 • 16

Reasoning Model is Stubborn: Diagnosing Instruction Overriding in Reasoning Models

Paper • 2505.17225 • Published May 22, 2025 • 64