SON, SEONG HO

geronest

AI & ML interests

None yet

Recent Activity

commented on a paper 4 days ago

Multi-Task GRPO: Reliable LLM Reasoning Across Tasks

upvoted a paper 4 days ago

Multi-Task GRPO: Reliable LLM Reasoning Across Tasks

updated a model 15 days ago

Meta-Okapi/ca_bloom7b1_adaptdpo_tdata100_lora_2msteps_200steps_batch20_gradacc2_200steps

upvoted a paper 4 days ago

Multi-Task GRPO: Reliable LLM Reasoning Across Tasks

Paper • 2602.05547 • Published 5 days ago • 11

upvoted a collection 10 months ago

🧠 Reasoning datasets

Datasets with reasoning traces for math and code released by the community • 24 items • Updated May 19, 2025 • 182

upvoted a paper 11 months ago

This Is Your Doge, If It Please You: Exploring Deception and Robustness in Mixture of LLMs

Paper • 2503.05856 • Published Mar 7, 2025 • 7