木村優斗

hehaoran47

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper 3 days ago

How Fast Should a Model Commit to Supervision? Training Reasoning Models on the Tsallis Loss Continuum

Paper • 2604.25907 • Published 12 days ago • 3

upvoted a paper 8 days ago

Heterogeneous Scientific Foundation Model Collaboration

Paper • 2604.27351 • Published 10 days ago • 208

liked 2 models 27 days ago

BeaverAI/Artemis-31B-v1d-GGUF-BROKEN

31B • Updated 27 days ago • 577 • 1

Albertoo12/IndoBertV2-finetune

0.1B • Updated 27 days ago • 76 • 1

upvoted a paper 29 days ago

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published Apr 3 • 627

liked a dataset 29 days ago

galaxythereal/competition-frames-dataset1

Viewer • Updated 29 days ago • 1.11k • 854 • 1

liked 3 models about 1 month ago

juanvilla/CACAOIA

Updated about 1 month ago • 1

Bingsu/adetailer

Updated Nov 21, 2024 • 15.6M • 690

chilkersion/Mokoko2609

Updated 16 days ago • 1

upvoted 3 papers about 1 month ago

SEAR: Schema-Based Evaluation and Routing for LLM Gateways

Paper • 2603.26728 • Published Mar 20 • 12

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published Mar 20 • 350

MedOpenClaw: Auditable Medical Imaging Agents Reasoning over Uncurated Full Studies

Paper • 2603.24649 • Published Mar 25 • 31

upvoted 2 papers about 2 months ago

Astrolabe: Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video Models

Paper • 2603.17051 • Published Mar 17 • 109

SocialOmni: Benchmarking Audio-Visual Social Interactivity in Omni Models

Paper • 2603.16859 • Published Mar 17 • 248