Yanhan Ye's picture

1 5

Yanhan Ye

CoolColoury

·

CoolColoury

AI & ML interests

None yet

Recent Activity

upvoted a paper 22 days ago

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

upvoted a paper 22 days ago

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

upvoted a collection 3 months ago

View all activity

Organizations

upvoted 2 papers 22 days ago

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

Paper • 2601.08763 • Published 24 days ago • 146

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

Paper • 2601.09667 • Published 23 days ago • 86

upvoted a collection 3 months ago

PCC-Finetuned

11 items • Updated Sep 22, 2025 • 2

New activity in BroAlanTaps/Stage2-PCC-Lite-4x 3 months ago

Could you please open source the 4x PCC lite model weights based on the 【Mistral】 model?

#1 opened 3 months ago by

upvoted an article about 1 year ago

Article

Mastering Tensor Dimensions in Transformers

Jan 12, 2025

•

134

authored a paper almost 2 years ago

LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models

Paper • 2403.13372 • Published Mar 20, 2024 • 178

upvoted a paper almost 2 years ago

LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models

Paper • 2403.13372 • Published Mar 20, 2024 • 178