LLM paper - a Wcb0219 Collection

Wcb0219 's Collections

LLM paper

updated 22 days ago

Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space

Paper • 2505.15778 • Published May 21 • 18
UserRL: Training Interactive User-Centric Agent via Reinforcement Learning

Paper • 2509.19736 • Published 25 days ago • 11