1 6

Xiangchi Yuan

Xiangchi

https://xiangchi-yuan.github.io/

xiangchi-yuan

AI & ML interests

None yet

Recent Activity

authored a paper 3 months ago

Behavior Knowledge Merge in Reinforced Agentic Models

upvoted a paper 3 months ago

CloneMem: Benchmarking Long-Term Memory for AI Clones

upvoted a paper 3 months ago

Behavior Knowledge Merge in Reinforced Agentic Models

View all activity

Organizations

authored a paper 3 months ago

Behavior Knowledge Merge in Reinforced Agentic Models

Paper • 2601.13572 • Published Jan 20 • 27

upvoted 2 papers 3 months ago

CloneMem: Benchmarking Long-Term Memory for AI Clones

Paper • 2601.07023 • Published Jan 11 • 3

Behavior Knowledge Merge in Reinforced Agentic Models

Paper • 2601.13572 • Published Jan 20 • 27

submitted a paper to Daily Papers 3 months ago

Behavior Knowledge Merge in Reinforced Agentic Models

Paper • 2601.13572 • Published Jan 20 • 27

authored 7 papers 3 months ago

Growing Through Experience: Scaling Episodic Grounding in Language Models

Paper • 2506.01312 • Published Jun 2, 2025

AmoebaLLM: Constructing Any-Shape Large Language Models for Efficient and Instant Deployment

Paper • 2411.10606 • Published Nov 15, 2024 • 1

LongMamba: Enhancing Mamba's Long Context Capabilities via Training-Free Receptive Field Enlargement

Paper • 2504.16053 • Published Apr 22, 2025

SwiReasoning: Switch-Thinking in Latent and Explicit for Pareto-Superior Reasoning LLMs

Paper • 2510.05069 • Published Oct 6, 2025 • 13

LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models

Paper • 2507.14204 • Published Jul 14, 2025

Superficial Self-Improved Reasoners Benefit from Model Merging

Paper • 2503.02103 • Published Mar 3, 2025

Mitigating Forgetting Between Supervised and Reinforcement Learning Yields Stronger Reasoners

Paper • 2510.04454 • Published Oct 6, 2025

upvoted a paper 4 months ago

Nemotron-Flash: Towards Latency-Optimal Hybrid Small Language Models

Paper • 2511.18890 • Published Nov 24, 2025 • 35

upvoted 3 papers 6 months ago

SwiReasoning: Switch-Thinking in Latent and Explicit for Pareto-Superior Reasoning LLMs

Paper • 2510.05069 • Published Oct 6, 2025 • 13

Large Reasoning Models Learn Better Alignment from Flawed Thinking

Paper • 2510.00938 • Published Oct 1, 2025 • 60

Tree-based Dialogue Reinforced Policy Optimization for Red-Teaming Attacks

Paper • 2510.02286 • Published Oct 2, 2025 • 29

updated a model 8 months ago

Xiangchi/1.5b_SFT

2B • Updated Aug 8, 2025

published a model 8 months ago

Xiangchi/1.5b_SFT

2B • Updated Aug 8, 2025

updated a model over 1 year ago

Xiangchi/math_without_reason_13bf

Updated Jul 25, 2024 • 3

updated 2 models almost 2 years ago

Xiangchi/math_with_reason_13bf

Updated Jul 5, 2024 • 2

Xiangchi/grammars_13bf

Updated Jul 5, 2024 • 2

Xiangchi Yuan

AI & ML interests

Recent Activity

Organizations

Xiangchi's activity