Self-improving LLMs
updated
Self-Taught Self-Correction for Small Language Models
Paper
• 2503.08681
• Published • 15
Self-Improving Robust Preference Optimization
Paper
• 2406.01660
• Published • 20
LADDER: Self-Improving LLMs Through Recursive Problem Decomposition
Paper
• 2503.00735
• Published • 23
Meta-Rewarding Language Models: Self-Improving Alignment with
LLM-as-a-Meta-Judge
Paper
• 2407.19594
• Published • 21
Self-Taught Optimizer (STOP): Recursively Self-Improving Code Generation
Paper
• 2310.02304
• Published • 1
Cognitive Behaviors that Enable Self-Improving Reasoners, or, Four
Habits of Highly Effective STaRs
Paper
• 2503.01307
• Published • 38
Large Language Models Can Self-Improve in Long-context Reasoning
Paper
• 2411.08147
• Published • 65
B-STaR: Monitoring and Balancing Exploration and Exploitation in
Self-Taught Reasoners
Paper
• 2412.17256
• Published • 47
Sample, Scrutinize and Scale: Effective Inference-Time Search by Scaling
Verification
Paper
• 2502.01839
• Published • 10
Enabling Scalable Oversight via Self-Evolving Critic
Paper
• 2501.05727
• Published • 72
Symbolic Learning Enables Self-Evolving Agents
Paper
• 2406.18532
• Published • 12
A Survey on Self-Evolution of Large Language Models
Paper
• 2404.14387
• Published • 3
Gödel Agent: A Self-Referential Agent Framework for Recursive
Self-Improvement
Paper
• 2410.04444
• Published • 3
Self-Tuning: Instructing LLMs to Effectively Acquire New Knowledge
through Self-Teaching
Paper
• 2406.06326
• Published • 2
Learning Evolving Tools for Large Language Models
Paper
• 2410.06617
• Published • 2
LLM-FE: Automated Feature Engineering for Tabular Data with LLMs as
Evolutionary Optimizers
Paper
• 2503.14434
• Published • 7
Self-Rewarding Language Models
Paper
• 2401.10020
• Published • 153