melisa's Collections
Self-improving LLMs
Self-Taught Self-Correction for Small Language Models
Paper • 2503.08681 • Published • 13
Self-Improving Robust Preference Optimization
Paper • 2406.01660 • Published • 20
LADDER: Self-Improving LLMs Through Recursive Problem Decomposition
Paper • 2503.00735 • Published • 20
Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge
Paper • 2407.19594 • Published • 20
Self-Taught Optimizer (STOP): Recursively Self-Improving Code Generation
Paper • 2310.02304 • Published • 1
Cognitive Behaviors that Enable Self-Improving Reasoners, or, Four Habits of Highly Effective STaRs
Paper • 2503.01307 • Published • 37
Large Language Models Can Self-Improve in Long-context Reasoning
Paper • 2411.08147 • Published • 67
B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners
Paper • 2412.17256 • Published • 48
Sample, Scrutinize and Scale: Effective Inference-Time Search by Scaling Verification
Paper • 2502.01839 • Published • 8
Enabling Scalable Oversight via Self-Evolving Critic
Paper • 2501.05727 • Published • 75
Symbolic Learning Enables Self-Evolving Agents
Paper • 2406.18532 • Published • 12
A Survey on Self-Evolution of Large Language Models
Paper • 2404.14387 • Published • 3
Gödel Agent: A Self-Referential Agent Framework for Recursive Self-Improvement
Paper • 2410.04444 • Published • 3
Self-Tuning: Instructing LLMs to Effectively Acquire New Knowledge through Self-Teaching
Paper • 2406.06326 • Published • 2
Learning Evolving Tools for Large Language Models
Paper • 2410.06617 • Published • 2
LLM-FE: Automated Feature Engineering for Tabular Data with LLMs as Evolutionary Optimizers
Paper • 2503.14434 • Published • 7
Self-Rewarding Language Models
Paper • 2401.10020 • Published • 148