Reasoning - a DylanASHillier Collection

Reasoning

updated Mar 8, 2024

Self-Discover: Large Language Models Self-Compose Reasoning Structures

Paper • 2402.03620 • Published Feb 6, 2024 • 116

Chain-of-Thought Reasoning Without Prompting

Paper • 2402.10200 • Published Feb 15, 2024 • 109

Orca-Math: Unlocking the potential of SLMs in Grade School Math

Paper • 2402.14830 • Published Feb 16, 2024 • 26

Teaching Large Language Models to Reason with Reinforcement Learning

Paper • 2403.04642 • Published Mar 7, 2024 • 51