-
Reasoning Under 1 Billion: Memory-Augmented Reinforcement Learning for Large Language Models
Paper • 2504.02273 • Published • 5 -
Multi-Reference Preference Optimization for Large Language Models
Paper • 2405.16388 • Published • 1 -
Automatic Prompt Selection for Large Language Models
Paper • 2404.02717 • Published • 1
Hung Le
neurocoder
AI & ML interests
None yet
Recent Activity
published
a model
14 days ago
neurocoder/Qwen2.5-0.5B-Open-R1-Code-GRPO
updated
a collection
18 days ago
My papers
Organizations
None yet
Collections
1
models
5

neurocoder/Qwen2.5-0.5B-Open-R1-Code-GRPO
Updated

neurocoder/Qwen2.5-0.5B-Instruct-MemoryR
Updated

neurocoder/Falcon3-1B-Instruct-sft-math-gsm8k
Updated
•
3

neurocoder/Llama-3.2-1B-Instruct-sft-math-gsm8k
Text Generation
•
Updated
•
3

neurocoder/logsQwen2.5-0.5B-Instruct-math-gsm8k
Text Generation
•
Updated
•
3
datasets
None public yet