liu zh

morphism42

AI & ML interests

None yet

Organizations

None yet

morphism42's activity

upvoted a paper 3 months ago

Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search

Paper • 2502.02508 • Published Feb 4 • 23

upvoted an article 7 months ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Sep 18, 2024

• 235

upvoted 2 articles 9 months ago

Article

How NuminaMath Won the 1st AIMO Progress Prize

Jul 11, 2024

• 119

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

Dec 9, 2022

• 235

upvoted an article 12 months ago

Article

Personal Copilot: Train Your Own Coding Assistant

Oct 27, 2023

• 51