Zhaocheng Liu

zhaocheng

https://scholar.google.com/citations?user=Kk-dRIAAAAAJ

zhaocheng's activity

upvoted a paper 3 days ago

Sibyl: Simple yet Effective Agent Framework for Complex Real-world Reasoning

Paper • 2407.10718 • Published Jul 15, 2024 • 19

liked a dataset 4 days ago

naver-clova-ix/cord-v2

Viewer • Updated Jul 19, 2022 • 1k • 3.58k • 83

upvoted a paper 10 days ago

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

Paper • 2505.22617 • Published 11 days ago • 116

upvoted a paper 24 days ago

Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures

Paper • 2505.09343 • Published 25 days ago • 64

upvoted a paper 25 days ago

Measuring General Intelligence with Generated Games

Paper • 2505.07215 • Published 27 days ago • 10

liked a dataset 25 days ago

a-m-team/AM-DeepSeek-R1-Distilled-1.4M

Preview • Updated Mar 30 • 3.35k • 145

liked a model 25 days ago

a-m-team/AM-Thinking-v1

Text Generation • Updated 25 days ago • 9.88k • 181

upvoted a paper about 1 month ago

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Paper • 2504.20571 • Published Apr 29 • 94

commented on a paper about 2 months ago

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published Apr 18 • 127

upvoted a paper about 2 months ago

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published Apr 18 • 127

upvoted a paper 2 months ago

Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback

Paper • 2503.22230 • Published Mar 28 • 44

liked a dataset 3 months ago

BytedTsinghua-SIA/DAPO-Math-17k

Viewer • Updated Apr 18 • 1.79M • 2.67k • 74

upvoted a paper 3 months ago

R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning

Paper • 2503.05592 • Published Mar 7 • 27

liked a model 3 months ago

BadToBest/EchoMimicV2

Updated Jan 6 • 123

liked a model 4 months ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B

Text Generation • Updated Feb 24 • 1.26M • 1.22k

liked a dataset 4 months ago

deepmind/code_contests

Viewer • Updated Jun 11, 2023 • 4.04k • 7.02k • 173

liked a model 4 months ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-7B

Text Generation • Updated Feb 24 • 514k • 652

liked a dataset 4 months ago

agentica-org/DeepScaleR-Preview-Dataset

Viewer • Updated Feb 10 • 40.3k • 5.23k • 124

liked a model 4 months ago

Qwen/Qwen2.5-7B-Instruct-1M

Text Generation • Updated Jan 29 • 1.3M • 328

upvoted a paper 4 months ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31 • 123