1 56 146

Zhaocheng Liu

zhaocheng

https://scholar.google.com/citations?user=Kk-dRIAAAAAJ

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

Sibyl: Simple yet Effective Agent Framework for Complex Real-world Reasoning

liked a dataset 4 days ago

naver-clova-ix/cord-v2

upvoted a paper 10 days ago

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

View all activity

Organizations

zhaocheng's activity

upvoted a paper 3 days ago

Sibyl: Simple yet Effective Agent Framework for Complex Real-world Reasoning

Paper • 2407.10718 • Published Jul 15, 2024 • 19

upvoted a paper 10 days ago

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

Paper • 2505.22617 • Published 11 days ago • 116

upvoted a paper 24 days ago

Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures

Paper • 2505.09343 • Published 25 days ago • 64

upvoted a paper 25 days ago

Measuring General Intelligence with Generated Games

Paper • 2505.07215 • Published 27 days ago • 10

upvoted a paper about 1 month ago

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Paper • 2504.20571 • Published Apr 29 • 94

upvoted a paper about 2 months ago

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published Apr 18 • 127

upvoted a paper 2 months ago

Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback

Paper • 2503.22230 • Published Mar 28 • 44

upvoted a paper 3 months ago

R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning

Paper • 2503.05592 • Published Mar 7 • 27

upvoted 4 papers 4 months ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31 • 123

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published Feb 3 • 62

OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models

Paper • 2502.01061 • Published Feb 3 • 216

Can We Generate Images with CoT? Let's Verify and Reinforce Image Generation Step by Step

Paper • 2501.13926 • Published Jan 23 • 42

upvoted 5 papers 5 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 401

upvoted 2 papers 9 months ago

LLaMA-Omni: Seamless Speech Interaction with Large Language Models

Paper • 2409.06666 • Published Sep 10, 2024 • 58

Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale

Paper • 2409.08264 • Published Sep 12, 2024 • 49

upvoted a paper 10 months ago

Automated Design of Agentic Systems

Paper • 2408.08435 • Published Aug 15, 2024 • 41