Yi Cui

onekq

AI & ML interests

Benchmark, Code Generation Model

Organizations

MLX Community, ONEKQ AI

Posts 52

Post
215
This post discusses the same trend as the Sutton post, but is more concrete and down-to-earth.

https://ysymyth.github.io/The-Second-Half/

Two takeaways for me. (1) The deep neural network is the backbone that unifies everything. RLHF will stand the test of time because it brings two distinct fields (NLP and RL) onto the same model weights. (2) Language models will continue to play a central role in the era of agents. They probably won't be the end game for AGI, but they are definitely not an off-ramp.

Articles 2

Article
5

Does Daily Software Engineering Work Need Reasoning Models?

Models

None public yet

Datasets

None public yet