wongyukim
wongyukim
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 4 hours ago
LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making
Abilities
upvoted
a
paper
about 4 hours ago
TTRL: Test-Time Reinforcement Learning
upvoted
a
paper
1 day ago
DRAGON: Distributional Rewards Optimize Diffusion Generative Models
Organizations
None yet
models
None public yet
datasets
None public yet