wongyukim
AI & ML interests
None yet
Recent Activity
upvoted a paper 2 days ago
J1: Incentivizing Thinking in LLM-as-a-Judge via Reinforcement Learning
upvoted a paper 2 days ago
Unilogit: Robust Machine Unlearning for LLMs Using Uniform-Target Self-Distillation
upvoted a paper 2 days ago
OpenThinkIMG: Learning to Think with Images via Visual Tool Reinforcement Learning
Organizations
None yet
wongyukim's activity
number of hardnegs
#3 opened 3 months ago by wongyukim