3 175 29

Young-Jun Lee PRO

passing2961

https://sites.google.com/view/passing2961/home

AI & ML interests

Social Dialogue System, Multi-Modal Dialogue

Recent Activity

upvoted a paper 6 days ago

VeriThinker: Learning to Verify Makes Reasoning Model Efficient

upvoted a paper 6 days ago

Distilling LLM Agent into Small Models with Retrieval and Code Tools

upvoted a paper 6 days ago

Web-Shepherd: Advancing PRMs for Reinforcing Web Agents

View all activity

Organizations

passing2961's activity

upvoted 3 papers 6 days ago

upvoted 2 papers 12 days ago

AdaptThink: Reasoning Models Can Learn When to Think

Paper • 2505.13417 • Published 12 days ago • 74

VisionReasoner: Unified Visual Perception and Reasoning via Reinforcement Learning

Paper • 2505.12081 • Published 14 days ago • 17

upvoted 2 papers 13 days ago

The CoT Encyclopedia: Analyzing, Predicting, and Controlling how a Reasoning Model will Think

Paper • 2505.10185 • Published 16 days ago • 25

Qwen3 Technical Report

Paper • 2505.09388 • Published 17 days ago • 168

updated a dataset 24 days ago

RefineBench/Human-Eval

Viewer • Updated 24 days ago • 130 • 45

upvoted a paper about 1 month ago

Toward Evaluative Thinking: Meta Policy Optimization with Evolving Reward Models

Paper • 2504.20157 • Published Apr 28 • 36

commented a paper about 1 month ago

Toward Evaluative Thinking: Meta Policy Optimization with Evolving Reward Models

Paper • 2504.20157 • Published Apr 28 • 36 •

upvoted 6 papers about 2 months ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14 • 266

Kimi-VL Technical Report

Paper • 2504.07491 • Published Apr 10 • 124

VLM-R1: A Stable and Generalizable R1-style Large Vision-Language Model

Paper • 2504.07615 • Published Apr 10 • 32

MineWorld: a Real-Time and Open-Source Interactive World Model on Minecraft

Paper • 2504.08388 • Published Apr 11 • 40

VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks

Paper • 2504.05118 • Published Apr 7 • 25

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published Apr 7 • 186

upvoted a collection about 2 months ago

Llama 4

Collection

Llama 4 release • 13 items • Updated Apr 29 • 518

upvoted 3 papers about 2 months ago

Improved Visual-Spatial Reasoning via R1-Zero-Like Training

Paper • 2504.00883 • Published Apr 1 • 64

Understanding R1-Zero-Like Training: A Critical Perspective

Paper • 2503.20783 • Published Mar 26 • 49

PaperBench: Evaluating AI's Ability to Replicate AI Research

Paper • 2504.01848 • Published Apr 2 • 36