2 10 36

Arthur EDMOND PRO

Shumatsurontek

AI & ML interests

LLM & Computer Vision

Recent Activity

upvoted a paper about 7 hours ago

TTRL: Test-Time Reinforcement Learning

upvoted a paper 1 day ago

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

new activity 6 days ago

huggingface/InferenceSupport:OpenGVLab/InternVL3-78B

View all activity

Organizations

None yet

Shumatsurontek's activity

upvoted a paper about 7 hours ago

TTRL: Test-Time Reinforcement Learning

Paper • 2504.16084 • Published about 20 hours ago • 47

upvoted a paper 1 day ago

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published 5 days ago • 90

New activity in huggingface/InferenceSupport 6 days ago

OpenGVLab/InternVL3-78B

#801 opened 7 days ago by

galvani4987

liked a model 6 days ago

HiDream-ai/HiDream-I1-Full

Text-to-Image • Updated 1 day ago • 28.2k • • 703

upvoted a paper 12 days ago

DDT: Decoupled Diffusion Transformer

Paper • 2504.05741 • Published 15 days ago • 73

liked 2 models 13 days ago

moonshotai/Kimi-VL-A3B-Thinking

Image-Text-to-Text • Updated 3 days ago • 37.5k • 380

HuggingFaceTB/SmolVLM2-2.2B-Instruct

Image-Text-to-Text • Updated 15 days ago • 51.9k • 163

updated a model 14 days ago

Shumatsurontek/florence-2-large-ft-mod

Image-Text-to-Text • Updated 14 days ago • 86

published a model 14 days ago

Shumatsurontek/florence-2-large-ft-mod

Image-Text-to-Text • Updated 14 days ago • 86

updated a collection 14 days ago

VLMs

Collection

2 items • Updated 14 days ago

liked 2 models 14 days ago

microsoft/Florence-2-large-ft

Image-Text-to-Text • Updated Jul 20, 2024 • 278k • 348

microsoft/Florence-2-base

Image-Text-to-Text • Updated Nov 4, 2024 • 461k • 260

liked a Space 16 days ago

Exam 1 - Fundamentals of GRPO

🔥

Test your knowledge of GRPO, TRL, RL, and Deepseek R1.

upvoted a paper 19 days ago

Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

Paper • 2504.01990 • Published 23 days ago • 256

liked a model 19 days ago

Qwen/QwQ-32B

Text Generation • Updated Mar 11 • 667k • • 2.71k

upvoted a paper 23 days ago

Think Before Recommend: Unleashing the Latent Reasoning Power for Sequential Recommendation

Paper • 2503.22675 • Published 26 days ago • 34

upvoted a paper 26 days ago

Qwen2.5-Omni Technical Report

Paper • 2503.20215 • Published 28 days ago • 140

upvoted a paper 30 days ago

When Less is Enough: Adaptive Token Reduction for Efficient Image Representation

Paper • 2503.16660 • Published Mar 20 • 73

upvoted a paper about 1 month ago

RWKV-7 "Goose" with Expressive Dynamic State Evolution

Paper • 2503.14456 • Published Mar 18 • 142

liked a model about 1 month ago

agents-course/notebooks

Updated 7 days ago • 343