Felix Tuma

floom

AI & ML interests

NLP

Recent Activity

updated a collection 2 days ago

ShowAndTell

upvoted a paper 9 days ago

Do PhD-level LLMs Truly Grasp Elementary Addition? Probing Rule Learning vs. Memorization in Large Language Models

upvoted a paper 9 days ago

Reasoning Models Can Be Effective Without Thinking

View all activity

Organizations

None yet

floom's activity

updated a collection 2 days ago

ShowAndTell

Collection

36 items • Updated 2 days ago

upvoted 2 papers 9 days ago

Do PhD-level LLMs Truly Grasp Elementary Addition? Probing Rule Learning vs. Memorization in Large Language Models

Paper • 2504.05262 • Published 18 days ago • 11

Reasoning Models Can Be Effective Without Thinking

Paper • 2504.09858 • Published 12 days ago • 10

updated a collection 9 days ago

ShowAndTell

Collection

36 items • Updated 2 days ago

upvoted a paper 9 days ago

xVerify: Efficient Answer Verifier for Reasoning Model Evaluations

Paper • 2504.10481 • Published 11 days ago • 84

updated 2 collections 9 days ago

RL

Collection

25 items • Updated 9 days ago • 2

ShowAndTell

Collection

36 items • Updated 2 days ago

upvoted a paper 9 days ago

Heimdall: test-time scaling on the generative verification

Paper • 2504.10337 • Published 11 days ago • 32

updated a collection 19 days ago

ShowAndTell

Collection

36 items • Updated 2 days ago

upvoted a paper 19 days ago

Agentic Knowledgeable Self-awareness

Paper • 2504.03553 • Published 21 days ago • 28

updated a collection 22 days ago

ShowAndTell

Collection

36 items • Updated 2 days ago

updated a collection 23 days ago

ShowAndTell

Collection

36 items • Updated 2 days ago

updated a collection about 1 month ago

ShowAndTell

Collection

36 items • Updated 2 days ago

upvoted a paper about 1 month ago

Temporal Consistency for LLM Reasoning Process Error Identification

Paper • 2503.14495 • Published Mar 18 • 9

liked a Space about 1 month ago

124

smolagents LLM leaderboard

🏆

A leaderboard for LLMs powering smolagents

updated a collection about 1 month ago

ShowAndTell

Collection

36 items • Updated 2 days ago