Felix Tuma's picture

58 54

Felix Tuma

floom

·

AI & ML interests

NLP

Recent Activity

updated a collection about 16 hours ago

upvoted a paper 7 days ago

Do PhD-level LLMs Truly Grasp Elementary Addition? Probing Rule Learning vs. Memorization in Large Language Models

upvoted a paper 7 days ago

Reasoning Models Can Be Effective Without Thinking

View all activity

Organizations

None yet

floom's activity

upvoted 4 papers 7 days ago

Do PhD-level LLMs Truly Grasp Elementary Addition? Probing Rule Learning vs. Memorization in Large Language Models

Paper • 2504.05262 • Published 16 days ago • 11

Reasoning Models Can Be Effective Without Thinking

Paper • 2504.09858 • Published 10 days ago • 10

xVerify: Efficient Answer Verifier for Reasoning Model Evaluations

Paper • 2504.10481 • Published 9 days ago • 83

Heimdall: test-time scaling on the generative verification

Paper • 2504.10337 • Published 9 days ago • 32

upvoted a paper 17 days ago

Agentic Knowledgeable Self-awareness

Paper • 2504.03553 • Published 19 days ago • 28

upvoted a paper about 1 month ago

Temporal Consistency for LLM Reasoning Process Error Identification

Paper • 2503.14495 • Published Mar 18 • 9

upvoted a paper about 2 months ago

NeoBERT: A Next-Generation BERT

Paper • 2502.19587 • Published Feb 26 • 39

upvoted 6 papers 2 months ago

LIMO: Less is More for Reasoning

Paper • 2502.03387 • Published Feb 5 • 61

APE: Faster and Longer Context-Augmented Generation via Adaptive Parallel Encoding

Paper • 2502.05431 • Published Feb 8 • 6

LM2: Large Memory Models

Paper • 2502.06049 • Published Feb 9 • 30

ReasonFlux: Hierarchical LLM Reasoning via Scaling Thought Templates

Paper • 2502.06772 • Published Feb 10 • 21

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published Feb 10 • 151

Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning

Paper • 2502.06781 • Published Feb 10 • 61

upvoted 5 papers 3 months ago

Scaling Flaws of Verifier-Guided Search in Mathematical Reasoning

Paper • 2502.00271 • Published Feb 1 • 1

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 385

DeepSeek-V3 Technical Report

Paper • 2412.19437 • Published Dec 27, 2024 • 58

Chain-of-Retrieval Augmented Generation

Paper • 2501.14342 • Published Jan 24 • 56

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published Jan 14 • 286

upvoted 2 papers 5 months ago

Natural Language Reinforcement Learning

Paper • 2411.14251 • Published Nov 21, 2024 • 31

Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions

Paper • 2411.14405 • Published Nov 21, 2024 • 62