Jade

euclaise

AI & ML interests

None yet

euclaise's activity

liked 3 datasets 4 days ago

Gryphe/Opus-WritingPrompts

Viewer • Updated Jan 9 • 6.02k • 168 • 62

GeneralReasoning/GeneralThought-430K

Viewer • Updated Mar 14 • 431k • 12.8k • 31

davanstrien/fine-reasoning-questions

Viewer • Updated 8 days ago • 244 • 313 • 16

upvoted 4 papers 5 days ago

Reasoning Models Can Be Effective Without Thinking

Paper • 2504.09858 • Published 10 days ago • 10

A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce

Paper • 2504.11343 • Published 8 days ago • 14

DataDecide: How to Predict Best Pretraining Data with Small Experiments

Paper • 2504.11393 • Published 8 days ago • 15

xVerify: Efficient Answer Verifier for Reasoning Model Evaluations

Paper • 2504.10481 • Published 9 days ago • 83

liked a dataset 12 days ago

trl-lib/tldr-preference

Viewer • Updated Jan 8 • 179k • 267 • 2

upvoted a paper 15 days ago

VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks

Paper • 2504.05118 • Published 16 days ago • 25

upvoted a paper 16 days ago

GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning

Paper • 2504.00891 • Published 22 days ago • 12

liked 4 models 21 days ago

zed-industries/zeta

Updated Feb 27 • 4.86k • 268

featherless-ai/Qwerky-72B

Text Generation • Updated 28 days ago • 1.58k • 50

LGAI-EXAONE/EXAONE-Deep-32B

Text Generation • Updated Mar 19 • 72.6k • 288

yandex/YandexGPT-5-Lite-8B-instruct

Updated 23 days ago • 7.25k • 63

upvoted 2 papers 21 days ago

Multi-Token Attention

Paper • 2504.00927 • Published 22 days ago • 45

JudgeLRM: Large Reasoning Models as a Judge

Paper • 2504.00050 • Published 24 days ago • 60

upvoted a paper 22 days ago

Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model

Paper • 2503.24290 • Published 23 days ago • 62

upvoted a paper 23 days ago

General Reasoning Requires Learning to Reason from the Get-go

Paper • 2502.19402 • Published Feb 26 • 5

upvoted 2 papers 29 days ago

SeerAttention: Learning Intrinsic Sparse Attention in Your LLMs

Paper • 2410.13276 • Published Oct 17, 2024 • 30

Modifying Large Language Model Post-Training for Diverse Creative Writing

Paper • 2503.17126 • Published Mar 21 • 36