Roee Aharoni's picture

1 3

Roee Aharoni

roeeaharoni

·

http://www.roeeaharoni.com

AI & ML interests

Natural Language Processing

Recent Activity

upvoted a paper 27 days ago

Inside-Out: Hidden Factual Knowledge in LLMs

liked a dataset 9 months ago

google/granola-entity-questions

reacted to gsarti's post with 🤗 about 1 year ago

🔍 Today's pick in Interpretability & Analysis of LMs: A Chain-of-Thought Is as Strong as Its Weakest Link: A Benchmark for Verifiers of Reasoning Chains by @alonjacovi @yonatanbitton B. Bohnet J. Herzig @orhonovic M. Tseng M. Collins @roeeaharoni @mega This work introduces a new methodology for human verification of reasoning chains and adopts it to annotate a dataset of chain-of-thought reasoning chains produced by 3 LMs. The annotated dataset, REVEAL, can be used to benchmark automatic verifiers of reasoning in LMs. In their analysis, the authors find that LM-produced CoTs generally contain faulty steps, often leading to incorrect automatic verification. In particular, CoT-generating LMs are found to produce non-attributable reasoning steps often, and reasoning verifiers generally struggle to verify logical correctness. 📄 Paper: https://huggingface.co/papers/2402.00559 🔡 Dataset: https://huggingface.co/datasets/google/reveal

View all activity

Organizations

roeeaharoni's activity

liked a dataset 9 months ago

google/granola-entity-questions

Viewer • Updated Aug 1, 2024 • 12.5k • 111 • 8

liked a model over 1 year ago

google/t5_11b_trueteacher_and_anli

Text2Text Generation • Updated Dec 26, 2023 • 441 • 16

liked a model about 2 years ago

google/t5_xxl_true_nli_mixture

Text2Text Generation • Updated Mar 23, 2023 • 2.87k • 46