Drishti Sharma

DrishtiSharma

https://scholar.google.com/citations?hl=en&user=9-GkrdkAAAAJ

AI & ML interests

None yet

Recent Activity

liked a Space 3 days ago

ling99/OCRBench-v2-leaderboard

liked a dataset 3 days ago

huyhuy123/ViOCRVQA

liked a dataset 3 days ago

VLLMs/MIRB

Organizations

DrishtiSharma's activity

upvoted a collection 20 days ago

🧠 Reasoning datasets

Datasets with reasoning traces for math and code released by the community • 21 items • Updated 8 days ago • 129

upvoted a collection 22 days ago

NCERT_Dataset

The NCERT dataset is a collection of educational content derived from NCERT textbooks for students in standards 6 to 12. • 35 items • Updated Feb 27 • 6

upvoted 2 papers about 1 month ago

LLM as a Broken Telephone: Iterative Generation Distorts Information

Paper • 2502.20258 • Published Feb 27 • 26

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13 • 99

upvoted a paper about 2 months ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6 • 110

upvoted an article 2 months ago

PaliGemma 2 Mix - New Instruction Vision Language Models by Google

Article • Feb 19 • 69

upvoted 13 papers 2 months ago

Soundwave: Less is More for Speech-Text Alignment in LLMs

Paper • 2502.12900 • Published Feb 18 • 85

IHEval: Evaluating Language Models on Following the Instruction Hierarchy

Paper • 2502.08745 • Published Feb 12 • 19

ReLearn: Unlearning via Learning for Large Language Models

Paper • 2502.11190 • Published Feb 16 • 29

How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training

Paper • 2502.11196 • Published Feb 16 • 22

Logical Reasoning in Large Language Models: A Survey

Paper • 2502.09100 • Published Feb 13 • 23

An Open Recipe: Adapting Language-Specific LLMs to a Reasoning Model in One Day via Model Merging

Paper • 2502.09056 • Published Feb 13 • 32

SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models

Paper • 2502.09604 • Published Feb 13 • 36

Skrr: Skip and Re-use Text Encoder Layers for Memory Efficient Text-to-Image Generation

Paper • 2502.08690 • Published Feb 12 • 44

InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU

Paper • 2502.08910 • Published Feb 13 • 149

SynthDetoxM: Modern LLMs are Few-Shot Parallel Detoxification Data Annotators

Paper • 2502.06394 • Published Feb 10 • 90

Expect the Unexpected: FailSafe Long Context QA for Finance

Paper • 2502.06329 • Published Feb 10 • 131

BenchMAX: A Comprehensive Multilingual Evaluation Suite for Large Language Models

Paper • 2502.07346 • Published Feb 11 • 53

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published Feb 10 • 151