Sharath Turuvekere Sreenivas

sharathtsnv

AI & ML interests

Learning algorithms, LLM efficiency: knowledge distillation and compression.

sharathtsnv's activity

upvoted a collection about 1 month ago

Nemotron-H

Mamba-Transformer hybrid models • 6 items • Updated 8 days ago • 22

authored 2 papers about 2 months ago

LLM Pruning and Distillation in Practice: The Minitron Approach

Paper • 2408.11796 • Published Aug 21, 2024 • 59

Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models

Paper • 2504.03624 • Published Apr 4 • 13

New activity in nvidia/Llama-3.1-Minitron-4B-Width-Base 8 months ago

Teacher correction training hyperparameters

#13 opened 8 months ago by

upvoted a paper 9 months ago

LLM Pruning and Distillation in Practice: The Minitron Approach

Paper • 2408.11796 • Published Aug 21, 2024 • 59

authored a paper 10 months ago

Compact Language Models via Pruning and Knowledge Distillation

Paper • 2407.14679 • Published Jul 19, 2024 • 40

upvoted a paper 10 months ago

Compact Language Models via Pruning and Knowledge Distillation

Paper • 2407.14679 • Published Jul 19, 2024 • 40