SmolVLM: Redefining small and efficient multimodal models • Paper • arXiv:2504.05299
Vision Language Models Quantization • Collection • Vision Language Models (VLMs) quantized by Neural Magic • 20 items
MambaVision • Collection • MambaVision: A Hybrid Mamba-Transformer Vision Backbone. Includes both ImageNet-1K and ImageNet-21K pretrained models. • 13 items
MoshiVis v0.1 • Collection • MoshiVis is a Vision-Speech Model built as a perceptually augmented version of Moshi v0.1 for conversing about image inputs. • 8 items
Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM • Article