Mahmud ElHuseyni's picture

60 79

Mahmud ElHuseyni

MElHuseyni

·

AI & ML interests

Computer Vision NLP Machine Learning

Recent Activity

liked a model 3 days ago

PleIAs/Pleias-RAG-1B

upvoted a collection 3 days ago

upvoted a collection 4 days ago

Describe Anything

View all activity

Organizations

MElHuseyni's activity

liked a model 3 days ago

PleIAs/Pleias-RAG-1B

Updated 3 days ago • 94 • 27

upvoted a collection 3 days ago

Pleias-RAG

New generation of small reasoning models for RAG, search, and source summarization. • 4 items • Updated 4 days ago • 22

upvoted a collection 4 days ago

Describe Anything

Multimodal Large Language Models for Detailed Localized Image and Video Captioning • 7 items • Updated 3 days ago • 40

upvoted 3 papers 4 days ago

Kuwain 1.5B: An Arabic SLM via Language Injection

Paper • 2504.15120 • Published 7 days ago • 111

TTRL: Test-Time Reinforcement Learning

Paper • 2504.16084 • Published 5 days ago • 90

Describe Anything: Detailed Localized Image and Video Captioning

Paper • 2504.16072 • Published 5 days ago • 53

liked a model 7 days ago

russwang/ThinkLite-VL-7B

Updated 10 days ago • 273 • 11

liked 2 models 8 days ago

nvidia/MambaVision-L3-512-21K

Image Classification • Updated 29 days ago • 7.85k • 49

microsoft/beit-large-patch16-512

Image Classification • Updated Jan 28, 2022 • 691 • 11

upvoted 2 papers 13 days ago

PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday Home Clusters

Paper • 2504.08791 • Published 21 days ago • 125

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published 13 days ago • 245

liked 2 models 14 days ago

agentica-org/DeepCoder-14B-Preview

Text Generation • Updated 18 days ago • 44.7k • 613

pandaphd/generative_photography

Text-to-Video • Updated Mar 4 • 7

upvoted a paper 15 days ago

DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning

Paper • 2504.07128 • Published 26 days ago • 82

upvoted 2 collections 17 days ago

InternVL3

34 items • Updated 8 days ago • 56

RADIO

A collection of Foundation Vision Models that combine multiple models (CLIP, DINOv2, SAM, etc.). • 13 items • Updated 3 days ago • 17

upvoted a collection 18 days ago

Orpheus Multilingual Research Release

Beta Release of multilingual models. • 12 items • Updated 17 days ago • 76

upvoted a paper 19 days ago

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published 20 days ago • 176

upvoted a collection 20 days ago

SmolVLM2 📺 Smallest video LM ever 🤏🏻

11 items • Updated 3 days ago • 82

updated a collection 21 days ago

Speech Models

27 items • Updated 21 days ago