Al-Hussein

AlHussein

AI & ML interests

Knowledge Distillation, Self-Supervised Learning, Semi-Supervised Learning

Recent Activity

upvoted a paper about 12 hours ago

Mutarjim: Advancing Bidirectional Arabic-English Translation with a Small Language Model

upvoted a paper 9 days ago

Emerging Properties in Unified Multimodal Pretraining

upvoted a paper 21 days ago

Describe Anything: Detailed Localized Image and Video Captioning

View all activity

Organizations

None yet

AlHussein's activity

upvoted a paper about 12 hours ago

Mutarjim: Advancing Bidirectional Arabic-English Translation with a Small Language Model

Paper • 2505.17894 • Published 9 days ago • 209

upvoted a paper 9 days ago

Emerging Properties in Unified Multimodal Pretraining

Paper • 2505.14683 • Published 12 days ago • 124

upvoted a paper 21 days ago

Describe Anything: Detailed Localized Image and Video Captioning

Paper • 2504.16072 • Published Apr 22 • 60

upvoted 3 papers about 1 month ago

upvoted 3 papers 2 months ago

Scaling Vision Pre-Training to 4K Resolution

Paper • 2503.19903 • Published Mar 25 • 42

Qwen2.5-Omni Technical Report

Paper • 2503.20215 • Published Mar 26 • 154

I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders

Paper • 2503.18878 • Published Mar 24 • 118

upvoted 2 papers 5 months ago

EXAONE 3.5: Series of Large Language Models for Real-world Use Cases

Paper • 2412.04862 • Published Dec 6, 2024 • 51

MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale

Paper • 2412.05237 • Published Dec 6, 2024 • 48

upvoted 5 papers 6 months ago

Video Depth without Video Models

Paper • 2411.19189 • Published Nov 28, 2024 • 39

Phi-4 Technical Report

Paper • 2412.08905 • Published Dec 12, 2024 • 118

PaliGemma: A versatile 3B VLM for transfer

Paper • 2407.07726 • Published Jul 10, 2024 • 71

Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders

Paper • 2410.22366 • Published Oct 28, 2024 • 83

Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization

Paper • 2411.10442 • Published Nov 15, 2024 • 80

liked a model 7 months ago

timm/resnet50.a1_in1k

Image Classification • Updated Jan 21 • 17.5M • 38

upvoted 3 papers 8 months ago

NVLM: Open Frontier-Class Multimodal LLMs

Paper • 2409.11402 • Published Sep 17, 2024 • 75

Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations

Paper • 2410.02762 • Published Oct 3, 2024 • 9

Kolmogorov-Arnold Transformer

Paper • 2409.10594 • Published Sep 16, 2024 • 46