Yassine Ennaour's picture

Yassine Ennaour

Lyte

·

AI & ML interests

None yet

Recent Activity

liked a model 1 day ago

facebook/Perception-LM-1B

liked a dataset 3 days ago

amazon-agi/SIFT-50M

upvoted a paper 3 days ago

WORLDMEM: Long-term Consistent World Simulation with Memory

View all activity

Organizations

Lyte's activity

upvoted a paper 3 days ago

WORLDMEM: Long-term Consistent World Simulation with Memory

Paper • 2504.12369 • Published 4 days ago • 28

upvoted a collection 3 days ago

DataDecide

A suite of models, data, and evals over 25 corpora, 14 sizes, and 3 seeds to measure how accurately small experiments predict rankings at large scale. • 358 items • Updated 4 days ago • 11

upvoted a paper 11 days ago

Hogwild! Inference: Parallel LLM Generation via Concurrent Attention

Paper • 2504.06261 • Published 12 days ago • 101

upvoted a paper 14 days ago

TransMamba: Flexibly Switching between Transformer and Mamba

Paper • 2503.24067 • Published 21 days ago • 17

upvoted a paper 19 days ago

Multi-Token Attention

Paper • 2504.00927 • Published 19 days ago • 44

upvoted a collection 25 days ago

Qwen2.5-Omni

End-to-End Omni (text, audio, image, video, and natural speech interaction) model based Qwen2.5 • 3 items • Updated 25 days ago • 89

upvoted 2 collections about 2 months ago

Tiny Models

7 items • Updated Sep 13, 2024 • 1

Gemstone Models

Our 22 open source Gemstone models for scaling laws range from 50M to 2B parameters, spanning 11 widths from 256 to 3072 and 18 depths from 3 to 80. • 59 items • Updated Feb 26 • 8

upvoted an article about 2 months ago

Article

FastRTC: The Real-Time Communication Library for Python

Feb 25

• 158

upvoted a collection about 2 months ago

Ovis2

Our latest advancement in multi-modal large language models (MLLMs) • 15 items • Updated 27 days ago • 59

upvoted a paper about 2 months ago

Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning

Paper • 2502.14768 • Published Feb 20 • 48

upvoted a collection about 2 months ago

SmolVLM2 📺 Smallest video LM ever 🤏🏻

11 items • Updated Feb 25 • 82

upvoted a paper 3 months ago

Scalable-Softmax Is Superior for Attention

Paper • 2501.19399 • Published Jan 31 • 22

upvoted an article 3 months ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28

• 844

upvoted a collection 3 months ago

YuE

YuE: Open Full-song Generation Foundation Model • 11 items • Updated Mar 18 • 23

upvoted an article 3 months ago

Article

Welcome to Inference Providers on the Hub 🔥

Jan 28

• 475

upvoted a collection 3 months ago

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 11 items • Updated 21 days ago • 446

upvoted a paper 3 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 383