wang's picture

94 2

wang

wangxbx

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 23 hours ago

Kuwain 1.5B: An Arabic SLM via Language Injection

upvoted a paper 6 days ago

70% Size, 100% Accuracy: Lossless LLM Compression for Efficient GPU Inference via Dynamic-Length Float

upvoted a paper 7 days ago

AlayaDB: The Data Foundation for Efficient and Effective Long-context LLM Inference

View all activity

Organizations

None yet

wangxbx's activity

upvoted a paper about 23 hours ago

Kuwain 1.5B: An Arabic SLM via Language Injection

Paper • 2504.15120 • Published 4 days ago • 106

upvoted a paper 6 days ago

70% Size, 100% Accuracy: Lossless LLM Compression for Efficient GPU Inference via Dynamic-Length Float

Paper • 2504.11651 • Published 10 days ago • 15

upvoted 2 papers 7 days ago

AlayaDB: The Data Foundation for Efficient and Effective Long-context LLM Inference

Paper • 2504.10326 • Published 11 days ago • 25

BitNet b1.58 2B4T Technical Report

Paper • 2504.12285 • Published 9 days ago • 67

upvoted 2 papers 10 days ago

PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday Home Clusters

Paper • 2504.08791 • Published 18 days ago • 123

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published 11 days ago • 239

upvoted 9 papers 14 days ago

Open Deep Search: Democratizing Search with Open-source Reasoning Agents

Paper • 2503.20201 • Published about 1 month ago • 46

Gemma 3 Technical Report

Paper • 2503.19786 • Published about 1 month ago • 48

DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning

Paper • 2504.07128 • Published 23 days ago • 82

Quantization Hurts Reasoning? An Empirical Study on Quantized Reasoning Models

Paper • 2504.04823 • Published 18 days ago • 30

SkyReels-A2: Compose Anything in Video Diffusion Transformers

Paper • 2504.02436 • Published 22 days ago • 35

Multi-Token Attention

Paper • 2504.00927 • Published 24 days ago • 46

Kimi-VL Technical Report

Paper • 2504.07491 • Published 15 days ago • 121

Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-Thought

Paper • 2504.05599 • Published 17 days ago • 81

Hogwild! Inference: Parallel LLM Generation via Concurrent Attention

Paper • 2504.06261 • Published 17 days ago • 104

upvoted 5 papers about 1 month ago

A Comprehensive Survey on Long Context Language Modeling

Paper • 2503.17407 • Published Mar 20 • 49

Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't

Paper • 2503.16219 • Published Mar 20 • 48

Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning

Paper • 2503.15558 • Published Mar 18 • 46

Plug-and-Play 1.x-Bit KV Cache Quantization for Video Large Language Models

Paper • 2503.16257 • Published Mar 20 • 24

Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models

Paper • 2503.16419 • Published Mar 20 • 71