1 54 64

wei

fengwei

AI & ML interests

None yet

Recent Activity

liked a model 4 days ago

microsoft/bitnet-b1.58-2B-4T

upvoted a paper 4 days ago

CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

upvoted a paper 13 days ago

Kimi-VL Technical Report

View all activity

Organizations

None yet

fengwei's activity

upvoted a paper 4 days ago

CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

Paper • 2504.13161 • Published 6 days ago • 86

upvoted a paper 13 days ago

Kimi-VL Technical Report

Paper • 2504.07491 • Published 14 days ago • 120

upvoted a collection 13 days ago

Kimi-VL-A3B

Collection

Moonshot's efficient MoE VLMs, exceptional on agent, long-context, and thinking • 6 items • Updated 11 days ago • 61

upvoted 2 papers 18 days ago

ZClip: Adaptive Spike Mitigation for LLM Pre-Training

Paper • 2504.02507 • Published 20 days ago • 76

Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

Paper • 2504.01990 • Published 23 days ago • 256

upvoted a paper 21 days ago

MergeVQ: A Unified Framework for Visual Generation and Representation with Disentangled Token Merging and Quantization

Paper • 2504.00999 • Published 22 days ago • 83

upvoted a paper 23 days ago

Qwen2.5-Omni Technical Report

Paper • 2503.20215 • Published 29 days ago • 140

upvoted an article 28 days ago

Article

Guiding Text Generation with Constrained Beam Search in 🤗 Transformers

Mar 11, 2022

• 11

upvoted 5 papers 28 days ago

I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders

Paper • 2503.18878 • Published about 1 month ago • 117

upvoted a paper about 1 month ago

EuroBERT: Scaling Multilingual Encoders for European Languages

Paper • 2503.05500 • Published Mar 7 • 78

upvoted a paper about 2 months ago

NeoBERT: A Next-Generation BERT

Paper • 2502.19587 • Published Feb 26 • 39

upvoted 5 papers 2 months ago

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published Feb 20 • 103

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published Feb 20 • 143

MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Paper • 2502.14499 • Published Feb 20 • 191

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 182

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 226