siyeng feng

siyengfeng

AI & ML interests

None yet

Recent Activity

liked a model 4 days ago

a-m-team/AM-Thinking-v1

upvoted a paper 6 days ago

Overflow Prevention Enhances Long-Context Recurrent LLMs

upvoted a paper 6 days ago

DynamicRAG: Leveraging Outputs of Large Language Model as Feedback for Dynamic Reranking in Retrieval-Augmented Generation

View all activity

Organizations

None yet

siyengfeng's activity

upvoted 20 papers 6 days ago

Overflow Prevention Enhances Long-Context Recurrent LLMs

Paper • 2505.07793 • Published 7 days ago • 3

DynamicRAG: Leveraging Outputs of Large Language Model as Feedback for Dynamic Reranking in Retrieval-Augmented Generation

Paper • 2505.07233 • Published 7 days ago • 7

Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent

Paper • 2505.07596 • Published 7 days ago • 10

MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining

Paper • 2505.07608 • Published 7 days ago • 74

Learning from Peers in Reasoning Models

Paper • 2505.07787 • Published 7 days ago • 41

WebGen-Bench: Evaluating LLMs on Generating Interactive and Functional Websites from Scratch

Paper • 2505.03733 • Published 13 days ago • 16

Learning Dynamics in Continual Pre-Training for Large Language Models

Paper • 2505.07796 • Published 7 days ago • 18

Skywork-VL Reward: An Effective Reward Model for Multimodal Understanding and Reasoning

Paper • 2505.07263 • Published 7 days ago • 29

DanceGRPO: Unleashing GRPO on Visual Generation

Paper • 2505.07818 • Published 7 days ago • 27

AttentionInfluence: Adopting Attention Head Influence for Weak-to-Strong Pretraining Data Selection

Paper • 2505.07293 • Published 7 days ago • 24

REFINE-AF: A Task-Agnostic Framework to Align Language Models via Self-Generated Instructions using Reinforcement Learning from Automated Feedback

Paper • 2505.06548 • Published 9 days ago • 27

ZeroSearch: Incentivize the Search Capability of LLMs without Searching

Paper • 2505.04588 • Published 12 days ago • 60

Flow-GRPO: Training Flow Matching Models via Online RL

Paper • 2505.05470 • Published 11 days ago • 69

Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities

Paper • 2505.02567 • Published 14 days ago • 70

RM-R1: Reward Modeling as Reasoning

Paper • 2505.02387 • Published 14 days ago • 66

Seed1.5-VL Technical Report

Paper • 2505.07062 • Published 8 days ago • 132

Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers

Paper • 2504.20752 • Published 20 days ago • 88