BookWorld: From Novels to Interactive Agent Societies for Creative Story Generation Paper • 2504.14538 • Published Apr 20 • 29
FlashThink: An Early Exit Method For Efficient Reasoning Paper • 2505.13949 • Published 11 days ago • 1
FlashThink: An Early Exit Method For Efficient Reasoning Paper • 2505.13949 • Published 11 days ago • 1
RLAP: A Reinforcement Learning Enhanced Adaptive Planning Framework for Multi-step NLP Task Solving Paper • 2505.11893 • Published 14 days ago • 1
RLAP: A Reinforcement Learning Enhanced Adaptive Planning Framework for Multi-step NLP Task Solving Paper • 2505.11893 • Published 14 days ago • 1
Q-Filters: Leveraging QK Geometry for Efficient KV Cache Compression Paper • 2503.02812 • Published Mar 4 • 10
AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction Paper • 2409.01854 • Published Sep 3, 2024 • 1
AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction Paper • 2409.01854 • Published Sep 3, 2024 • 1
Q-Filters Collection Pre-computed Q-Filters for efficient KV cache compression. • 15 items • Updated Mar 3 • 7
FlowKV: A Disaggregated Inference Framework with Low-Latency KV Cache Transfer and Load-Aware Scheduling Paper • 2504.03775 • Published Apr 3
Reason from Fallacy: Enhancing Large Language Models' Logical Reasoning through Logical Fallacy Understanding Paper • 2404.04293 • Published Apr 4, 2024 • 1
P-ICL: Point In-Context Learning for Named Entity Recognition with Large Language Models Paper • 2405.04960 • Published May 8, 2024 • 1
SED: Self-Evaluation Decoding Enhances Large Language Models for Better Generation Paper • 2405.16552 • Published May 26, 2024 • 1
Adaptive Reinforcement Learning Planning: Harnessing Large Language Models for Complex Information Extraction Paper • 2406.11455 • Published Jun 17, 2024 • 1