BizFinBench: A Business-Driven Real-World Financial Benchmark for Evaluating LLMs Paper • 2505.19457 • Published 13 days ago • 61
MME-Finance: A Multimodal Finance Benchmark for Expert-level Understanding and Reasoning Paper • 2411.03314 • Published Nov 5, 2024 • 1
MedVLM-R1: Incentivizing Medical Reasoning Capability of Vision-Language Models (VLMs) via Reinforcement Learning Paper • 2502.19634 • Published Feb 26 • 63