microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • Updated 18 days ago • 607k • 1.33k
Running on CPU Upgrade 396 396 GAIA Leaderboard 🦾 Submit models for evaluation and view leaderboard results
ToolHop: A Query-Driven Benchmark for Evaluating Large Language Models in Multi-Hop Tool Use Paper • 2501.02506 • Published Jan 5 • 11 • 3
ToolHop: A Query-Driven Benchmark for Evaluating Large Language Models in Multi-Hop Tool Use Paper • 2501.02506 • Published Jan 5 • 11