bartowski/nvidia_Llama-3.1-8B-UltraLong-4M-Instruct-GGUF Text Generation • Updated 10 days ago • 5.64k • 21
Scaling Laws for Native Multimodal Models Scaling Laws for Native Multimodal Models Paper • 2504.07951 • Published 14 days ago • 27
lmstudio-community/Llama-4-Scout-17B-16E-Instruct-GGUF Text Generation • Updated 17 days ago • 26.1k • 27
meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 Image-Text-to-Text • Updated 15 days ago • 52.1k • • 105
Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency Paper • 2503.20785 • Published 29 days ago • 21
Self-Supervised Learning of Motion Concepts by Optimizing Counterfactuals Paper • 2503.19953 • Published 30 days ago • 3
FirePlace: Geometric Refinements of LLM Common Sense Reasoning for 3D Object Placement Paper • 2503.04919 • Published Mar 6 • 8
Open Deep Search: Democratizing Search with Open-source Reasoning Agents Paper • 2503.20201 • Published 29 days ago • 46
AMD-Hummingbird: Towards an Efficient Text-to-Video Model Paper • 2503.18559 • Published about 1 month ago • 5