ggml_llava-v1.5-7b

This repo contains GGUF files for running inference on llava-v1.5-7b with llama.cpp end-to-end, without any extra dependencies.

Note: The mmproj-model-f16.gguf file structure is experimental and may change. Always use the latest code in llama.cpp.
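Because the file structure may change, it can be useful to confirm that a downloaded file is a GGUF container and to see which GGUF version it declares before handing it to llama.cpp. The following is a minimal sketch, not part of this repo or of llama.cpp: the function name `read_gguf_header` is hypothetical, and it assumes a little-endian file with GGUF version 2 or later, where the header is the 4-byte magic `GGUF`, a 32-bit version, and two 64-bit counts.

```python
import struct
from pathlib import Path

GGUF_MAGIC = b"GGUF"  # first four bytes of every GGUF file


def read_gguf_header(path):
    """Return (version, tensor_count, metadata_kv_count) for a GGUF file.

    Assumption: little-endian layout and GGUF version >= 2
    (version 1 stored the two counts as 32-bit integers).
    """
    with Path(path).open("rb") as f:
        header = f.read(24)  # magic (4) + version (4) + two uint64 counts (16)

    if len(header) < 24:
        raise ValueError(f"{path}: file too short to contain a GGUF header")
    if header[:4] != GGUF_MAGIC:
        raise ValueError(f"{path}: not a GGUF file (magic={header[:4]!r})")

    (version,) = struct.unpack_from("<I", header, 4)
    if version < 2:
        raise ValueError(f"{path}: GGUF version {version} is not handled by this sketch")

    tensor_count, metadata_kv_count = struct.unpack_from("<QQ", header, 8)
    return version, tensor_count, metadata_kv_count


# Example (the path is a placeholder for wherever the file was downloaded):
# version, tensors, kv_pairs = read_gguf_header("mmproj-model-f16.gguf")
```

This only inspects the container header. It does not validate the tensors or metadata keys that llama.cpp expects, so a file that passes this check can still be rejected by an older or newer llama.cpp build.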

Format: GGUF
Model size: 6.74B params
Architecture: llama
Available quantizations: 4-bit, 5-bit, 16-bit
