Phi-4 model loads successfully on text-generation-webui, but Phi-4-mini-instruct does not

#21
by harisnaeem - opened

I'm using the latest release of text-generation-webui on my computer in CPU mode, and it successfully loads both the 4-bit and 2-bit GGUF quantizations of the Phi-4 model without any problems.

[Screenshot from 2025-03-13 16-20-04: phi-4-Q2_K loading successfully]

However, it does not seem to load any GGUF of Phi-4-mini-instruct:

[Screenshot from 2025-03-13 16-22-11: Phi-4-mini-instruct-Q6_K failing to load]

Can someone who uses text-generation-webui tell me what the problem with Phi-4-mini-instruct might be?

Phi-4-mini-instruct uses gpt-4o as its pre-tokenizer, which the llama.cpp version bundled with text-generation-webui doesn't support. There's no update for the upstream wheel repo https://github.com/jllllll/llama-cpp-python-cuBLAS-wheels.
You can solve this by updating llama.cpp (via llama-cpp-python) according to your environment.
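As a rough sketch of that fix, you could upgrade the `llama-cpp-python` package inside the environment text-generation-webui uses (the exact version that added gpt-4o pre-tokenizer support isn't stated here, so pulling the latest release is an assumption):

```shell
# Activate the same Python environment that text-generation-webui runs in first.
# Upgrade llama-cpp-python so its bundled llama.cpp recognizes the
# gpt-4o pre-tokenizer (assumes a recent release includes that support).
pip install --upgrade llama-cpp-python
```

For GPU builds you would also need wheels compiled with the matching backend (e.g. CUDA), which is why the stale cuBLAS wheel repo mentioned above is the blocker there.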
