torch.compile compatibility issue

#23
by jhmun - opened

Trying to do inference with vLLM, I'm getting this error:
[config.py:3785] torch.compile is turned on, but the model meta-llama/Llama-4-Maverick-17B-128E-Instruct does not support it.

Has anyone found a way to address this issue?

@jhmun could you try with VLLM_DISABLE_COMPILE_CACHE=1
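For example, you can set it inline when launching the server (model name taken from the error message above; keep whatever other flags you already use):

VLLM_DISABLE_COMPILE_CACHE=1 vllm serve meta-llama/Llama-4-Maverick-17B-128E-Instruct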

@jhmun also could you share your full cmd for repro?

Below is the cmd that I tried.

VLLM_DISABLE_TORCH_COMPILE=1 vllm serve {Llama-4-Maverick-17B-128E-Instruct} --tensor-parallel-size 8 --max-model-len 12000 --disable_custom_all_reduce --limit_mm_per_prompt="image=10" --enforce_eager

I'm getting the same issue even with VLLM_DISABLE_TORCH_COMPILE=1. Llama 4 Scout works fine with a similar cmd, which is odd!
Any workaround? Thanks!

Meta Llama org

The warning is erroneous; if the cmd fails for you, something else is likely wrong. Can you scroll up and see what else the logs say?
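If the relevant lines have already scrolled out of view, one way to capture the full log for inspection (plain shell redirection, nothing vLLM-specific; this just reuses your command from above) is:

vllm serve {Llama-4-Maverick-17B-128E-Instruct} --tensor-parallel-size 8 --max-model-len 12000 --disable_custom_all_reduce --limit_mm_per_prompt="image=10" --enforce_eager 2>&1 | tee vllm.log
grep -iE "error|traceback" vllm.log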
