Consider adding a prebuilt ONNX model for CPU

#3
by 234r89r23u89023rui90 - opened

Hi,
please consider uploading a prebuilt ONNX model for CPU inference as well.
I tried to follow the linked tutorial [1], but it requires a huge amount of RAM that I do not have access to.
Thank you.

[1] https://github.com/microsoft/onnxruntime-genai/blob/main/examples/python/phi-4-multi-modal.md

Microsoft org

We have a prebuilt ONNX model for CPU that is ready to upload. We are waiting for some internal requirements to be completed before we can publish it.

234r89r23u89023rui90 changed discussion status to closed

Any news on that?
Thanks.

234r89r23u89023rui90 changed discussion status to open

@kvaishnavi Do you have any update on this? Given how quickly the field is moving, I fear the CPU ONNX model might end up DOA if the publishing process takes months. Thank you.
