The model is the best for coding.
#7 opened about 22 hours ago by AekDevDev

When running on a single GPU, I get an error saying the VRAM is insufficient; when using multiple GPUs on a single machine, I get many other errors. My vLLM version is 0.8.4.
#6 opened 1 day ago by hanson888
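For the multi-GPU symptom described in #6, the usual way to split a model that exceeds one GPU's VRAM across several GPUs on a single machine is vLLM's tensor parallelism. A minimal launch sketch, assuming a 4-GPU node (the model ID is a placeholder, not from this page):

```shell
# Sketch: shard the model across 4 GPUs on one node via tensor
# parallelism. <model-id> is a placeholder for the actual repo ID.
vllm serve <model-id> --tensor-parallel-size 4
```

Whether this resolves the reporter's errors depends on the actual tracebacks, which are not included in the thread title.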

BitsAndBytes quantization inference error
#5 opened 1 day ago by chengfy

Bugs when using function calling with vllm==0.8.4
#4 opened 2 days ago by waple

SimpleQA Scores Are WAY off
#3 opened 4 days ago by phil111
Need fp8 version for inference
#2 opened 4 days ago by iwaitu

RuntimeError: CUDA error: device-side assert triggered
#1 opened 4 days ago by DsnTgr