LagPixelLOL's picture

91

LagPixelLOL

v2ray

·

LagPixelLOL

AI & ML interests

Looking for compute sponsors, please contact me through my email [email protected]!

Recent Activity

new activity 2 days ago

cognitivecomputations/DeepSeek-V3-0324-AWQ:Stuck when run on 8xH100

updated a model 10 days ago

x2ray/wheels

new activity 10 days ago

cognitivecomputations/DeepSeek-V3-0324-AWQ:why tokenizer_config.json changed for AWQ model.

View all activity

Organizations

v2ray's activity

New activity in cognitivecomputations/DeepSeek-V3-0324-AWQ 2 days ago

Stuck when run on 8xH100

#8 opened 2 days ago by

New activity in cognitivecomputations/DeepSeek-V3-0324-AWQ 10 days ago

why tokenizer_config.json changed for AWQ model.

#7 opened 10 days ago by

New activity in cognitivecomputations/DeepSeek-V3-0324-AWQ 11 days ago

Does FlashMLA support kv cache fp8 dtype and how to enable FlashMLA ?

#6 opened 19 days ago by

New activity in cognitivecomputations/DeepSeek-R1-AWQ 12 days ago

AMD Instinct MI210 + vllm fail to run this model, any solutions please? Is there any other deepseek-r1-671b models that can run succesfully on AMD Instinct MI210 + vllm? Thanks!

#33 opened 13 days ago by

New activity in cognitivecomputations/DeepSeek-R1-AWQ 22 days ago

How to Resolve "GLIBC_2.32 Not Found" Error When Deploying vLLM Environment?

#32 opened 27 days ago by

New activity in cognitivecomputations/DeepSeek-V3-0324-AWQ 27 days ago

Can the 4090 device run this model?

#3 opened 27 days ago by

vllm crach with a slightly longer prompt

#4 opened 27 days ago by

New activity in cognitivecomputations/DeepSeek-V3-0324-AWQ 28 days ago

可以添加一下LICENSE文件吗？

#2 opened 28 days ago by

New activity in cognitivecomputations/DeepSeek-R1-AWQ 28 days ago

Are there any updates to the recommended commands?

#27 opened about 1 month ago by

New activity in cognitivecomputations/DeepSeek-R1-AWQ 29 days ago

Why hasn't the MTP layer of the 61st layer been quantized?

#30 opened 29 days ago by

Is there any testing on the support for running on other memory capacities

#29 opened 29 days ago by

New activity in cognitivecomputations/DeepSeek-R1-AWQ about 1 month ago

Any one can run this model with SGlang framework？

#13 opened 2 months ago by

DeepSeek-R1-AWQ quantized model missing one layer of experts

#28 opened about 1 month ago by

About the group size

#26 opened about 1 month ago by

New activity in cognitivecomputations/DeepSeek-R1-AWQ about 2 months ago

The awq quantization model may encounter garbled characters when performing inference on long texts.

#24 opened about 2 months ago by

How can I quantify my BF16 format model into AWQ?

#25 opened about 2 months ago by

Support for inference with MTP module?

#23 opened about 2 months ago by

New activity in cognitivecomputations/DeepSeek-V3-AWQ about 2 months ago

poor performance for DeepSeek-V3-AWQ

#9 opened about 2 months ago by

The V3-AWQ model's response seems not as expected

#8 opened about 2 months ago by

New activity in cognitivecomputations/DeepSeek-R1-AWQ about 2 months ago

Can't get 48 TPS on 8x H800

#21 opened about 2 months ago by