LagPixelLOL
v2ray
AI & ML interests
Looking for compute sponsors, please contact me through my email [email protected]!
Recent Activity
new activity
2 days ago
cognitivecomputations/DeepSeek-V3-0324-AWQ:Stuck when run on 8xH100
updated
a model
10 days ago
v2ray/wheels
new activity
10 days ago
cognitivecomputations/DeepSeek-V3-0324-AWQ:why tokenizer_config.json changed for AWQ model.
v2ray's activity
Stuck when run on 8xH100
1
#8 opened 2 days ago
by
Thai
why tokenizer_config.json changed for AWQ model.
2
#7 opened 10 days ago
by
rockcat-miao
Does FlashMLA support the FP8 KV cache dtype, and how do I enable FlashMLA?
9
#6 opened 19 days ago
by
CharlesLincoln
How to Resolve "GLIBC_2.32 Not Found" Error When Deploying vLLM Environment?
8
#32 opened 27 days ago
by
lastsummerLi
Can the 4090 device run this model?
3
#3 opened 27 days ago
by
jinzhongwei
vLLM crashes with a slightly longer prompt
1
#4 opened 27 days ago
by
rockcat-miao
Could you add a LICENSE file?
2
#2 opened 28 days ago
by
adol-ch
Are there any updates to the recommended commands?
5
#27 opened about 1 month ago
by
NaiveYan
Why hasn't the MTP layer of the 61st layer been quantized?
1
#30 opened 29 days ago
by
yang001002
Has support for running on other memory capacities been tested?
1
#29 opened 29 days ago
by
HRan2004
Can anyone run this model with the SGLang framework?
5
#13 opened 2 months ago
by
muziyongshixin
DeepSeek-R1-AWQ quantized model missing one layer of experts
4
#28 opened about 1 month ago
by
virilo
About the group size
1
#26 opened about 1 month ago
by
Skyeaee

The AWQ-quantized model may produce garbled characters during inference on long texts.
9
#24 opened about 2 months ago
by
wx111
How can I quantize my BF16 model to AWQ?
1
#25 opened about 2 months ago
by
AlipaySimon

Support for inference with MTP module?
1
#23 opened about 2 months ago
by
yhh001
poor performance for DeepSeek-V3-AWQ
2
#9 opened about 2 months ago
by
fridayl
The V3-AWQ model's response seems not as expected
12
#8 opened about 2 months ago
by
juxing
Can't get 48 TPS on 8x H800
1
#21 opened about 2 months ago
by
Light4Bear
