Model is loaded with use_cache=False by default
#4 opened about 11 hours ago by shawnghu
Can this model be run with vLLM?
1
#3 opened 2 months ago by just1nseo
Are you considering performing distillation experiments on the qwen70b model?
1
#2 opened 2 months ago by lambda1989
Huge thanks and congrats!
2
#1 opened 2 months ago by owao