XiangdaLi commited on
Commit
713a5e4
·
1 Parent(s): 9c3c977

Upload Qwen2.5-14B-Instruct-1M GGUF model

Browse files
Files changed (3) hide show
  1. .gitattributes +1 -0
  2. Qwen2.5-14B-Instruct-1M.gguf +3 -0
  3. README.md +4 -0
.gitattributes CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ *.gguf filter=lfs diff=lfs merge=lfs -text
Qwen2.5-14B-Instruct-1M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:00517c6fabea6f9f19bdf59af62f3884127c80f7e03bb78b5e96bf060d8e6eb4
3
+ size 29547716384
README.md ADDED
@@ -0,0 +1,4 @@
 
 
 
 
 
1
+ # Qwen2.5-14B-Instruct-1M-GGUF
2
+ This is a quantized GGUF version of Qwen2.5-14B-Instruct-1M.
3
+ Converted from Safetensors using mixed precision quantization.
4
+ Optimized for efficient inference using llama.cpp or text-generation-webui.