XiangdaLi committed · Commit 713a5e4 · 1 parent: 9c3c977
Upload Qwen2.5-14B-Instruct-1M GGUF model
- .gitattributes +1 -0
- Qwen2.5-14B-Instruct-1M.gguf +3 -0
- README.md +4 -0
.gitattributes
CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
+*.gguf filter=lfs diff=lfs merge=lfs -text
Qwen2.5-14B-Instruct-1M.gguf
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:00517c6fabea6f9f19bdf59af62f3884127c80f7e03bb78b5e96bf060d8e6eb4
+size 29547716384
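The three added lines are a Git LFS pointer, not the model itself: the repository stores only the SHA-256 digest and byte size of the real file. A minimal sketch of how a downloaded copy could be checked against that pointer follows. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical and not part of this commit, and the parser assumes the simple `key value` line layout shown in the diff.

```python
import hashlib
from pathlib import Path

# Pointer contents copied from the diff above.
POINTER_TEXT = """\
version https://git-lfs.github.com/spec/v1
oid sha256:00517c6fabea6f9f19bdf59af62f3884127c80f7e03bb78b5e96bf060d8e6eb4
size 29547716384
"""


def parse_lfs_pointer(text):
    """Split each 'key value' line and unpack the 'algo:digest' oid field."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer, chunk_size=1 << 20):
    """Return True if the file at `path` has the pointer's size and digest."""
    path = Path(path)
    # Cheap check first: a size mismatch avoids hashing a multi-gigabyte file.
    if path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with path.open("rb") as handle:
        # Stream in chunks so the whole file is never held in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


pointer = parse_lfs_pointer(POINTER_TEXT)
```

Calling `matches_pointer("Qwen2.5-14B-Instruct-1M.gguf", pointer)` on a local download would then confirm that the file is the one this commit refers to, assuming the download kept the original filename.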
README.md
ADDED
@@ -0,0 +1,4 @@
+# Qwen2.5-14B-Instruct-1M-GGUF
+This is a quantized GGUF version of Qwen2.5-14B-Instruct-1M.
+Converted from Safetensors using mixed precision quantization.
+Optimized for efficient inference using llama.cpp or text-generation-webui.
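The README does not say which precision the conversion used, but the file size in the LFS pointer allows a rough sanity check. The sketch below estimates the average bits stored per weight. The parameter count of 14.7 billion is an assumption that does not appear in this commit, and the estimate ignores GGUF metadata and tokenizer data, so it should be read as an approximation only.

```python
# Size in bytes, taken from the LFS pointer in this commit.
FILE_SIZE_BYTES = 29_547_716_384

# ASSUMPTION: approximate parameter count of the base model;
# this figure is not stated anywhere in the commit itself.
ASSUMED_PARAMS = 14.7e9


def bits_per_weight(size_bytes, n_params):
    """Average number of bits the file spends per model parameter."""
    return size_bytes * 8 / n_params


bpw = bits_per_weight(FILE_SIZE_BYTES, ASSUMED_PARAMS)
print(f"{FILE_SIZE_BYTES / 1e9:.1f} GB, about {bpw:.1f} bits per weight")
```

Under that assumption the result is close to 16 bits per weight, so anyone planning memory for llama.cpp or text-generation-webui may want to budget for roughly 30 GB of weights plus context rather than for a low-bit quantization.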