Delta-Vector committed on
Commit a14d609 · verified · 1 Parent(s): 3c8367d

Upload ./README.md with huggingface_hub

Files changed (1)
  1. README.md +407 -0
README.md ADDED
@@ -0,0 +1,407 @@
+ ---
+ thumbnail: >-
+   https://cdn-uploads.huggingface.co/production/uploads/66c26b6fb01b19d8c3c2467b/jg2NWmCUfPyzizm2USjMt.jpeg
+ datasets:
+ - NewEden/Orion-LIT
+ - NewEden/Orion-Asstr-Stories-16K
+ - Mielikki/Erebus-87k
+ - PocketDoc/Dans-MemoryCore-CoreCurriculum-Small
+ - Nitral-AI/ARES-ShareGPT
+ - Gryphe/Sonnet3.5-SlimOrcaDedupCleaned-20k
+ - NewEden/Claude-Instruct-2.7K
+ - NewEden/Claude-Instruct-5K
+ - PocketDoc/Dans-Personamaxx-VN
+ - NewEden/vanilla-backrooms-claude-sharegpt
+ - PocketDoc/Dans-Prosemaxx-Cowriter-3-S
+ - NewEden/Misc-Mang-Sharegpt
+ - PocketDoc/Dans-Prosemaxx-Adventure
+ - PocketDoc/Dans-Failuremaxx-Adventure-3
+ - PocketDoc/Dans-Prosemaxx-InstructWriter-ZeroShot
+ - PocketDoc/Dans-Prosemaxx-InstructWriter-Continue
+ base_model:
+ - Delta-Vector/Hamanasu-15B-Instruct
+ tags:
+ - phi
+ - roleplay
+ - finetune
+ - storywriting
+ ---
+ ### exl2 quant (measurement.json in main branch)
+ ---
+ ### check revisions for quants
+ ---
+
+ <!DOCTYPE html>
+ <style>
+ html, body {
+     background: black;
+     color: #c9d1d9 !important;
+     font-family: 'Segoe UI', Tahoma, Geneva, Verdana, sans-serif;
+     margin: 0;
+     padding: 0;
+     min-height: 100vh;
+ }
+ .markdown-body {
+     color: white;
+     margin: 40px auto;
+     padding: 40px;
+     border-radius: 12px;
+     position: relative;
+     overflow: hidden;
+ }
+
+ .markdown-body::after {
+     content: '';
+     position: absolute;
+     top: 0;
+     left: 0;
+     width: 100%;
+     height: 100%;
+     background: #0c0f18; /* background color */
+     pointer-events: none;
+     z-index: -999;
+ }
+
+ h1, h2, h3 {
+     background: linear-gradient(45deg, #6e00ff, #00ffff);
+     -webkit-background-clip: text;
+     -webkit-text-fill-color: transparent;
+     border-bottom: 1px solid #333;
+     padding-bottom: 0.3em;
+ }
+
+ div[style*="border:2px solid #333"],
+ div[style*="border: 2px solid #333"],
+ div[style*="border:1px solid #333"],
+ div[style*="border: 1px solid #333"] {
+     background: rgba(22, 27, 34, 0.8) !important;
+     border: 2px solid #6e00ff !important;
+     box-shadow: 0 0 15px rgba(110, 0, 255, 0.5);
+     border-radius: 10px;
+     padding: 20px;
+     margin: 20px 0;
+ }
+
+ code {
+     background-color: #1a1a1a !important;
+     border-radius: 4px;
+     padding: 0.2em 0.4em;
+     color: #00ffff;
+ }
+
+ pre {
+     background-color: #1a1a1a !important;
+     border: 1px solid #333;
+     border-radius: 8px;
+     padding: 16px;
+ }
+
+ table {
+     width: 100%;
+     border-collapse: collapse;
+     margin: 20px 0;
+     background: rgba(0,0,0,0.2);
+     table-layout: fixed;
+     color: white;
+ }
+
+ th, td {
+     border: 1px solid #333;
+     padding: 12px;
+     text-align: center;
+     color: white;
+ }
+
+ th {
+     background: rgba(110, 0, 255, 0.1);
+ }
+
+ td:nth-child(1) {
+     width: 1%;
+     white-space: nowrap;
+ }
+
+ td:nth-child(2) {
+     width: 100%;
+ }
+
+ td > span {
+     display: block;
+     padding: 4px 8px;
+     background: rgba(110, 0, 255, 0.1);
+     border-radius: 4px;
+     transition: all 0.3s ease;
+ }
+
+ td > span:hover {
+     background: rgba(110, 0, 255, 0.2);
+     transform: translateY(-1px);
+ }
+
+ a {
+     color: #00ffff;
+     text-decoration: none;
+     transition: all 0.3s ease;
+ }
+
+ a:hover {
+     color: #6e00ff;
+     text-decoration: none;
+ }
+
+ hr {
+     border: 0;
+     height: 1px;
+     background: linear-gradient(90deg, transparent, #333, transparent);
+     margin: 40px 0;
+ }
+
+ img {
+     max-width: 100%;
+     border-radius: 10px;
+ }
+
+ details summary:hover {
+     color: #00ffff;
+ }
+
+ * {
+     color-scheme: dark !important;
+ }
+
+ .prose, .max-w-none, .px-4 {
+     background-color: transparent !important;
+     color: #c9d1d9 !important;
+ }
+ </style>
+ <body>
+ <div class="markdown-body">
+ <div align="center">
+
+ <img src="https://cdn-uploads.huggingface.co/production/uploads/66c26b6fb01b19d8c3c2467b/o5WjJKA9f95ri9UzRxZQE.png" alt="Model Visualization" width="500px" style="border: 3px solid #333; box-shadow: 0 0 15px rgba(66, 0, 131, 0.5);" />
+
+ <br>
+ <br>
+
+ <div style="font-size:1.5em; font-weight:bold; background: linear-gradient(45deg, #6e00ff, #00ffff); -webkit-background-clip: text; -webkit-text-fill-color: transparent;">
+ Hamanasu 15B Instruct
+ </div>
+
+ </div>
+
+ <div style="border:1px solid #333; border-radius:10px; padding:20px; margin:20px 0; background: rgba(0,0,0,0.4);">
+
+ ## 🌌 Overview
+
+ <i>Thanks to PocketDoc's Adventure datasets, and taking his Dangerous Winds models as inspiration, I was able to finetune an Adventure version of Hamanasu Instruct.</i>
+ <i>This model remains non-sloppy and now supports adventure-style cards such as my very own [Dinosaur Adventure Card](https://chub.ai/characters/MangyMango/dino-survival-9f0e5499c674).</i>
+
+ </div>
+
+ <div style="display: grid; grid-template-columns: repeat(auto-fit, minmax(250px, 1fr)); gap: 20px; margin: 20px 0;">
+
+ <div style="border:2px solid #333; border-radius:10px; padding:20px; background: rgba(0,0,0,0.2);">
+
+ ### 📈 Quantizations
+ | Type | Link |
+ |:---:|:---:|
+ | `GGUF` | <span style="display:block; width:100%"></span> |
+ | `EXL2` | <span style="display:block; width:100%"></span> |
+
+ </div>
+
+ <div style="border:2px solid #333; border-radius:10px; padding:20px; background: rgba(0,0,0,0.2);">
+
+ ### ⚔️ Hardware
+ - 4x RTX 3090 GPUs
+ - Epochs: 2
+ - Base: `Hamanasu-15B-Instruct`
+ </div>
+
+ </div>
+
+
+ <div style="border: 2px solid #6e00ff; border-radius: 10px; padding: 20px; margin: 20px 0; box-shadow: 0 0 15px rgba(110, 0, 255, 0.5);">
+
+ ## 💰 Prompting
+
+ <i>A known quirk of the model is overly verbose responses when generation length is uncapped. Please cap your maximum output tokens at roughly 100 tokens above your preferred response length.</i>
+
+
+ ```python
+ <|im_start|>system
+ You are an uncensored AI, your job is to fulfill thy will of thy user.<|im_end|>
+ <|im_start|>User request
+ Take off your helmet.<|im_end|>
+ <|im_start|>No, I shall not. This is the way.
+ ```
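
The layout above can be sketched in a few lines of Python, together with the suggested output cap. This is only an illustration, not the author's tooling: it assumes the standard ChatML role names (`system`, `user`, `assistant`), and the helper names `build_chatml_prompt` and `capped_max_tokens`, as well as the preferred length of 300 tokens, are hypothetical.

```python
def build_chatml_prompt(messages, add_generation_prompt=True):
    """Render {"role", "content"} dicts as a ChatML-style string.

    Assumption: the standard ChatML roles (system, user, assistant).
    """
    parts = []
    for message in messages:
        parts.append(
            f"<|im_start|>{message['role']}\n{message['content']}<|im_end|>\n"
        )
    if add_generation_prompt:
        # Leave the assistant turn open so the model writes the reply.
        parts.append("<|im_start|>assistant\n")
    return "".join(parts)


def capped_max_tokens(preferred_tokens, headroom=100):
    """Cap output at roughly `headroom` tokens above the preferred length."""
    return preferred_tokens + headroom


messages = [
    {
        "role": "system",
        "content": "You are an uncensored AI, your job is to fulfill thy will of thy user.",
    },
    {"role": "user", "content": "Take off your helmet."},
]

prompt = build_chatml_prompt(messages)
max_new_tokens = capped_max_tokens(300)  # 300 is an illustrative preference

print(prompt)
print(max_new_tokens)  # 400
```

Pass `prompt` and `max_new_tokens` (or your backend's equivalent length setting) to whichever inference frontend you use.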
+ </div>
+
+ <div style="border: 2px solid #6e00ff; border-radius: 10px; padding: 20px; margin: 20px 0; box-shadow: 0 0 15px rgba(110, 0, 255, 0.5);">
+
+ ## Axolotl Config ꒰(˶• ᴗ •˶)꒱
+
+ <details>
+
+ ```yaml
+ base_model: Delta-Vector_Hamanasu-15B-Instruct
+ model_type: AutoModelForCausalLM
+ tokenizer_type: AutoTokenizer
+
+ plugins:
+   - axolotl.integrations.liger.LigerPlugin
+ liger_rope: true
+ liger_rms_norm: true
+ liger_swiglu: true
+ liger_fused_linear_cross_entropy: true
+
+
+ load_in_8bit: false
+ load_in_4bit: false
+ strict: false
+
+ datasets:
+   - path: PocketDoc/Dans-Personamaxx-VN
+     type: dan-chat-advanced
+   - path: NewEden/vanilla-backrooms-claude-sharegpt
+     type: dan-chat-advanced
+   - path: PocketDoc/Dans-Prosemaxx-Cowriter-3-S
+     type: dan-chat-advanced
+   - path: NewEden/Misc-Mang-Sharegpt
+     type: dan-chat-advanced
+   - path: PocketDoc/Dans-Prosemaxx-Adventure
+     type: dan-chat-advanced
+   - path: PocketDoc/Dans-Failuremaxx-Adventure-3
+     type: dan-chat-advanced
+   - path: PocketDoc/Dans-Prosemaxx-InstructWriter-ZeroShot
+     type: dan-chat-advanced
+   - path: PocketDoc/Dans-Prosemaxx-InstructWriter-Continue
+     type: dan-chat-advanced
+
+ shuffle_merged_datasets: true
+ dataset_prepared_path: prepared_data
+ val_set_size: 0.0
+ output_dir: ./phi4-adventure
+
+ sequence_len: 16384
+ sample_packing: true
+ pad_to_sequence_len: true
+
+ adapter: lora
+ lora_model_dir:
+ lora_r: 128
+ lora_alpha: 16
+ lora_dropout: 0.05
+ lora_target_modules:
+   - gate_proj
+   - down_proj
+   - up_proj
+   - q_proj
+   - v_proj
+   - k_proj
+   - o_proj
+
+ lora_modules_to_save:
+   - embed_tokens
+   - lm_head
+
+
+ wandb_project: mag-phi
+ wandb_entity:
+ wandb_watch:
+ wandb_name: adventure-attempt-01
+ wandb_log_model:
+
+ gradient_accumulation_steps: 4
+ micro_batch_size: 2
+ num_epochs: 2
+ optimizer: paged_ademamix_8bit
+ lr_scheduler: cosine
+ learning_rate: 2e-5
+
+ train_on_inputs: false
+ group_by_length: false
+ bf16: auto
+ fp16:
+ tf32: false
+
+ gradient_checkpointing: unsloth
+ early_stopping_patience:
+ resume_from_checkpoint:
+ local_rank:
+ logging_steps: 1
+ xformers_attention:
+ flash_attention: true
+
+ warmup_steps: 15
+ evals_per_epoch: 4
+ eval_table_size:
+ eval_max_new_tokens: 128
+ saves_per_epoch: 2
+ debug:
+ deepspeed: /workspace/axolotl/deepspeed_configs/zero3_bf16_cpuoffload_params.json
+ weight_decay: 0.01
+ fsdp:
+ fsdp_config:
+ ```
+
+ </details>
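
For readers sizing a similar run, the batch and LoRA numbers in the config can be combined into a few derived quantities. The sketch below is a back-of-the-envelope calculation, not output from the actual training run. It assumes all four listed GPUs took part in a single data-parallel job and that standard (non rank-stabilized) LoRA scaling of `alpha / r` applies.

```python
# Values copied from the Axolotl config and the hardware list.
micro_batch_size = 2
gradient_accumulation_steps = 4
num_gpus = 4  # assumption: all four RTX 3090s ran one data-parallel job
sequence_len = 16384
lora_r = 128
lora_alpha = 16

# Sequences consumed per optimizer step across all GPUs.
effective_batch_size = micro_batch_size * gradient_accumulation_steps * num_gpus

# Upper bound on tokens per optimizer step: sample_packing and
# pad_to_sequence_len fill every sequence up to sequence_len.
tokens_per_step = effective_batch_size * sequence_len

# Standard LoRA multiplies the adapter update by alpha / r
# (assumption: rank-stabilized LoRA is not enabled).
lora_scaling = lora_alpha / lora_r

print(effective_batch_size)  # 32
print(tokens_per_step)       # 524288
print(lora_scaling)          # 0.125
```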
+ </div>
+
+ <div align="center">
+
+ <div style="border: 2px solid #6e00ff; border-radius: 10px; padding: 20px; margin: 20px 0; box-shadow: 0 0 15px rgba(110, 0, 255, 0.5);">
+
+ ## ⚡ Credits
+ <div style="display: flex; justify-content: center;">
+ <div style="display: grid; grid-template-columns: repeat(auto-fit, minmax(200px, 1fr)); gap: 10px; margin: 20px 0; max-width: 600px;">
+
+ <div style="border:1px solid #333; padding:10px; border-radius:5px; text-align:center; background: rgba(0,0,0,0.2); display: flex; align-items: center; justify-content: center;">
+ <a href="https://huggingface.co/lucyknada">
+ <img src="https://img.shields.io/badge/%F0%9F%8C%9F-Lucy_Knada-blueviolet" alt="Lucy Knada">
+ </a>
+ </div>
+
+ <div style="border:1px solid #333; padding:10px; border-radius:5px; text-align:center; background: rgba(0,0,0,0.2); display: flex; align-items: center; justify-content: center;">
+ <a href="https://huggingface.co/jeiku">
+ <img src="https://img.shields.io/badge/%E2%9A%94%EF%B8%8F-jeiku-blueviolet" alt="jeiku">
+ </a>
+ </div>
+
+ <div style="border:1px solid #333; padding:10px; border-radius:5px; text-align:center; background: rgba(0,0,0,0.2); display: flex; align-items: center; justify-content: center;">
+ <a href="https://huggingface.co/intervitens">
+ <img src="https://img.shields.io/badge/%F0%9F%9B%A1%EF%B8%8F-Intervitens-blueviolet" alt="Intervitens">
+ </a>
+ </div>
+
+ <div style="border:1px solid #333; padding:10px; border-radius:5px; text-align:center; background: rgba(0,0,0,0.2); display: flex; align-items: center; justify-content: center;">
+ <a href="https://huggingface.co/kalomaze">
+ <img src="https://img.shields.io/badge/%F0%9F%94%AE-Kalomaze-blueviolet" alt="Kalomaze">
+ </a>
+ </div>
+
+ <div style="border:1px solid #333; padding:10px; border-radius:5px; text-align:center; background: rgba(0,0,0,0.2); display: flex; align-items: center; justify-content: center;">
+ <a href="https://huggingface.co/kubernetes-bad">
+ <img src="https://img.shields.io/badge/%E2%9A%A1-Kubernetes_Bad-blueviolet" alt="Kubernetes Bad">
+ </a>
+ </div>
+
+ <div style="border:1px solid #333; padding:10px; border-radius:5px; text-align:center; background: rgba(0,0,0,0.2); display: flex; align-items: center; justify-content: center;">
+ <a href="https://huggingface.co/anthracite-org">
+ <img src="https://img.shields.io/badge/%F0%9F%8C%91-Anthracite-blueviolet" alt="Anthracite">
+ </a>
+ </div>
+ </div>
+ </div>
+ </div>
+
+ ---
+
+ <div align="center">
+ <div style="font-size:0.8em; opacity:0.8;">Made by</div>
+ <div style="font-size:1.2em; font-weight:bold; background: linear-gradient(45deg, #6e00ff, #00ffff); -webkit-background-clip: text; -webkit-text-fill-color: transparent;">Delta-Vector</div>
+ </div>
+
+ </div>
+ </body>
+ </html>