jingyeom commited on
Commit
f29d988
·
verified ·
1 Parent(s): 2586e2c

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +0 -34
README.md CHANGED
@@ -19,40 +19,6 @@ should probably proofread and complete it, then remove this comment. -->
19
 
20
  This model is a fine-tuned version of [yanolja/EEVE-Korean-10.8B-v1.0](https://huggingface.co/yanolja/EEVE-Korean-10.8B-v1.0) on the generator dataset.
21
 
22
- ## Model description
23
-
24
- More information needed
25
-
26
- ## Intended uses & limitations
27
-
28
- More information needed
29
-
30
- ## Training and evaluation data
31
-
32
- More information needed
33
-
34
- ## Training procedure
35
-
36
- ### Training hyperparameters
37
-
38
- The following hyperparameters were used during training:
39
- - learning_rate: 1e-06
40
- - train_batch_size: 1
41
- - eval_batch_size: 8
42
- - seed: 42
43
- - distributed_type: multi-GPU
44
- - num_devices: 6
45
- - gradient_accumulation_steps: 16
46
- - total_train_batch_size: 96
47
- - total_eval_batch_size: 48
48
- - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
49
- - lr_scheduler_type: cosine
50
- - lr_scheduler_warmup_ratio: 0.03
51
- - num_epochs: 1
52
-
53
- ### Training results
54
-
55
-
56
 
57
  ### Framework versions
58
 
 
19
 
20
  This model is a fine-tuned version of [yanolja/EEVE-Korean-10.8B-v1.0](https://huggingface.co/yanolja/EEVE-Korean-10.8B-v1.0) on the generator dataset.
21
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
22
 
23
  ### Framework versions
24