0617ee5 a8ce399 0617ee5 a8ce399
1
2
3
4
5
6
7
8
9
--- license: mit language: - en --- BlueHeeler-10M is a nanoGPT (GPT-2) 6-head x 6-layer x 192-deep model trained on scripts from the children's show Bluey. `iter 2000: loss 1.2913, time 30647.72ms, mfu 0.05%`