---
license: mit
language:
- en
---

BlueHeeler-10M is a nanoGPT (GPT-2 architecture) model with 6 layers, 6 attention heads, and a 192-dimensional embedding, trained on scripts from the children's show Bluey.
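
As a rough illustration of what those dimensions imply, the sketch below writes them as nanoGPT-style config variables (`n_layer`, `n_head`, `n_embd`), reading 192 as the embedding width, and estimates the parameter count of a GPT-2-style model with tied input/output embeddings. The `vocab_size` and `block_size` values are assumptions that this card does not state, and `estimate_params` is a hypothetical helper, not part of nanoGPT.

```python
# Dimensions from the model description (192 read as the embedding width).
n_layer = 6
n_head = 6
n_embd = 192

# Assumptions, not stated in the card: GPT-2 BPE vocabulary and a short context.
vocab_size = 50257
block_size = 256


def estimate_params(n_layer, n_embd, vocab_size, block_size):
    """Estimate parameters of a GPT-2-style model with biases and tied embeddings."""
    d = n_embd
    # Per block: attention (4*d*d weights + 4*d biases), MLP (8*d*d + 5*d),
    # and two LayerNorms (4*d).
    per_block = 12 * d * d + 13 * d
    embeddings = vocab_size * d + block_size * d  # token + position tables
    final_ln = 2 * d
    return n_layer * per_block + embeddings + final_ln


head_dim = n_embd // n_head  # width of each attention head
total = estimate_params(n_layer, n_embd, vocab_size, block_size)
print(f"head_dim={head_dim}, approx params={total / 1e6:.1f}M")
```

Under these assumptions, most of the parameters sit in the token embedding table rather than in the transformer blocks, so the total depends heavily on the tokenizer actually used.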

Training log at iteration 2000: `iter 2000: loss 1.2913, time 30647.72ms, mfu 0.05%`
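
For anyone comparing runs, a log line in this format can be pulled apart with a small regular expression. This is a minimal sketch; `parse_log_line` is a hypothetical helper, and the pattern assumes the line layout matches the one shown above exactly.

```python
import math
import re

# Pattern for lines shaped like the log line above.
LOG_PATTERN = re.compile(
    r"iter (?P<iter>\d+): loss (?P<loss>[\d.]+), "
    r"time (?P<time_ms>[\d.]+)ms, mfu (?P<mfu_pct>-?[\d.]+)%"
)


def parse_log_line(line):
    """Return the fields of one training log line as a dict, or None if no match."""
    match = LOG_PATTERN.search(line)
    if match is None:
        return None
    return {
        "iter": int(match.group("iter")),
        "loss": float(match.group("loss")),
        "time_ms": float(match.group("time_ms")),
        "mfu_pct": float(match.group("mfu_pct")),
    }


record = parse_log_line("iter 2000: loss 1.2913, time 30647.72ms, mfu 0.05%")
# Cross-entropy loss in nats converts to per-token perplexity via exp(loss).
perplexity = math.exp(record["loss"])
print(record, f"perplexity={perplexity:.2f}")
```

Here `loss` is the training loss at that iteration, `time_ms` is the wall-clock time for the iteration, and `mfu_pct` is the model FLOPs utilization estimate.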