README / README.md
Di Zhang
Update README.md
f62e303 verified
|
raw
history blame
861 Bytes
metadata
title: README
emoji: πŸ‘€
colorFrom: yellow
colorTo: indigo
sdk: static
pinned: false

The first version of LLaMA-O1 has been uploaded to HF now!Here He Comes!

Supervised:

https://huggingface.co/SimpleBerry/LLaMA-O1-Supervised-1129

Base(Pretrain):

https://huggingface.co/SimpleBerry/LLaMA-O1-Base-1127

Supervised Finetune Dataset:

https://huggingface.co/datasets/SimpleBerry/OpenLongCoT-SFT

Pretraining Dataset:

https://huggingface.co/datasets/SimpleBerry/OpenLongCoT-Pretrain-1202

RLHF is on the way! View our GitHub Repo:

https://github.com/SimpleBerry/LLaMA-O1

Our ongoing related researches:

https://huggingface.co/papers/2406.07394

https://huggingface.co/papers/2410.02884

https://huggingface.co/papers/2411.18203

image/png