DopeyTinyLlama-1.1B-v1

An experimental DPO finetune of SmarTinyLlama with Alpaca-QLoRA

Datasets

Trained on bagel style DPO datasets

Prompt Template

Uses chatml style prompt template

Downloads last month
8
Safetensors
Model size
1.1B params
Tensor type
FP16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for vihangd/DopeyTinyLlama-1.1B-v1

Merges
49 models
Quantizations
2 models