vihangd
/

DopeyTinyLlama-1.1B-v1

Text Generation

text-generation-inference

Model card Files Files and versions Community

DopeyTinyLlama-1.1B-v1

An experimental DPO finetune of SmarTinyLlama with Alpaca-QLoRA

Datasets

Trained on bagel style DPO datasets

Prompt Template

Uses chatml style prompt template

Downloads last month: 8

Safetensors

Model size

1.1B params

Tensor type

FP16

·

Inference Providers NEW

Text Generation

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for vihangd/DopeyTinyLlama-1.1B-v1

Merges

Quantizations