tahamajs
/

llama-3.2-3b-dpo-lora64-4bit-instruct

Model card Files Files and versions Metrics Training metrics Community

llama-3.2-3b-dpo-lora64-4bit-instruct

Ctrl+K

Ctrl+K

1 contributor

History: 4 commits

tahamajs's picture

Upload DPO fine-tuned checkpoint

55e954e verified 19 days ago

runs
Upload DPO fine-tuned checkpoint 19 days ago
.gitattributes

1.57 kB

Tokenizer for DPO model (Trained with Unsloth) 19 days ago
README.md

5.18 kB

Tokenizer for DPO model (Trained with Unsloth) 19 days ago
adapter_config.json

812 Bytes

Initial commit of DPO model after training 19 days ago
adapter_model.safetensors

389 MB
LFS

Initial commit of DPO model after training 19 days ago
special_tokens_map.json

454 Bytes

Tokenizer for DPO model (Trained with Unsloth) 19 days ago
tokenizer.json

17.2 MB
LFS

Tokenizer for DPO model (Trained with Unsloth) 19 days ago
tokenizer_config.json

51.2 kB

Tokenizer for DPO model (Trained with Unsloth) 19 days ago