metadata
base_model: unsloth/qwen2.5-0.5b-instruct-unsloth-bnb-4bit
tags:
- text-generation-inference
- transformers
- unsloth
- qwen2
- gguf
license: apache-2.0
language:
- en
Generated from unsloth SimPO_colab_notebook.ipynb
https://colab.research.google.com/drive/1qHgk-YRz4pQHKER2QNjMXHgsERKyz8dF#scrollTo=ti7ZnQOY6s0O
How to use?
ollama run hf.co/chenhaodev/qwen-mini-simpo-gguf
Expected Output
check difference between this model vs qwen2.5-0.5b on the dataset https://huggingface.co/datasets/trl-lib/ultrafeedback_binarized/viewer/default/train?views%5B%5D=train&row=0