base_model: unsloth/qwen2.5-0.5b-instruct-unsloth-bnb-4bit
tags:
  - text-generation-inference
  - transformers
  - unsloth
  - qwen2
  - gguf
license: apache-2.0
language:
  - en

Generated with the Unsloth SimPO Colab notebook (SimPO_colab_notebook.ipynb):

https://colab.research.google.com/drive/1qHgk-YRz4pQHKER2QNjMXHgsERKyz8dF#scrollTo=ti7ZnQOY6s0O

How to use

ollama run hf.co/chenhaodev/qwen-mini-simpo-gguf

Expected Output

Compare this model's responses with those of qwen2.5-0.5b on prompts from the dataset https://huggingface.co/datasets/trl-lib/ultrafeedback_binarized/viewer/default/train?views%5B%5D=train&row=0
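
One way to run that comparison is to send the same prompt to both models through a local Ollama server. The sketch below is only an illustration, not part of the original notebook. It assumes Ollama is running on its default port (11434), that the baseline is available under the Ollama tag `qwen2.5:0.5b`, and that both models have already been pulled. The helper names `build_request`, `generate`, and `compare` are hypothetical.

```python
import json
import urllib.request

# Assumption: a local Ollama server listening on its default port.
OLLAMA_URL = "http://localhost:11434/api/generate"

# Assumption: "qwen2.5:0.5b" is the Ollama tag used as the baseline here.
BASELINE = "qwen2.5:0.5b"
TUNED = "hf.co/chenhaodev/qwen-mini-simpo-gguf"


def build_request(model: str, prompt: str) -> dict:
    """Build a non-streaming request body for Ollama's /api/generate endpoint."""
    return {"model": model, "prompt": prompt, "stream": False}


def generate(model: str, prompt: str, url: str = OLLAMA_URL) -> str:
    """Send the prompt to the Ollama server and return the generated text."""
    body = json.dumps(build_request(model, prompt)).encode("utf-8")
    req = urllib.request.Request(
        url, data=body, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read().decode("utf-8"))["response"]


def compare(prompt: str, ask=generate) -> dict:
    """Return both models' answers to the same prompt, keyed by model name."""
    return {model: ask(model, prompt) for model in (BASELINE, TUNED)}
```

With the server running, `compare("<a prompt copied from the dataset viewer>")` returns a dictionary holding one answer per model, which can then be read side by side. The `ask` parameter exists only so the comparison logic can be exercised without a server.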