MadeAgents
/

Hammer2.1-7b

Model card Files Files and versions Community

linqq9 commited on Dec 12, 2024

Commit

30720f1

·

verified ·

1 Parent(s): 4875e47

Update README.md

Files changed (1) hide show

README.md +4 -4

README.md CHANGED Viewed

@@ -43,9 +43,9 @@ The code of Hammer 2.1 models have been in the latest Hugging face transformers
 ## How to Use
 Hammer models offer flexibility in deployment and usage, fully supporting both **vLLM** deployment and **Hugging Face Transformers** tool calling. Below are the specifics on how to make use of these features:
-#### Using vLLM
-##### Option 1: Using Hammer client
 vLLM offers efficient serving with lower latency. To serve the model with vLLM:
 ```
 vllm serve MadeAgents/Hammer2.1-7b --host 0.0.0.0 --port 8000 --tensor-parallel-size 1
@@ -96,7 +96,7 @@ print(response)
 ~~~
-##### Option 2: Using vLLM’s built-in tool calling
 Hammer2.1 supports vllm’s built-in tool calling. This functionality requires vllm>=0.6. If you want to enable this functionality, please start vllm’s OpenAI-compatible service with:
 ~~~
 vllm serve MadeAgents/Hammer2.1-7b --enable-auto-tool-choice --tool-call-parser hermes
@@ -180,7 +180,7 @@ print(chat_response.choices[0].message.content)
 ~~~
-#### Using Hugging Face Transformers
 Hammer2.1’s chat template also includes a tool calling template, meaning that you can use Hugging Face transformers’ tool calling support. This is a simple example of how to use our model using Transformers.
 ~~~
 import torch

 ## How to Use
 Hammer models offer flexibility in deployment and usage, fully supporting both **vLLM** deployment and **Hugging Face Transformers** tool calling. Below are the specifics on how to make use of these features:
+### Using vLLM
+#### Option 1: Using Hammer client
 vLLM offers efficient serving with lower latency. To serve the model with vLLM:
 ```
 vllm serve MadeAgents/Hammer2.1-7b --host 0.0.0.0 --port 8000 --tensor-parallel-size 1
 ~~~
+#### Option 2: Using vLLM’s built-in tool calling
 Hammer2.1 supports vllm’s built-in tool calling. This functionality requires vllm>=0.6. If you want to enable this functionality, please start vllm’s OpenAI-compatible service with:
 ~~~
 vllm serve MadeAgents/Hammer2.1-7b --enable-auto-tool-choice --tool-call-parser hermes
 ~~~
+### Using Hugging Face Transformers
 Hammer2.1’s chat template also includes a tool calling template, meaning that you can use Hugging Face transformers’ tool calling support. This is a simple example of how to use our model using Transformers.
 ~~~
 import torch