droussis committed (verified) · Commit 2515953 · Parent: 8f333f3

Improve things a bit

Files changed (1): README.md (+7 -4)
README.md CHANGED
@@ -19,7 +19,9 @@ Krikri is built on top of [Llama-3.1-8B](https://huggingface.co/meta-llama/Llama
 ![image/png](llama-krikri-image.jpg)
 
 
-# Base Model Information
+# Model Information
+
+## Base Model
 
 - Vocabulary extension of the Llama-3.1 tokenizer with Greek tokens
 - 128k context length (approximately 80,000 Greek words)
@@ -41,13 +43,14 @@ Krikri is built on top of [Llama-3.1-8B](https://huggingface.co/meta-llama/Llama
 
 Chosen subsets of the 91 billion corpus were upsampled resulting in a size of **110 billion tokens**.
 
-# Instruct Model Information
+## Instruct Model
 
-🚨 **More information of the post-training corpus and methdology coming soon.** 🚨
+🚨 **More information on the post-training corpus and methodology coming soon.** 🚨
 
 
 # How to use
 
+## With Transformers
 
 ```python
 from transformers import AutoModelForCausalLM, AutoTokenizer
@@ -73,7 +76,7 @@ outputs = model.generate(input_prompt['input_ids'], max_new_tokens=256, do_sampl
 print(tokenizer.batch_decode(outputs)[0])
 ```
 
-# How to serve with OpenAI compatible server via vLLM
+## With OpenAI compatible server via vLLM
 
 ```bash
 vllm serve ilsp/Llama-Krikri-8B-Instruct \
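
The Transformers snippet appears only partially in the hunks above. A minimal sketch of how such a snippet typically runs end to end, assuming the standard chat-template API; the messages are illustrative, and the truncated `do_sampl` in the last hunk header is assumed to complete as `do_sample=True`:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "ilsp/Llama-Krikri-8B-Instruct"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"
)

# Illustrative conversation; any system/user messages work the same way.
messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Tell me about ancient Greek literature."},
]

# Build the prompt with the model's chat template and tokenize it.
input_prompt = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_dict=True, return_tensors="pt"
).to(model.device)

# Mirrors the generate() call visible in the last hunk header
# (do_sample=True assumed from the truncated `do_sampl`).
outputs = model.generate(
    input_prompt["input_ids"], max_new_tokens=256, do_sample=True
)
print(tokenizer.batch_decode(outputs)[0])
```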
 
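
Once `vllm serve` is up, any OpenAI-compatible client can query it. A minimal sketch assuming vLLM's default endpoint (`http://localhost:8000/v1`) and the official `openai` Python package; the prompt text is illustrative:

```python
from openai import OpenAI

# vLLM's OpenAI-compatible server listens on port 8000 by default;
# the api_key is unused unless the server was started with --api-key.
client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")

response = client.chat.completions.create(
    model="ilsp/Llama-Krikri-8B-Instruct",
    messages=[
        {"role": "user", "content": "Tell me about ancient Greek literature."}
    ],
    max_tokens=256,
)
print(response.choices[0].message.content)
```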