Update README.md
README.md
CHANGED
@@ -184,13 +184,12 @@ The following datasets were used for the instruction tuning.
   Japanese translation of the lmsys-chat-1m dataset using DeepL, with synthetic instruction data created using the Llama-3.1-405B model.
   'wo-pii' indicates removal of personally identifiable information.

- filtered
- Subset of magpie-ultra dataset, containing samples rated 'average'
- English version (filtered-magpie-ultra-en) translated to Japanese using Gemma 2 27B model.
+ filtered magpie-ultra
+ Subset of the [magpie-ultra](https://huggingface.co/datasets/argilla/magpie-ultra-v0.1) dataset, containing samples rated as 'average', 'good', or 'excellent'.

  gemma-magpie
- Japanese
- Generated using prompts for specific category
+ Japanese dataset.
+ Generated using prompts for specific category words.

 ## Risks and Limitations