Update README.md
README.md CHANGED
@@ -178,6 +178,19 @@ print(output[0].outputs[0].text)
### Instruction Tuning

The following datasets were used for the instruction tuning.

+- lmsys-chat-1m-synth-ja-wo-pii
+  - Japanese translation of the lmsys-chat-1m dataset using DeepL, with synthetic instruction data created using the Llama-3.1-405B model.
+  - 'wo-pii' indicates removal of personally identifiable information.
+- filtered-magpie-ultra-ja
+  - A subset of the magpie-ultra dataset, containing samples rated 'average', 'good', or 'excellent'.
+  - The English version (filtered-magpie-ultra-en) was translated to Japanese using the Gemma 2 27B model.
+- gemma-magpie
+  - A Japanese-only dataset.
+  - Generated using prompts for specific category-based question answering.
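
The quality filter described for filtered-magpie-ultra above can be illustrated with a short sketch. This is only a hedged illustration, not the pipeline used for this release: the Hugging Face dataset id `argilla/magpie-ultra-v0.1` and the `quality` column name are assumptions and may not match the data actually used.

```python
from datasets import load_dataset

# Ratings kept by the filtering step described in the list above
# ('average', 'good', or 'excellent').
KEEP_RATINGS = {"average", "good", "excellent"}

# Assumed dataset id and column name -- adjust to the actual source data.
ds = load_dataset("argilla/magpie-ultra-v0.1", split="train")
filtered = ds.filter(lambda example: example.get("quality") in KEEP_RATINGS)

print(f"Kept {len(filtered)} of {len(ds)} samples")
```

Per the list above, the Japanese variant (filtered-magpie-ultra-ja) would then be obtained by translating the retained English samples with the Gemma 2 27B model.
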
## Risks and Limitations