daekeun-ml/Phi-3-medium-4k-instruct-ko-poc-v0.1

daekeun-ml committed on May 26, 2024

Commit 0b62f3f (verified) · 1 Parent(s): 6e563cc

Update README.md

Files changed (1)

README.md +2 -2

@@ -18,7 +18,7 @@ pipeline_tag: text-generation
 # Phi-3-medium-4k-instruct-ko-poc-v0.1
 ## Model Details
-This model is trained using unsloth toolkit based on Microsoft's phi-3 model with some Korean instruction data added to enhance its Korean generation performance
+This model is trained using unsloth toolkit based on Microsoft's phi-3 Phi-3-medium-4k-instruct model (https://huggingface.co/unsloth/Phi-3-medium-4k-instruct) with some Korean instruction data added to enhance its Korean generation performance
 Since my role is not as a working developer, but as ML Technical Specialist helping customers with quick PoCs/prototypes, and I was limited by Azure GPU resources available, I only trained with 40,000 samples on a single VM Azure Standard_NC24ads_A100_v4 for PoC purposes. Because I have not done any tokenizer extensions, you need a lot more tokens than English for text generation.
@@ -127,7 +127,7 @@ x = 4.5
 ```
 ### References
-- Base model: [microsoft/phi-2](https://huggingface.co/microsoft/phi-2)
+- Base model: [unsloth/Phi-3-medium-4k-instruct](https://huggingface.co/unsloth/Phi-3-medium-4k-instruct)
 ## Notes