Update README.md
Browse files
README.md
CHANGED
@@ -19,14 +19,15 @@ See the Swallow Model Index section to find other model variants.
|
|
19 |
|
20 |
# Release History
|
21 |
|
|
|
22 |
- **October 08, 2024**: Released [Llama-3.1-Swallow-8B-v0.1](https://huggingface.co/tokyotech-llm/Llama-3.1-Swallow-8B-v0.1), [Llama-3.1-Swallow-8B-Instruct-v0.1](https://huggingface.co/tokyotech-llm/Llama-3.1-Swallow-8B-Instruct-v0.1), [Llama-3.1-Swallow-70B-v0.1](https://huggingface.co/tokyotech-llm/Llama-3.1-Swallow-70B-v0.1), and [Llama-3.1-Swallow-70B-Instruct-v0.1](https://huggingface.co/tokyotech-llm/Llama-3.1-Swallow-70B-Instruct-v0.1).
|
23 |
|
24 |
## Swallow Model Index
|
25 |
|
26 |
-
|Model|Llama-3.1-Swallow|Llama-3.1-Swallow-Instruct|
|
27 |
-
|
28 |
-
|8B| [Link](https://huggingface.co/tokyotech-llm/Llama-3.1-Swallow-8B-v0.1) | [Link](https://huggingface.co/tokyotech-llm/Llama-3.1-Swallow-8B-Instruct-v0.1) |
|
29 |
-
|70B| [Link](https://huggingface.co/tokyotech-llm/Llama-3.1-Swallow-70B-v0.1) | [Link](https://huggingface.co/tokyotech-llm/Llama-3.1-Swallow-70B-Instruct-v0.1) |
|
30 |
|
31 |

|
32 |
|
@@ -55,7 +56,8 @@ The website [https://swallow-llm.github.io/](https://swallow-llm.github.io/) pro
|
|
55 |
| Llama 3.1 8B | 0.8436 | 0.4461 | 0.4050 | 0.8962 | 0.1794 | 0.3560 | 0.2209 | 0.2077 | 0.4767 | 0.3274 | 0.4359 |
|
56 |
| Llama 3 Youko 8B | 0.8660 | 0.4902 | 0.5155 | 0.8947 | 0.2127 | 0.2840 | 0.2740 | 0.2180 | 0.4493 | 0.2183 | 0.4423 |
|
57 |
| Llama 3 Swallow 8B | 0.8945 | 0.4848 | 0.5640 | 0.8947 | 0.1981 | 0.4240 | 0.2758 | 0.2223 | 0.4699 | 0.2890 | 0.4717 |
|
58 |
-
| Llama 3.1 Swallow 8B | 0.9124 |
|
|
|
59 |
|
60 |
### English tasks
|
61 |
|
@@ -70,7 +72,8 @@ The website [https://swallow-llm.github.io/](https://swallow-llm.github.io/) pro
|
|
70 |
| Llama 3.1 8B | 0.3780 | 0.7017 | 0.6094 | 0.3330 | **0.9045** | 0.6525 | 0.5057 | 0.6176 | 0.3695 | 0.5636 |
|
71 |
| Llama 3 Youko 8B | 0.3500 | 0.6252 | 0.5885 | 0.3247 | 0.8959 | 0.5993 | 0.3571 | 0.5704 | 0.2793 | 0.5100 |
|
72 |
| Llama 3 Swallow 8B | 0.3520 | 0.6563 | 0.5901 | 0.3507 | 0.9006 | 0.6152 | 0.4875 | 0.5936 | 0.3323 | 0.5420 |
|
73 |
-
| Llama 3.1 Swallow 8B | 0.3800 | 0.6711 | 0.6057 | 0.3468 | 0.9032 | 0.6237 | 0.5110 | 0.6153 | 0.3622 | 0.5577 |
|
|
|
74 |
|
75 |
## Evaluation Benchmarks
|
76 |
|
|
|
19 |
|
20 |
# Release History
|
21 |
|
22 |
+
- **November 11, 2024**: Released [Llama-3.1-Swallow-8B-v0.2](https://huggingface.co/tokyotech-llm/Llama-3.1-Swallow-8B-v0.2) and [Llama-3.1-Swallow-8B-Instruct-v0.2](https://huggingface.co/tokyotech-llm/Llama-3.1-Swallow-8B-Instruct-v0.2).
|
23 |
- **October 08, 2024**: Released [Llama-3.1-Swallow-8B-v0.1](https://huggingface.co/tokyotech-llm/Llama-3.1-Swallow-8B-v0.1), [Llama-3.1-Swallow-8B-Instruct-v0.1](https://huggingface.co/tokyotech-llm/Llama-3.1-Swallow-8B-Instruct-v0.1), [Llama-3.1-Swallow-70B-v0.1](https://huggingface.co/tokyotech-llm/Llama-3.1-Swallow-70B-v0.1), and [Llama-3.1-Swallow-70B-Instruct-v0.1](https://huggingface.co/tokyotech-llm/Llama-3.1-Swallow-70B-Instruct-v0.1).
|
24 |
|
25 |
## Swallow Model Index
|
26 |
|
27 |
+
|Model|Llama-3.1-Swallow v0.1|Llama-3.1-Swallow-Instruct v0.1|Llama-3.1-Swallow v0.2|Llama-3.1-Swallow-Instruct v0.2|
|
28 |
+
|---|---|---|---|---|
|
29 |
+
|8B| [Link](https://huggingface.co/tokyotech-llm/Llama-3.1-Swallow-8B-v0.1) | [Link](https://huggingface.co/tokyotech-llm/Llama-3.1-Swallow-8B-Instruct-v0.1) | [Link](https://huggingface.co/tokyotech-llm/Llama-3.1-Swallow-8B-v0.2) | [Link](https://huggingface.co/tokyotech-llm/Llama-3.1-Swallow-8B-Instruct-v0.2) |
|
30 |
+
|70B| [Link](https://huggingface.co/tokyotech-llm/Llama-3.1-Swallow-70B-v0.1) | [Link](https://huggingface.co/tokyotech-llm/Llama-3.1-Swallow-70B-Instruct-v0.1) | | |
|
31 |
|
32 |

|
33 |
|
|
|
56 |
| Llama 3.1 8B | 0.8436 | 0.4461 | 0.4050 | 0.8962 | 0.1794 | 0.3560 | 0.2209 | 0.2077 | 0.4767 | 0.3274 | 0.4359 |
|
57 |
| Llama 3 Youko 8B | 0.8660 | 0.4902 | 0.5155 | 0.8947 | 0.2127 | 0.2840 | 0.2740 | 0.2180 | 0.4493 | 0.2183 | 0.4423 |
|
58 |
| Llama 3 Swallow 8B | 0.8945 | 0.4848 | 0.5640 | 0.8947 | 0.1981 | 0.4240 | 0.2758 | 0.2223 | 0.4699 | 0.2890 | 0.4717 |
|
59 |
+
| Llama 3.1 Swallow 8B v0.1 | 0.9124 | 0.5092 | 0.6011 | 0.8991 | 0.2020 | 0.4600 | 0.2909 | 0.2313 | 0.5182 | 0.2811 | 0.4905 |
|
60 |
+
| Llama 3.1 Swallow 8B v0.2 | 0.9106 | **0.5097** | 0.6272 | 0.8922 | 0.1976 | 0.4640 | **0.2957** | **0.2326** | 0.5253 | 0.3360 | **0.4991** |
|
61 |
|
62 |
### English tasks
|
63 |
|
|
|
72 |
| Llama 3.1 8B | 0.3780 | 0.7017 | 0.6094 | 0.3330 | **0.9045** | 0.6525 | 0.5057 | 0.6176 | 0.3695 | 0.5636 |
|
73 |
| Llama 3 Youko 8B | 0.3500 | 0.6252 | 0.5885 | 0.3247 | 0.8959 | 0.5993 | 0.3571 | 0.5704 | 0.2793 | 0.5100 |
|
74 |
| Llama 3 Swallow 8B | 0.3520 | 0.6563 | 0.5901 | 0.3507 | 0.9006 | 0.6152 | 0.4875 | 0.5936 | 0.3323 | 0.5420 |
|
75 |
+
| Llama 3.1 Swallow 8B v0.1 | 0.3800 | 0.6711 | 0.6057 | 0.3468 | 0.9032 | 0.6237 | 0.5110 | 0.6153 | 0.3622 | 0.5577 |
|
76 |
+
| Llama 3.1 Swallow 8B v0.2 | 0.3820 | 0.6510 | 0.5955 | 0.3473 | 0.9041 | 0.6227 | 0.5208 | 0.6053 | 0.3659 | 0.5549 |
|
77 |
|
78 |
## Evaluation Benchmarks
|
79 |
|