metadata
language:
- en
- nl
datasets:
- yhavinga/mc4_nl_cleaned
- yhavinga/ccmatrix
tags:
- translation
license: apache-2.0
Pretraining
The model was pre-trained on cleaned English and Dutch mC4 data.
Finetuning
The model was finetuned on ccmatrix with a maximum sequence length of 128 tokens. It was validated on Tatoeba.
Note: the model translates in both directions. Select the direction by prepending one of these prefixes to the input text: "translate Dutch to English:" or "translate English to Dutch:".
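The prefixing step can be sketched as follows. This is a minimal illustration, not the author's reference code: it assumes the checkpoint loads as a sequence-to-sequence model through Hugging Face transformers, that a single space follows the colon in each prefix, and that `model_name` (a placeholder argument) is replaced with this model's actual id. The helper names `build_input` and `translate` and the direction keys are hypothetical.

```python
# Task prefixes taken from the note above. The trailing space after the
# colon is an assumption, not something the model card states.
PREFIXES = {
    "nl-en": "translate Dutch to English: ",
    "en-nl": "translate English to Dutch: ",
}


def build_input(text: str, direction: str) -> str:
    """Prepend the task prefix for the requested direction."""
    if direction not in PREFIXES:
        raise ValueError(f"direction must be one of {sorted(PREFIXES)}")
    return PREFIXES[direction] + text.strip()


def translate(text: str, direction: str, model_name: str) -> str:
    """Translate one sentence; model_name is a placeholder for the model id."""
    # Imported lazily so build_input works without transformers installed.
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForSeq2SeqLM.from_pretrained(model_name)
    # 128 mirrors the finetuning length; longer inputs are truncated.
    inputs = tokenizer(
        build_input(text, direction),
        return_tensors="pt",
        truncation=True,
        max_length=128,
    )
    output_ids = model.generate(**inputs, max_new_tokens=128)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `translate("Hoe gaat het?", "nl-en", model_name)` with the real model id would send "translate Dutch to English: Hoe gaat het?" to the model. Since finetuning used at most 128 tokens, inputs much longer than a sentence or two are probably best split before translation.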