Push tokenizer again

#3 opened by tomaarsen (HF Staff)
Sentence Transformers - Cross-Encoders org

In this PR, I'm repushing the tokenizer with a newer version of transformers, with the goal of also generating the tokenizer.json used by the tokenizers-backed fast tokenizers. There should not be any changes in the behavior of the tokenizer; the only difference is that Transformers and Sentence Transformers can now load the fast tokenizer directly, without having to convert it from the slow tokenizer first. This should make loading the model faster.
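As a minimal sketch of what the tokenizer.json file contains (not the actual conversion this PR performed, which transformers handles internally): it is a single serialized file holding the full tokenization pipeline, which the tokenizers library can load directly without any slow-to-fast conversion step. The tiny WordPiece vocab below is purely illustrative.

```python
from tokenizers import Tokenizer, models, pre_tokenizers

# Illustrative toy vocab; a real model's vocab has tens of thousands of entries.
vocab = {"[UNK]": 0, "[CLS]": 1, "[SEP]": 2, "hello": 3, "world": 4}

# Build a minimal WordPiece tokenizer and serialize it to tokenizer.json,
# the same file format this PR adds to the repository.
tokenizer = Tokenizer(models.WordPiece(vocab, unk_token="[UNK]"))
tokenizer.pre_tokenizer = pre_tokenizers.Whitespace()
tokenizer.save("tokenizer.json")

# Loading from tokenizer.json is a direct deserialization, with no
# conversion from slow-tokenizer vocab files needed.
reloaded = Tokenizer.from_file("tokenizer.json")
print(reloaded.encode("hello world").tokens)
```

When a repository ships this file, `AutoTokenizer.from_pretrained` can instantiate the fast tokenizer from it directly, which is where the loading speedup comes from.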

  • Tom Aarsen
tomaarsen changed pull request status to merged