Sharded version of This model. Use the tokenizer from there

from transformers import LlamaTokenizer, AutoModelForCausalLM

tokenizer = LlamaTokenizer.from_pretrained("NousResearch/Nous-Hermes-13b")
model = AutoModelForCausalLM.from_pretrained("simsim314/Hermes-13b-hf-shards")
Downloads last month
17
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support