yentinglin/Llama-3-Taiwan-8B-Instruct-128k

Text Generation
Transformers
Safetensors
Chinese
English
llama
zhtw
conversational
text-generation-inference

Size Mismatch Error

#7 opened 10 months ago by
mchl914

ValueError: Duplicated tensor name 'output.weight' when quantizing the 128k model

🚀 1
3
#5 opened 11 months ago by
Garfield1978

This table looks a bit odd

#3 opened 11 months ago by
wennycooper

What technique was used to extend context_window to 128k?

👀 1
#1 opened 11 months ago by
wennycooper