nvidia/gpt3-8b-multi-3.5t-base
Text Generation
English
Megatron-LM
nvidia
Mamba
Mamba-2
SSM
8B
arxiv:2406.07887
arxiv:2405.21060
License: apache-2.0
Files and versions
1 contributor
History: 3 commits
Latest commit: rwaleffe, "Update model arguments" (51d7f04), 11 months ago
| Name | Scan | Size | Last commit | Updated |
|---|---|---|---|---|
| release | - | - | Update model arguments | 11 months ago |
| .gitattributes | Safe | 1.52 kB | initial commit | 11 months ago |
| README.md | Safe | 2.18 kB | Upload model | 11 months ago |
| latest_checkpointed_iteration.txt | Safe | 8 Bytes | Upload model | 11 months ago |
| mt_nlg_plus_multilingual_ja_zh_the_stack_frac_015_256k.model | Safe | 4.57 MB (LFS) | Upload model | 11 months ago |