- SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling
  Paper • 2312.15166 • Published • 59
- PowerInfer: Fast Large Language Model Serving with a Consumer-grade GPU
  Paper • 2312.12456 • Published • 44
- Cached Transformers: Improving Transformers with Differentiable Memory Cache
  Paper • 2312.12742 • Published • 14
- Mini-GPTs: Efficient Large Language Models through Contextual Pruning
  Paper • 2312.12682 • Published • 10
Maximous Black
maximousblk
Recent Activity
updated a collection 19 days ago: papers
updated a collection 20 days ago: papers
updated a collection 11 months ago: papers
Collections
1