Establishing Task Scaling Laws via Compute-Efficient Model Ladders • arXiv:2412.04403 • Published Dec 5, 2024
DataDecide: How to Predict Best Pretraining Data with Small Experiments • arXiv:2504.11393 • Published Apr 2025
Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research • arXiv:2402.00159 • Published Jan 31, 2024
HINT: Hypernetwork Instruction Tuning for Efficient Zero-Shot Generalisation • arXiv:2212.10315 • Published Dec 20, 2022
Continued Pretraining for Better Zero- and Few-Shot Promptability • arXiv:2210.10258 • Published Oct 19, 2022
Catwalk: A Unified Language Model Evaluation Framework for Many Datasets • arXiv:2312.10253 • Published Dec 15, 2023
Paloma: A Benchmark for Evaluating Language Model Fit • arXiv:2312.10523 • Published Dec 16, 2023