Commit History

Changing HF Space python version
75e90d9

lucalp commited on

Adding HF ZeroGPU torch version compatible
587f6ed

lucalp commited on

Revert "Removing CUDA deps"
7642f0e

lucalp commited on

cleaning things up via gemini 2.5 pro
f2f927b

lucalp commited on

more finishing touches
ad774a9

lucalp commited on

adding patch counts and cleaning up
545bc06

lucalp commited on

tiktoken & llama both plotted
b074257

lucalp commited on

Patches
a528449

lucalp commited on

Working locally, TBD HF space
2af55e5

lucalp commited on

Removing CUDA deps
d52b754

lucalp commited on

Not sure what happened, more deps
847b7ee

lucalp commited on

Adding again
c318efd

lucalp commited on

Removing gcc and gnu deps from req file
9825f33

lucalp commited on

Init for HF space
570eaa9

lucalp commited on

Visualisation working on CPU via CUDA_VISIBLE_DEVICE=-1 python demo_patcher.py 'Daenerys Targaryen is in Game of Thrones, a fantasy epic by George R.R. Martin.'
41ea791

lucalp commited on

Improve HF integration (#98)
1b67cbe
unverified

NielsRogge commited on

Open source weights! (#97)
96d51b5
unverified

par-meta commited on

Cast int sample id to str (#96)
e299427
unverified

Srinivasan Iyer sviyer commited on

Init distributed when loading model (#94)
138c2f3
unverified

Srinivasan Iyer sviyer commited on

Fix eval mask (#93)
19a3f75
unverified

Srinivasan Iyer sviyer commited on

remove selective activation checkpointing (#92)
8c1b1a7
unverified

Srinivasan Iyer sviyer commited on

update (#91)
1e78a49
unverified

par-meta commited on

Get generation working for BLT (#86)
b79eb3e
unverified

par-meta commited on

Fix in-place addition of patch_embds (#85)
2dcf48b
unverified

Hanna commited on

Some fixes for entropy model predictions (#83)
fc946a1
unverified

Srinivasan Iyer sviyer commited on

Update ppl evals to work with blt model, in addition to entropy model (#82)
083656c
unverified

par-meta commited on

Update iterate_data (#81)
f84ee63
unverified

par-meta commited on

Add way to call consolidate (#80)
c110f6b
unverified

Srinivasan Iyer sviyer commited on

When merging configs, do not merge data sources (#79)
a5ceaaa
unverified

Srinivasan Iyer sviyer commited on

Get evals working again. (#46)
7517ac2
unverified

par-meta commited on

Reduce per file resources arrow uses (#77)
63913e4
unverified

par-meta commited on

Let process start before yielding preloaded prefetch buffer, avoid needlessly losing buffer in edge cases (#75)
8f2cf88
unverified

par-meta commited on

Add approximate state persistence (#73)
ea1fc75
unverified

par-meta commited on

Fix rsync to not preserve original permissions, instead use destination (#76)
9bd51df
unverified

par-meta commited on

Correctly reset batch iterator at each arrow create_iter call. (#74)
c727844
unverified

par-meta commited on

Pass mask in packing_iterator, correctly handle last batch, fix masking (#65)
08b8c7c
unverified

par-meta commited on

Initialize rope embeddings properly for the entropy model (#72)
0da051f
unverified

Srinivasan Iyer sviyer commited on

Remove byte tokenizer and add config args to switch between byte/patch packing (#68)
aeb95f1
unverified

par-meta commited on

Add vocab and seq len abstract fields (#66)
ff36aa8
unverified

par-meta commited on

Fix: Correct model_args usage in parallelize_model call (#69)
a6ed14f
unverified

Bocheng Li commited on

Update iterator inheritance, pass file format args, limit iterator (#63)
fc3399e
unverified

par-meta commited on

Make apex logs less noisy (#60)
b0956bd
unverified

par-meta commited on

Make it possible to specify multiple config files (#54)
82ab593
unverified

par-meta commited on

fix(README): correct typo in quickstart instructions (#62)
9f29e0d
unverified

CharlesCNorton commited on

using apex rmsnorm (#57)
f3e8125
unverified

Srinivasan Iyer sviyer commited on

Update README.md (#58)
c49e251
unverified

Srinivasan Iyer commited on

Fix multiprocessing dataloader checkpointing and use it in the train script (#50)
8c61ab5
unverified

par-meta commited on

Test first batch matches (#53)
85c2f28
unverified

par-meta commited on

disable reshard after forward (#56)
9d907fe
unverified

Srinivasan Iyer sviyer commited on

make sure max_encoder_seq_length matches (#55)
48e4ad0
unverified

Srinivasan Iyer sviyer commited on