SAND: Large-Scale Synthetic Arabic OCR Dataset
Collection
A large-scale synthetic Arabic OCR dataset comprising 843,622 book-style document images across 10 fonts, designed to advance vision-language models (VLMs) for Arabic text.
2 items