A large-scale synthetic Arabic OCR dataset comprising 843,622 book-style document images across 10 fonts, designed to advance VLM for Arabic Texts
Robotics and Interne-of-Things
riotu-lab
AI & ML interests
None yet
Recent Activity
new activity
2 days ago
ImruQays/16-million-raw-arabic-words:download problem
updated
a collection
4 days ago
SAND: Large-Scale Synthetic Arabic OCR Dataset
updated
a collection
4 days ago
SAND: Large-Scale Synthetic Arabic OCR Dataset
Organizations
None yet
Collections
3
models
19

riotu-lab/ArabianGPT-1.5B-FT-SA-v2
Updated
•
4

riotu-lab/Aranizer-PBE-64k
Updated
•
1

riotu-lab/Aranizer-SP-32k
Updated
•
1

riotu-lab/Aranizer-SP-64k
Updated
•
1

riotu-lab/Aranizer-SP-86k
Updated

riotu-lab/Aranizer-PBE-32k
Updated
•
1

riotu-lab/Aranizer-PBE-86k
Updated

riotu-lab/ArabianGPT-0.8B-Sum-FT
Updated

riotu-lab/ArabianGPT-0.8B-FT-QA
Updated
•
3

riotu-lab/ArabianGPT1.5B-QA-FT
Text Generation
•
Updated
•
8
•
1
datasets
10
riotu-lab/SAND-Extended
Preview
•
Updated
•
15
•
1
riotu-lab/SAND
Preview
•
Updated
•
3.97k
•
1
riotu-lab/arabic_reverse_dictionary
Viewer
•
Updated
•
58.6k
•
31
riotu-lab/ADMD
Viewer
•
Updated
•
980
•
224
•
1
riotu-lab/Synthetic-UAV-Flight-Trajectories
Viewer
•
Updated
•
766k
•
845
•
3
riotu-lab/combined-arabic-dataset
Viewer
•
Updated
•
523k
•
58
•
1
riotu-lab/ARABIC-RAW-TEXT
Viewer
•
Updated
•
100M
•
78
•
3
riotu-lab/ArabicQA_2.1M
Viewer
•
Updated
•
2.14M
•
24
•
2
riotu-lab/Arabic-books-and-research-dataset
Viewer
•
Updated
•
37k
•
37
•
3
riotu-lab/Quran-Tafseers
Viewer
•
Updated
•
56.1k
•
87
•
6