Docling: An Efficient Open-Source Toolkit for AI-driven Document Conversion Paper • 2501.17887 • Published Jan 27
Optimized Table Tokenization for Table Structure Recognition Paper • 2305.03393 • Published May 5, 2023
MolGrapher: Graph-based Visual Recognition of Chemical Structures Paper • 2308.12234 • Published Aug 23, 2023
DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis Paper • 2206.01062 • Published Jun 2, 2022 • 1
KVP10k : A Comprehensive Dataset for Key-Value Pair Extraction in Business Documents Paper • 2405.00505 • Published May 1, 2024
TableFormer: Table Structure Understanding with Transformers Paper • 2203.01017 • Published Mar 2, 2022