A Unified Toolkit for Deep Learning Based Document Image Analysis
Read and extract text and other content from PDFs in C# (port of PDFBox)
OCR engine for all the languages
Document Layout Analysis resources repos for development with PdfPig.
Doc2Graph transforms documents into graphs and exploit a GNN to solve se...