Transforms PDF, Documents and Images into Enriched Structured Data
🏭 PDF text extraction pipeline: self-hosted, local-first, Docker-based