Ingestors extract the contents of mixed unstructured documents into structured (followthemoney) data.
⚠️ This release fixes a security vulnerability in ingest-file, the component that handles files uploaded to Aleph. Please update your Aleph instance to the latest patched versions of Aleph and ingest-file: ⚠️
Please refer to the release notes for Aleph 3.15.6 for detailed information.
Full Changelog: https://github.com/alephdata/ingest-file/compare/3.20.1...3.20.2
tesserocr
from source instead of using wheels because of https://github.com/sirfz/tesserocr/issues/337. This fixes a regression which might have caused certain image file types to not have been OCRd.clear-cache
command to the ingestors
CLI, which allows one to clear the ingest cache. It also takes a prefix (for instance ocr:
or pdf:
.Full Changelog: https://github.com/alephdata/ingest-file/compare/3.20.0...3.20.1
Full Changelog: https://github.com/alephdata/ingest-file/compare/3.19.3...3.20.0
Full Changelog: https://github.com/alephdata/ingest-file/compare/3.19.2...3.20.0-rc1
Full Changelog: https://github.com/alephdata/ingest-file/compare/3.19.2...3.19.3-rc1
Full Changelog: https://github.com/alephdata/ingest-file/compare/3.18.4...3.19.2
Full Changelog: https://github.com/alephdata/ingest-file/compare/3.18.4...3.19.2-rc1
Full Changelog: https://github.com/alephdata/ingest-file/compare/3.18.4...3.19.1
Full Changelog: https://github.com/alephdata/ingest-file/compare/3.18.4...3.19.0