Py Pdf Parser Versions Save

A Python tool to help extracting information from structured PDFs.

v0.12.0

6 months ago

[0.12.0] - 2023-11-10

Added

  • Added support for opening password protected files (#350)

v0.11.0

9 months ago

[0.11.0] - 2023-08-07

Changed

  • Various dependency updates
  • PyPI releases now use Trusted Publishers

Fixed

  • Fixed typo in docs (#361)

v0.10.2

1 year ago

[0.10.2] - 2022-11-07

Changed

  • Various dependency updates
  • Removed unused PyYAML dependency (#262)

v0.10.1

2 years ago

Fixed

  • The visualise function properly uses the elements parameter in order to filter visualised elements. (#256)

Changed

  • Various dependency updates

v0.10.0

2 years ago

Changed

  • [BREAKING] Changes from using pyqt5 to using tkinter for the visualise tool. This means we don't need the python3-dev as a requirement, and seems to solve endless issues with pyqt5 not finding the correct qt bindings. This is a potential breaking change, although the visualise tool is only in the development version. No code changes are needed, but you will need tkinter installed for visualise to still work.
  • Changed python version from 3.6 to 3.8 in .readthedocs.yml.

v0.9.0

2 years ago

Changed

  • Various dependency updates (matplotlib, pyqt5)
  • Removed all but the tests dockerfile for simplicity. Use Docker BuildKit. We will no longer be pushing images to DockerHub on release. (#203)

v0.8.0

3 years ago

Changed

  • Various dependency updates
  • Updated CI to avoid login issue (#182)

v0.7.0

3 years ago

Changed

  • Ensure we only accept LTTextBoxes at the top level (not LTTextLines) (#155)

v0.6.0

3 years ago

Added

  • Enabled dependabot which should help to keep packages up to date (#124)

Changes

  • Various dependency updates

Fixed

  • Fixed a typo in simple memo example in the documentation. (#121)

v0.5.0

3 years ago

Added

  • New functions on ElementList, move_forwards_from and move_backwards_from, to allow moving forwards and backwards from a certain element in the list easily. (#113)

Changed

  • When the layout parameter all_texts is True, the text inside figures is now also returned as elements in the document. (#99)

Fixed

  • Passing a tolerance less than the width/height of an element no longer causes an error. The tolerance is now capped at half the width/height of the element. (#103)