GImageReader Versions Save

A Gtk/Qt front-end to tesseract-ocr.

ci-latest-master

1 month ago

v3.4.2

3 months ago

gImageReader 3.4.2 (Feb 05 2024):

  • Bufgixes:
    • [Qt] Fix crash in FileTreeModel::findFile with temporary file
    • [Gtk] Correctly notify hOCR tree updates when merging items
    • [Win32] Fix dictionary installation directory
    • Quote x_font property in hOCR documents
    • Assorted Wayland fixes
  • Enhancements:
    • Add support for PoDoFo 0.10.x
    • Apply brightness/contrast/resolution/invert to all selected images
  • Updated translations
  • See https://github.com/manisandro/gImageReader/compare/v3.4.1...v3.4.2 for details

master

5 months ago

v3.4.1

1 year ago

gImageReader 3.4.1 (Jan 29 2023):

  • Bugfixes:
    • Fix warning about text in pdf incorrectly shown even if PDF has no text
    • Adapt for enchant2 dictionary location change
    • [Qt] Fix setting custom font for text editor
    • Fix crash in batch export dialog when selecting folder with no hocr html files
    • Assorted Wayland fixes
    • [Gtk] Fix incorrectly passing export filename to hOCR text and ODT export
    • [Gtk] Rework Utils::string_html_escape to fix possible unicode string corruption
  • Enhancements:
    • Add 2px margin to autodetected areas
    • Allow specifying custom script tessdatas by prepending prefix with script/ and leaving empty lang code
    • Use correct file extensions in crash save files
    • Make WConf visibility persistent
  • Updated translations
  • See https://github.com/manisandro/gImageReader/compare/v3.4.0...v3.4.1 for details

v3.4.0

2 years ago

gImageReader 3.4.0 (Jan 28 2022):

  • Add support for tesseract 5.0
  • Add Qt6 support
  • Add thumbnail view for source documents
  • Add batch mode for recognizing multiple documents
  • Display sources in a tree
  • Allow opening output files directly from the source tree if they exist next to the source with the same basename
  • Allow moving image selection boxes
  • Text: Add multi-tab support
  • HOCR: Allow specifying whether new output is inserted/appended
  • HOCR: Allow opening multiple files at once, also from command line
  • HOCR: Add proof-reading widget (Qt interface only)
  • HOCR: New batch export dialog
  • HOCR: Add quick navigation for low confidence words

v3.3.1

4 years ago

gImageReader 3.3.1 (Jul 28 2019):

  • HOCR: propagate attributes to manually added elements (@foghawk)
  • HOCR: improve spelling of hyphenated words (@foghawk)
  • HOCR: improve spelling of words with special characters (@foghawk)
  • HOCR: allow specifying a DPI to assume for image sources when exporting to PDF (@foghawk)
  • HOCR: allow use to choose whether to sanitize hyphens when exporting to PDF
  • HOCR: Attempt to map 639-2 language codes to ISO 639-1 to set spelling language
  • Allow specifying character whitelist / blacklist for recognition
  • Various bugfixes
  • Translation updates
  • Full details in commit log: https://github.com/manisandro/gImageReader/commits/master

v3.3.0

5 years ago

gImageReader 3.3.0 (Sep 26 2018): This is the first stable release of the 3.3.x series. The main change compared to 3.2.99 is support for the script traineddatas which were introduced with tesseract 4.x.

As with previous releases, the Windows builds using tesseract 4 are still to be considered experimental.

For a full list of changes between 3.2.99 and 3.3.0, see the git commit log.

v3.2.99

6 years ago

gImageReader 3.2.99 (Feb 24 2018) This is the beta release for gImageReader 3.3.0. The main highlight is a much expanded hOCR editor, and many bug fixes. Consult the changelog below for details. Special thanks to @ZaMaZaN4iK and @SantosSi for their valuable contributions both in code and improvement ideas.

There are a number of incomplete translations, so this would be a great moment for interested people to update their translations. gImageReader now hosts its translations on Weblate, so translating is easier than ever!

Please report any issues you might find to ensure a polished 3.3.0 release.

As with previous releases, the Windows builds using tesseract 4 are to be considered experimental.

Binary packages for Linux are available for Ubuntu in the gImageReader-devel PPA and for Fedora in this COPR repository.

Changelog

  • Add support for reading DJVU documents
  • Add support for encrypted PDF files
  • Rewrite HOCR editor and greatly expand its functionality:
    • Allow displaying confidence values in HOCR tree
    • Allow clicking in the canvas to jump to the corresponding item in the HOCR tree
    • Support mass-editing of HOCR child item attributes from parent
    • Honour font family attributes if possible
    • Honour and allow toggling bold and italic attributes
    • Correctly honour the baseline
    • Add search/replace and substitution list support
    • Add preview mode while editing
    • Allow manually adding lines, words and paragraphs
    • Allow swapping items
    • Automatically adjust parent bounding boxes when resizing and removing children
    • Add navigation toolbar to facilitate navigating through the HOCR tree
    • Use relative paths to source files in HOCR HTML document if source files are on same level or below the HOCR file
    • Add export to text
    • Add export to ODT
    • Allow choosing paper size in PDF export
    • Allow setting document metadata in PDF export
    • Allow setting encryption in PDF export
    • [Qt] Allow using QPrinter as PDF export backend, which has better support for complex scripts

v3.2.3

6 years ago

gImageReader 3.2.3 (Jul 01 2017):

  • Fix broken hOCR export
  • Add option to prepend source filename / page to plain text output

Please note that the tesseract4.0.0.git2b854e3 builds are experimental, intended for those who want to try out the latest tesseract 4.0.0 alpha version. Make sure you update your tessata files if you use that version!

v3.2.2

6 years ago

gImageReader 3.2.2 (Jun 30 2017):

  • Attempt to use original source image for PDF output
  • Allow collapsing/expanding branches of hOCR tree via context menu
  • Recognize guillemets as quote characters
  • Fix crash when adding zero-page sources
  • Fix possible crash when rapidly switching documents
  • [Gtk] Fix output pane orientation not properly restored
  • [Gtk] Don't crash when rendering of image fails
  • [Gtk] Fix icons not appearing with recent Gtk versions
  • [Qt] Don't display empty image if rendering of downscaled image fails

Please note that the tesseract4.0.0.git2b854e3 builds are experimental, intended for those who want to try out the latest tesseract 4.0.0 alpha version. Make sure you update your tessata files if you use that version!