Tesseract Versions Save

Tesseract Open Source OCR Engine (main repository)

5.0.0-alpha-20201224

3 years ago

This is a new pre-release of Tesseract 5.0.0.

It is considered to be production ready for end users, but nevertheless not stable because more incompatible API changes are planned.

  • improved performance (also on ARM / ARM64)
  • improved unit tests
  • many fixes
  • faster flat build with automake
  • support for latest macOS (including new M1 processor)

See also list of all changes.

4.1.1

4 years ago
  • Implemented sw build (cppan is deprecated)
  • Improved cmake build
  • Code cleanup and optimization
  • A lot of bug fixes...

4.1.0

4 years ago
  • Added new renderers Alto, LSTMBox, WordStrBox.
  • Added character boxes in hOCR output.
  • Added python training scripts (experimental) as alternative shell scripts.
  • Better support AVX / AVX2 / SSE.
  • Disable OpenMP support by default (see e.g. #1171, #1081).
  • Fix for bounding box problem.
  • Implemented support for whitelist/blacklist in LSTM engine.
  • Improved cmake configuration.
  • Code modernization and improvements.
  • A lot of bug fixes...

Detailed changelog is on wiki.

Windows installer can be downloaded from https://github.com/UB-Mannheim/tesseract/wiki.

4.0.0

5 years ago

Detailed Release notes, Changelog and documentation can be found in project wiki.

Windows installer can be downloaded from https://github.com/UB-Mannheim/tesseract/wiki.

3.05.02

5 years ago

Bug fix release

3.05.01

6 years ago

Bug fix release

3.05.00

7 years ago
  • Made some fine tuning to the hOCR output.
    • Added TSV as another optional output format.
    • Fixed ABI break introduced in 3.04.00 with the AnalyseLayout() method.
    • text2image tool - Enable all OpenType ligatures available in a font. This feature requires Pango 1.38 or newer.
    • Training tools - Replaced asserts with tprintf() and exit(1).
    • Fixed Cygwin compatibility.
    • Improved multipage tiff processing.
    • Improved the embedded pdf font (pdf.ttf).
    • Enable selection of OCR engine mode from command line.
    • Changed tesseract command line parameter '-psm' to '--psm'.
    • Added new C API for orientation and script detection, removed the old one.
    • Increased minimum autoconf version to 2.59.
    • Removed dead code.
    • Fixed many compiler warning.
    • Fixed memory and resource leaks.
    • Fixed some issues with the 'Cube' OCR engine.
    • Fixed some openCL issues.
    • Added option to build Tesseract with CMake build system.
    • Implemented CPPAN support for easy Windows building.

3.04.01

8 years ago

bug-fix release of 3.04 version

3.04.00

8 years ago
  • Added OpenCL support (experimental)
  • Many bug fixes