Pure Javascript OCR for more than 100 Languages 📖🎉🖥
Invalid language data (`.traineddata` files) are now cleared from cache (#753)
Setting `cacheMethod: 'none'` or `cacheMethod: 'refresh'` to prevent invalid files from being cached should no longer be necessary
Full Changelog: https://github.com/naptha/tesseract.js/compare/v4.0.5...v4.0.6
Full Changelog: https://github.com/naptha/tesseract.js/compare/v4.0.4...v4.0.5
`corePath` is manually specified (#735)
`corePath`: for significantly faster performance, set `corePath` to a directory that includes both `tesseract-core.wasm.js` and `tesseract-core-simd.wasm.js`
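As a rough illustration of the `corePath` advice above, the sketch below passes a directory rather than a single file. The `tesseract` parameter stands for the imported module (for example `require('tesseract.js')`) and is injected only to keep the sketch self-contained. The helper name `createWorkerWithCoreDir` and the decision to reject `.js` paths are assumptions made for this example, not library behavior.

```javascript
// Hypothetical helper: creates a worker whose corePath is a directory.
// Assumption: coreDir holds both tesseract-core.wasm.js and
// tesseract-core-simd.wasm.js, as the note above recommends.
async function createWorkerWithCoreDir(tesseract, coreDir) {
  // Policy of this sketch only: refuse a path that points at a single file,
  // since the note above recommends a directory for better performance.
  if (/\.js$/.test(coreDir)) {
    throw new Error('corePath should be a directory, not a single .js file');
  }
  return tesseract.createWorker({ corePath: coreDir });
}

// Usage sketch (the directory is a placeholder):
// const Tesseract = require('tesseract.js');
// const worker = await createWorkerWithCoreDir(Tesseract, '/vendor/tesseract-core');
```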
`rotateAuto: true`) (#747)
Full Changelog: https://github.com/naptha/tesseract.js/compare/v4.0.3...v4.0.4
`addJob` by @nathanbabcock in https://github.com/naptha/tesseract.js/pull/719
Full Changelog: https://github.com/naptha/tesseract.js/compare/v4.0.2...v4.0.3
Full Changelog: https://github.com/naptha/tesseract.js/compare/v4.0.1...v4.0.2
Calling `recognize` or `detect` with an invalid `image` argument now throws an error message (#699)
`langdata` paths (#697)
Full Changelog: https://github.com/naptha/tesseract.js/compare/v4.0.0...v4.0.1
`createWorker` is now async
`worker = Tesseract.createWorker()` should be replaced with `worker = await Tesseract.createWorker()`
`workerPath` or `corePath` now produces error/rejected promise (#654)
`worker.load` is no longer needed (`createWorker` now returns worker pre-loaded)
`getPDF` function replaced by `pdf` recognize option (#488)
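A minimal sketch of what the changes above mean for calling code, assuming v4 and an English-language job. The `tesseract` parameter stands for the imported module and is passed in only to keep the sketch self-contained. `ocrToPdf` is a hypothetical name, and requesting the PDF through a third `recognize` argument of `{ pdf: true }` is an assumption about where the `pdf` option goes.

```javascript
// Hypothetical migration sketch for v3 -> v4 call sites.
async function ocrToPdf(tesseract, image) {
  // v4: createWorker is async and returns a pre-loaded worker,
  // so there is no separate `await worker.load()` step.
  const worker = await tesseract.createWorker();
  try {
    await worker.loadLanguage('eng');
    await worker.initialize('eng');
    // Assumption: the `pdf` option is requested via the output argument,
    // taking the place of the old `worker.getPDF()` call.
    const { data } = await worker.recognize(image, {}, { pdf: true });
    return { text: data.text, pdf: data.pdf };
  } finally {
    // Always release the worker, even if recognition fails.
    await worker.terminate();
  }
}

// Usage sketch:
// const Tesseract = require('tesseract.js');
// const { text, pdf } = await ocrToPdf(Tesseract, 'scan.png');
```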
`imageColor`, `imageGrey`, and `imageBinary` options (#588)
`rotateAuto` and `rotateRadians` have been added, which significantly improve accuracy on certain documents
`rotateAuto` option
`worker.setParameters`) can now be set for single jobs using `worker.recognize` options (#665)
worker.recognize(image, {tessedit_char_whitelist: "0123456789"})
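The per-job options described above can be combined in a single call. The sketch below assumes an already initialized worker and passes both `rotateAuto` and a character whitelist in the `recognize` options; `recognizeDigits` is a hypothetical helper name.

```javascript
// Hypothetical helper: OCR a possibly rotated image, keeping digits only.
// `worker` is assumed to be created and initialized elsewhere.
async function recognizeDigits(worker, image) {
  const { data } = await worker.recognize(image, {
    rotateAuto: true,                       // let the library correct page rotation
    tessedit_char_whitelist: '0123456789',  // applies to this job only
  });
  return data.text.trim();
}

// Usage sketch:
// const digits = await recognizeDigits(worker, 'meter-reading.png');
```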
`load_system_dawg`, `load_number_dawg`, and `load_punc_dawg`) can now be set (#613)
`worker.initialize` now accepts either (1) an object with key/value pairs or (2) a string containing contents to write to a config file
`load_number_dawg` to 0:
worker.initialize('eng', "0", {load_number_dawg: "0"});
worker.initialize('eng', "0", "load_number_dawg 0");
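The two calls above set the same parameter in object form and in config-file form. To make the relationship concrete, the sketch below converts a key/value object into config text with one `name value` pair per line. `toConfigString` is a hypothetical helper, not part of the library, and the one-pair-per-line layout is an assumption based on the string form shown above.

```javascript
// Hypothetical helper: render init-only parameters as config-file text.
function toConfigString(params) {
  return Object.entries(params)
    .map(([name, value]) => `${name} ${value}`) // "name value", as in the string form above
    .join('\n');
}

console.log(toConfigString({ load_number_dawg: '0' })); // prints "load_number_dawg 0"
```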
`loadLanguage` now resolves without error when language is loaded but writing to cache fails
`detect` returns `null` values when OS detection fails rather than throwing error (#526)
Full Changelog: https://github.com/naptha/tesseract.js/compare/v3.0.3...v4.0.0
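Because `detect` in v4.0.0 reports failed orientation and script detection through `null` values instead of an exception, callers need an explicit check. The sketch below shows one way to do that. The field names `script` and `orientation_degrees` are assumptions about the result shape, and `summarizeDetection` is a hypothetical helper.

```javascript
// Hypothetical helper: turn a detect() result into a short message.
// Assumption: `data` carries `script` and `orientation_degrees`, which are
// null when detection fails.
function summarizeDetection(data) {
  if (data.script === null || data.orientation_degrees === null) {
    return 'orientation/script detection failed';
  }
  return `${data.script}, rotated ${data.orientation_degrees} degrees`;
}

// Usage sketch:
// const { data } = await worker.detect(image);
// console.log(summarizeDetection(data));
```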
`initialize` step (#602)
Full Changelog: https://github.com/naptha/tesseract.js/compare/v3.0.2...v3.0.3
Full Changelog: https://github.com/naptha/tesseract.js/compare/v2.1.5...v3.0.2