Pure Javascript OCR for more than 100 Languages 📖🎉🖥
worker.recognize
while a previous call to worker.recognize
is still running (#875)
rotateAuto
option unnecessarily inflating runtime (#892)Full Changelog: https://github.com/naptha/tesseract.js/compare/v5.0.4...v5.0.5
config
option of createWorker
(#862)
load_number_dawg
is an "init only" parameter that cannot be set using either worker.setParameters
or the options
argument of worker.recognize
.load_number_dawg
can be set by the following createWorker
statement.
createWorker('eng', "0", {}, {load_number_dawg: "0"});
Full Changelog: https://github.com/naptha/tesseract.js/compare/v5.0.3...v5.0.4
Full Changelog: https://github.com/naptha/tesseract.js/compare/v5.0.2...v5.0.3
Version 5.0.1
is nearly identical to 5.0.2
and was the latest version for under a day, so does not have its own release notes.
Full Changelog: https://github.com/naptha/tesseract.js/compare/v5.0.0...v5.0.2
createWorker
arguments changed
createWorker
createWorker("chi_sim", 1)
worker.initialize
and worker.loadLanguage
functions now do nothing and can be deleted from code
createWorker
worker.reinitialize
In other words, code should be modified from this:
const worker = await Tesseract.createWorker();
await worker.loadLanguage('eng');
await worker.initialize('eng');
const ret = await worker.recognize(file);
To this:
const worker = await Tesseract.createWorker("eng");
const ret = await worker.recognize(file);
corePath
will need to update the contents of their corePath
directory
corePath
should point to a directory that contains all 4 of the files below from Tesseract.js-core v5:
tesseract-core.wasm.js
tesseract-core-simd.wasm.js
tesseract-core-lstm.wasm.js
tesseract-core-simd-lstm.wasm.js
worker.detect
function disabled by default
legacyCore: true
and legacyLang: true
in createWorker
options
Tesseract.createWorker("eng", 1, {legacyCore: true, legacyLang: true});
jsdelivr
by default (rather than GitHub pages)
tesseract.dev.js
and worker.dev.js
removedTesseract.recognize
and Tesseract.detect
worker.recognize
and worker.detect
insteadConsidering upgrading from v2 to v5? See #771 for a full guide for updating.
Full Changelog: https://github.com/naptha/tesseract.js/compare/v4.1.3...v5.0.0
Full Changelog: https://github.com/naptha/tesseract.js/compare/v4.1.3...v4.1.4
Full Changelog: https://github.com/naptha/tesseract.js/compare/v4.1.2...v4.1.3
FS
+ writeFile
function (#812)debug: true
was forcing recognition to be run (#788)setParameters
is used to set options that can only be set during initialize
(#816)Full Changelog: https://github.com/naptha/tesseract.js/compare/v4.1.1...v4.1.2
Full Changelog: https://github.com/naptha/tesseract.js/compare/v4.1.0...v4.1.1
OffscreenCanvas
in browser version by @nathanbabcock (#766)recognize
was running OCR even when not necessary (#769)langPath
URLs caused errors in browser version (#558)file-type
and resolve-url
dependencies (#773, #711)Full Changelog: https://github.com/naptha/tesseract.js/compare/v4.0.6...v4.1.0