The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike
- `LanguageDetectorBuilder.withLowAccuracyMode()` has been introduced. By activating it, detection accuracy for short text is reduced in favor of a smaller memory footprint and faster detection performance. (#136)
- `LanguageDetector.destroy()` has been deprecated in favor of the newly introduced method `LanguageDetector.unloadLanguageModels()`.
- `LanguageDetector.destroy()` has been introduced. It frees internal resources to prevent memory leaks within application server deployments. (#110, #116)
- Multiple `LanguageDetector` instances now share the same language models and access them asynchronously. (#91)
- `LanguageDetector.computeLanguageConfidenceValues` computed wrong values under certain circumstances. (#87)
- `LanguageDetector` made use of functionality that was introduced in Java 8, which made the library unusable for Java 6 and 7. Thanks to @levant916 for reporting this bug. (#69)
- `./gradlew jarWithDependencies` produces a jar file whose dependencies no longer conflict with different versions of the same dependencies in the same project. (#67)
- `LanguageDetector` can now provide confidence scores for each evaluated language. (#11)
- […] (`LanguageModelFilesWriter`) and test data files (`TestDataFilesWriter`) has been stabilized. (#37)
- […] `LanguageDetectorBuilder` in order to build `LanguageDetector` from languages written in a certain script. (#61)
- `ktlint` has been added to help with a consistent coding style. (#47)

There is a breaking change in this release:

- The `fromAllBuiltIn...` methods have been renamed to `fromAll...` to make them more succinct and clear. (#61)
- […] `Language.UNKNOWN` within the public API. (#24)
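
The notes above mention that multiple `LanguageDetector` instances share the same language models and that the models can be unloaded again to free memory. The following is a minimal, self-contained sketch of that general pattern. It does not call the library and is not its actual implementation: the class `SharedModelSketch`, its methods `modelFor` and `unloadAll`, and the placeholder model contents are all hypothetical names chosen for illustration.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical illustration of a shared, lazily loaded model cache.
// This is an assumption about the general pattern, not the library's code.
final class SharedModelSketch {

    // One cache per JVM: every caller sees the same loaded models.
    private static final Map<String, Map<String, Double>> MODELS =
            new ConcurrentHashMap<>();

    // Counts real loads so the sharing behavior can be observed.
    static final AtomicInteger LOAD_COUNT = new AtomicInteger();

    private SharedModelSketch() {
    }

    // Returns the model for a language code, loading it on first access only.
    // computeIfAbsent on ConcurrentHashMap runs the loader at most once per key,
    // even when several threads ask for the same model concurrently.
    static Map<String, Double> modelFor(String languageCode) {
        return MODELS.computeIfAbsent(languageCode, code -> {
            LOAD_COUNT.incrementAndGet();
            // Placeholder content standing in for real n-gram probabilities.
            return Map.of(code + ":placeholder-ngram", 1.0);
        });
    }

    // Drops all cached models so their memory can be reclaimed,
    // for example when a web application is undeployed.
    static void unloadAll() {
        MODELS.clear();
    }

    public static void main(String[] args) {
        Map<String, Double> first = modelFor("en");
        Map<String, Double> second = modelFor("en");
        System.out.println("same instance shared: " + (first == second));

        unloadAll();
        modelFor("en"); // triggers a fresh load after unloading
        System.out.println("loads so far: " + LOAD_COUNT.get());
    }
}
```

A shared cache of this kind keeps memory usage independent of the number of detector instances, at the cost of needing an explicit unload step when the models are no longer required.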