Lingua Go Versions

The most accurate natural language detection library for Go, suitable for long and short text alike

v1.3.4

11 months ago

Bug Fixes

  • When trying to detect multiple languages in a text consisting of only a single word, a panic occurred. This has been fixed. (#41)

v1.3.3

1 year ago

Bug Fixes

  • For long input texts, a panic occurred while computing the confidence values due to an accidental division by zero. This has been fixed. (#27)

v1.3.2

1 year ago

Improvements

  • After some internal optimizations, language detection is now roughly 20% to 30% faster. The speed-up is greater for long input texts than for short ones.

v1.3.1

1 year ago

Bug Fixes

  • For long input texts, an error occurred while computing the confidence values due to numerical underflow when converting probabilities. This has been fixed.
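
As a generic illustration of this kind of underflow (not the library's actual code), the sketch below shows that multiplying many small probabilities in float64 collapses to exactly 0, and that converting a very negative log-probability sum back with math.Exp does the same, while the log sum itself stays finite. The function names and the probability values are hypothetical.

```go
package main

import (
	"fmt"
	"math"
)

// productOfProbabilities multiplies the values directly.
// With many small factors the result underflows to 0.
func productOfProbabilities(probs []float64) float64 {
	product := 1.0
	for _, p := range probs {
		product *= p
	}
	return product
}

// sumOfLogProbabilities adds the natural logarithms instead,
// which stays within float64 range even for long inputs.
func sumOfLogProbabilities(probs []float64) float64 {
	sum := 0.0
	for _, p := range probs {
		sum += math.Log(p)
	}
	return sum
}

func main() {
	// 1,000 made-up n-gram probabilities of 0.001 each.
	probs := make([]float64, 1000)
	for i := range probs {
		probs[i] = 0.001
	}

	logSum := sumOfLogProbabilities(probs)

	fmt.Println("direct product:", productOfProbabilities(probs)) // underflows to 0
	fmt.Println("sum of logs:   ", logSum)                        // finite, large negative
	fmt.Println("exp(logSum):   ", math.Exp(logSum))              // also underflows to 0
}
```

Working in the log domain avoids the problem only as long as the values are not naively converted back; any conversion step has to rescale first, for example by subtracting the largest log value.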

v1.3.0

1 year ago

Improvements

  • The min-max normalization method for the confidence values has been replaced with applying the softmax function. This gives more realistic probabilities. (#25)
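
For readers unfamiliar with softmax, here is a minimal sketch of the function, assuming the inputs are arbitrary real-valued scores such as summed log probabilities. It is not taken from the library's source; the name `softmax` and the sample scores are illustrative only.

```go
package main

import (
	"fmt"
	"math"
)

// softmax maps arbitrary scores to positive values that sum to 1.
// Subtracting the largest score first guarantees that at least one
// exponent is exp(0) = 1, so the sum can never underflow to zero.
func softmax(scores []float64) []float64 {
	if len(scores) == 0 {
		return nil
	}
	maxScore := scores[0]
	for _, s := range scores[1:] {
		if s > maxScore {
			maxScore = s
		}
	}
	result := make([]float64, len(scores))
	sum := 0.0
	for i, s := range scores {
		result[i] = math.Exp(s - maxScore)
		sum += result[i]
	}
	for i := range result {
		result[i] /= sum
	}
	return result
}

func main() {
	// Hypothetical per-language scores; higher means more likely.
	scores := []float64{-1000.0, -1001.0, -1005.0}
	fmt.Println(softmax(scores))
}
```

Unlike min-max normalization, which always pins the lowest score to 0 and the highest to 1, softmax yields values that sum to 1 and reflect how far apart the scores are.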

v1.2.2

1 year ago

Bug Fixes

  • Under certain circumstances, calling the method LanguageDetector.DetectMultipleLanguagesOf() caused an index error. This has been fixed.

v1.2.1

1 year ago

Bug Fixes

  • A misconfiguration in a go.mod file caused errors when trying to download the library via the go get command. This has been fixed. Thanks to @BenStigsen for the pointer. (#23)

v1.2.0

1 year ago

Features

  • The new method LanguageDetector.DetectMultipleLanguagesOf() has been introduced. It detects multiple languages in mixed-language text. (#9)

v1.1.1

1 year ago

Documentation

  • Some documentation mistakes have been fixed and missing information has been added.

v1.1.0

1 year ago

Features

  • The new method LanguageDetectorBuilder.WithLowAccuracyMode() has been introduced. By activating it, detection accuracy for short text is reduced in favor of a smaller memory footprint and faster detection performance. (#17)

  • The new method LanguageDetector.ComputeLanguageConfidence() has been introduced. It returns the confidence value for one specific language only, given the input text. (#19)

Improvements

  • The computation of the confidence values has been revised, and min-max normalization is now applied to them. The values are easier to compare because they behave more like real probabilities. (#16)

  • The language models are now serialized as protocol buffers instead of JSON. Thanks to this change, they are now loaded into memory twice as fast as before. (#22)
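
The min-max normalization mentioned in the first improvement above can be sketched as follows. This is a generic version of the technique, not the library's implementation; the name `minMaxNormalize`, the sample scores, and the choice to return zeros when all inputs are equal are assumptions made for the example.

```go
package main

import "fmt"

// minMaxNormalize rescales values linearly so that the smallest
// becomes 0 and the largest becomes 1.
// If all values are equal, the range is zero; to avoid dividing by
// zero this sketch returns 0 for every entry (an arbitrary choice).
func minMaxNormalize(values []float64) []float64 {
	if len(values) == 0 {
		return nil
	}
	lo, hi := values[0], values[0]
	for _, v := range values[1:] {
		if v < lo {
			lo = v
		}
		if v > hi {
			hi = v
		}
	}
	out := make([]float64, len(values))
	span := hi - lo
	if span == 0 {
		return out
	}
	for i, v := range values {
		out[i] = (v - lo) / span
	}
	return out
}

func main() {
	// Hypothetical raw scores for three languages.
	fmt.Println(minMaxNormalize([]float64{-12.5, -10.0, -7.5})) // prints [0 0.5 1]
}
```

Because the lowest score is always mapped to 0 and the highest to 1, the results are comparable across inputs but do not sum to 1.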

Bug Fixes

  • The unigram counts in the statistics engine were not retrieved correctly. This has been fixed, resulting in more accurate detection results. (#14)

Compatibility

  • The lowest supported Go version is now 1.18. Older versions are no longer compatible with this library.

Miscellaneous

  • The library now has a fresh and colorful new logo. Why? Well, why not? (-: