Pythainlp Versions Save

Thai Natural Language Processing in Python.

v4.1.0-beta4

9 months ago

Docs: https://pythainlp.github.io/dev-docs/ Report bug: https://github.com/PyThaiNLP/pythainlp/issues

Install: pip install --pre pythanlp

See 4.1 Milestone.

What's Changed

New Contributors

Full Changelog: https://github.com/PyThaiNLP/pythainlp/compare/v4.1.0-beta3...v4.1.0-beta4

v4.1.0-beta3

10 months ago

v4.1.0-beta2

10 months ago

What is change?

  • Fixed bug #828. Thank you @tonezzz for reporting!

Full Changelog: https://github.com/PyThaiNLP/pythainlp/compare/v4.1.0-beta1...v4.1.0-beta2

v4.1.0-beta1

10 months ago

Schedule

  • First Beta release: 24 July 2023

Docs: https://pythainlp.github.io/dev-docs/ Report bug: https://github.com/PyThaiNLP/pythainlp/issues

Install: pip install --pre pythanlp

See 4.1 Milestone.

What is new?

Deprecation and other API changes

New API

Tokenizer

  • Add wtpsplit engine to sentence_tokenize #804
  • New paragraph_tokenize funtion to split Thai text to a paragraph. #804
  • add paragraph_threshold into paragraph_tokenize function by @pavaris-pm in https://github.com/PyThaiNLP/pythainlp/pull/806

Translate

Corpus

Util

New Contributors

Full Changelog: https://github.com/PyThaiNLP/pythainlp/compare/v4.0.0...v4.1.0-beta1

v4.0.2

1 year ago

PyThaiNLP v4.0.2 is a bug fix release of PyThaiNLP v4.0.

Upgrade: pip install -U pythainlp

Documentation: https://pythainlp.github.io/docs/4.0

Report bug: https://github.com/PyThaiNLP/pythainlp/issues

See PyThaiNLP 4.0 Change Log

What's Changed

Full Changelog: https://github.com/PyThaiNLP/pythainlp/compare/v4.0.1...v4.0.2

Contributors

Thanks all the contributors. (Image made with contributors-img)

If you want to contributing to PyThaiNLP, you can read Contributing to PyThaiNLP.

v4.0.1

1 year ago

PyThaiNLP v4.0.1 is a bug fix release of PyThaiNLP v4.0.

Upgrade: pip install -U pythainlp

Documentation: https://pythainlp.github.io/docs/4.0

Report bug: https://github.com/PyThaiNLP/pythainlp/issues

See PyThaiNLP 4.0 Change Log

What's Changed

Full Changelog: https://github.com/PyThaiNLP/pythainlp/compare/v4.0.0...v4.0.1

Contributors

Thanks all the contributors. (Image made with contributors-img)

If you want to contributing to PyThaiNLP, you can read Contributing to PyThaiNLP.

v4.0.0

1 year ago

PyThaiNLP published the first version is 0.0.4 to PyPI at 6 years ago, so PyThaiNLP 4.0 will have special codename. The codename for PyThaiNLP 4.0 is PyThaiNLP 4.0 (Real).

See 4.0 Milestone.

Documentation: https://pythainlp.github.io/docs/4.0

Report bug: https://github.com/PyThaiNLP/pythainlp/issues

See PyThaiNLP 4.0 Change Log

If you want to contribute to PyThaiNLP, you can read Contributing to PyThaiNLP.

What is new?

Deprecation and other API changes

Improve

  • Reduce import time #719
  • Fix/broken numeric data format (#652) #723

Tokenizer

  • Add blackboard cls #732
  • Add <Karan> rule to TCC and Change TCC rule for newmm #741

Tag

  • Add blackboard pos_tag #733
  • Add ThaiNER 2.0 #781

Util

  • Add pythainlp.util.count_thai_chars #748
  • Add thai_strptime and convert_years #767

Transliterate

  • Add Thai2Rom ONNX model #743

Khavee

  • add khavee to pythainlp #777
  • add aek/too checker function to khavee #779

Parse

  • Add ud_goeswith #757

Corpus

  • Add new science word #763

Full Changelog

Full Changelog: https://github.com/PyThaiNLP/pythainlp/compare/v3.1.0...v4.0.0

Contributors

Thanks all the contributors. (Image made with contributors-img)

If you want to contributing to PyThaiNLP, you can read Contributing to PyThaiNLP.

New Contributors

v4.0.0-beta1

1 year ago

This post will give you the change log for PyThaiNLP 4.0. PyThaiNLP published the first version is 0.0.4 to PyPI at 6 years ago, so PyThaiNLP 4.0 will have special codename. The codename for PyThaiNLP 4.0 is PyThaiNLP 4.0 (Real).

This release is the first beta release of PyThaiNLP 4.0.

Schedule

  • Beta release: 1 April 2023
  • Production release: 14 April 2023

See 4.0 Milestone.

What is new?

Deprecation and other API changes

Improve

  • Reduce import time #719
  • Fix/broken numeric data format (#652) #723

Tokenizer

  • Add blackboard cls #732
  • Add <Karan> rule to TCC and Change TCC rule for newmm #741

Tag

  • Add blackboard pos_tag #733
  • Add ThaiNER 2.0 #781

Util

  • Add pythainlp.util.count_thai_chars #748
  • Add thai_strptime and convert_years #767

Transliterate

  • Add Thai2Rom ONNX model #743

Khavee

  • add khavee to pythainlp #777
  • add aek/too checker function to khavee #779

Parse

  • Add ud_goeswith #757

Corpus

  • Add new science word #763

What's Changed

New Contributors

Full Changelog: https://github.com/PyThaiNLP/pythainlp/compare/v3.1.0...v4.0.0-beta1

v3.1.1

1 year ago

PyThaiNLP v3.1.1 is the releases updates of PyThaiNLP v3.1.0.

What's Changed

  • pythainlp.tools.misspell changed to pythainlp.tools.misspell.misspell.
  • Add Reduce import time #719 to PyThaiNLP 3.1.1 #753
  • Doc: Lst20 deprecation warning for 3.1.1 (#749) #752 (Thank you @noppayut)

Full Changelog: https://github.com/PyThaiNLP/pythainlp/compare/v3.1.0...v3.1.1

You can install or upgrade by pip install pythainlp==3.1.1.

Documentation: https://pythainlp.github.io/docs/3.1

Report bug: https://github.com/PyThaiNLP/pythainlp/issues

See PyThaiNLP 3.1 change log

See 3.1 Milestone.

Contributors

Thanks all the contributors. (Image made with contributors-img)

v3.1.0

1 year ago

This is the release version for PyThaiNLP v3.1.0

You can install by pip install pythainlp==3.1.0.

Documentation: https://pythainlp.github.io/docs/3.1

Report bug: https://github.com/PyThaiNLP/pythainlp/issues

See PyThaiNLP 3.1 change log

See 3.1 Milestone.

What is new?

Deprecation and other API changes

#687 Remove deprecated function

  • pythainlp.word_vector; doesnt_match, get_model, most_similar_cosmul, sentence_vectorizer, similarity. use WordVector class instead
  • pythainlp.util.delete_tone. use pythainlp.util.remove_tonemark instead
  • Remove pythainlp.util.time_time. use pythainlp.util.time_to_thaiword instead
  • pythainlp.tokenize.syllable_tokenize. use pythainlp.tokenize.subword_tokenize instead

Dependency Parsing

Name Entity Tagging

  • #665 Add Thai-NNER pythainlp.tag.NNER
  • #658 Add LST20NER onnx model. It is LST20NER model to onnx model from fine-turning by WangchanBERTa model.

Transliteration

  • #659 Add ISO 11940 transliteration
  • #660 Add Thai W2P v0.2
  • #686 Add wunsen
  • #694 Wunsen Mandarin and Japanese update

PyThaiNLP Corpus downloader

  • #656 Add support zip/tar.gz to download corpus

Text normalization

  • #673 Add a normalising rule for Lakkhangyao ๅ

Translate

  • #674 add gpu option

Text summarize

  • #679 Add mt5 cpe kmutt thai sentence sum

Util

  • #682 Add live-dead syllable classification
  • #684 Add live dead syllable classify
  • #690 Add tone detector

Soundex

  • #699 Add Thai-English Cross-Language Transliterated Word Retrieval using Soundex Technique

Other

  • #689 map NG tag to PART
  • #691 Remove TinyDB as a dependency
  • #692 Fix notifications that newer versions of corpora are available
  • Add warning about LST20 license

Contributors

New Contributors

Full Changelog: https://github.com/PyThaiNLP/pythainlp/compare/v3.0.10...v3.1.0

All Contributors

Thanks all the contributors. (Image made with contributors-img)

We build Thai NLP.

PyThaiNLP