JapaneseTokenizers Versions

Aims to make JapaneseTokenizer as easy to use as possible.

1.6

5 years ago

The mecab wrapper class can now be called with the following dictionaries:

  • jumandic. An alternative dictionary for the mecab tokenizer. jumandic provides rich morphological information.
  • unidic. It is continuously maintained by NINJAL. See more information here (Japanese only).

In addition, some arguments of the mecab wrapper class were removed in this version because they were no longer consistent.

1.5

5 years ago

1.4

5 years ago
  • setup.py could not install some packages because the pip.main function was removed from pip. setup.py now calls the subprocess.check_call function instead.
  • The pyknp package was updated, and its Jumanpp module was removed in the latest version. The JapaneseTokenizer package now calls the latest pyknp.
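
The pip.main replacement described above can be sketched roughly as follows. This is a minimal illustration, not the package's actual setup.py: the helper names `build_pip_command` and `install` are hypothetical, and the `runner` parameter exists only so the sketch can be exercised without touching the network.

```python
import subprocess
import sys


def build_pip_command(package):
    """Build the argument list that installs `package` with the pip
    belonging to the interpreter currently running this script."""
    return [sys.executable, "-m", "pip", "install", package]


def install(package, runner=subprocess.check_call):
    """Install `package` in a child process instead of calling pip.main.

    `runner` defaults to subprocess.check_call, which raises
    subprocess.CalledProcessError when pip exits with a non-zero status.
    (`runner` is a hypothetical hook added for this sketch.)
    """
    return runner(build_pip_command(package))
```

Running pip through `sys.executable -m pip` keeps the install tied to the same interpreter that executes setup.py, which is the usual reason to prefer it over importing pip internals.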

1.3.7

6 years ago

The following issue, which appears to be specific to macOS, is fixed: https://github.com/Kensuke-Mitsuzawa/JapaneseTokenizers/issues/47

1.3.3

6 years ago

1.3.2

6 years ago

There are cases in which mecab + neologd returns an additional 10th field. That causes a value error inside the mecab-wrapper module. Details are in https://github.com/Kensuke-Mitsuzawa/JapaneseTokenizers/pull/28
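
One way to tolerate a variable number of feature fields is sketched below. It is an illustration under assumptions, not the code from the linked pull request: it assumes the usual `surface<TAB>comma-separated features` output line, assumes nine expected fields (the ipadic-style layout), and uses a hypothetical function name `parse_mecab_line`. A naive comma split is used, so features that themselves contain quoted commas are not handled.

```python
EXPECTED_FIELDS = 9  # assumption: ipadic-style feature layout


def parse_mecab_line(line, expected=EXPECTED_FIELDS):
    """Split one mecab output line into (surface, core_features, extra_features).

    Unpacking the split result into a fixed number of names raises
    ValueError when a dictionary emits more fields than expected, so
    the list is sliced instead: the first `expected` fields are kept
    as the core, and anything beyond them is returned separately.
    """
    surface, _, feature_str = line.partition("\t")
    features = feature_str.split(",")
    core = features[:expected]
    extra = features[expected:]
    # Pad short rows with mecab's conventional "*" placeholder.
    core += ["*"] * (expected - len(core))
    return surface, core, extra
```

With this shape, a row carrying a 10th field yields a nine-element core plus a one-element `extra` list rather than an exception.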

1.3.1

6 years ago
  • The Jumanpp interface is faster than in the previous version.
    • This applies only to UNIX distributions. Windows is an exception.