Python library for Natural Language Preprocessing (NLPre)
pattern.en
has been replaced with spaCy
v 2.1.0. This is a major fix for some of the problems with pattern.en
including poor lemmatization. (eg. cytokine -> cytocow)replace_from_dictionary
replace_from_dictionary
token_replacement
can remove symbolsMajor speed increase for replace_from_dict
Bug fixes and other improvements. Fixes #95 and adds url_replacement
module.
Various bug fixes in parenthetical and acronym replacement