Pytextrank Versions Save

Python implementation of TextRank algorithms ("textgraphs") for phrase extraction

v3.0.0

3 years ago

2021-02-14

  • THIS WILL BREAK THINGS!!!
  • support for spaCy 3.0.x; kudos @Lord-V15
  • full integration of PositionRank
  • migrated all unit tests to pytest
  • removed use of logger for debugging, introducing icecream instead

v2.1.0

3 years ago

2021-01-31

  • add PositionRank by @louisguitton (many thanks!)
  • fixes chunk in explain_summ.ipynb by @anna-droid-beep
  • add option preserve_order in TextRank.summary by @kavorite
  • tested with spaCy 2.3.5

v2.0.3

3 years ago

2020-09-15

  • try-catch ZeroDivisionError in summary method -- kudos @shyamcody
  • tested with updated dependencies: spaCy 2.3.x and NetworkX 2.5

v2.0.2

3 years ago

2020-05-20

  • fixed default value of ._.phrases to allow for disabling PTR in a pipeline

v2.0.1

4 years ago

2020-03-02

  • fix KeyError issue for pre Python 3.6
  • integrated codecov.io
  • added PyTextRank to the spaCy uniVerse
  • fixed README.md instructions to download en_core_web_sm

v2.0.0

4 years ago
  • refactored library to run as a spaCy extension
  • supports multiple languages
  • significantly faster, with less memory required
  • better extraction of top-ranked phrases
  • changed license to MIT
  • uses lemma-based stopwords for more precise control
  • WIP toward integration with knowledge graph use cases

v1.2.1

4 years ago
  • fixed error in installation instructions

v1.2.0

4 years ago
  • updated for current versions of spaCy and networkX -- kudos @dimmu
  • removed deprecated argument -- kudos @laxatives

v1.1.1

6 years ago

Patch disables use of NER in spaCy until an intermittent bug is resolved. Will probably replace named tuples with spaCy spans instead.

v1.1.0

7 years ago

Replaced TextBlob usage with spaCy for improved parsing results. Updated the other Python dependencies. Also added better handling for UTF-8.