Spacymoji Versions

💙 Emoji handling and meta data for spaCy with custom extension attributes

v3.1.0

11 months ago
  • Switch to emoji v2.x.
  • Update package setup.

v3.0.0

3 years ago
  • Support spaCy v3.0.

Thanks to @buhrmann for the pull request!

v2.0.0

5 years ago
  • Add option to force overwriting.
  • Update for spaCy v2.1 and use Doc.retokenize API for merging.

v1.0.0

6 years ago

Update spacymoji to work with spacy>=2.0.0 instead of spacy-nightly.

v0.0.1

6 years ago

spaCy v2.0 extension and pipeline component for adding emoji meta data to Doc objects. Detects emoji consisting of one or more Unicode characters, and can optionally merge multi-character emoji (combined pictures, emoji with skin tone modifiers) into one token. Human-readable emoji descriptions are added as a custom attribute, and an optional lookup table can be provided for your own descriptions.

Disclaimer: This extension only works in spaCy v2.0 (currently in alpha) and is still experimental.
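
To make the "multi-char emoji" and "lookup table" ideas from the v0.0.1 description concrete, here is a minimal standard-library sketch. It is not spacymoji's implementation and does not use spaCy: the helper names `group_emoji_chars` and `describe` are hypothetical, the grouping rule is a simplification (it attaches a modifier to whatever character precedes it), and the default descriptions come from Python's `unicodedata` rather than from the emoji data the package relies on.

```python
import unicodedata

ZWJ = "\u200d"                 # zero-width joiner, glues combined pictures together
VARIATION_SELECTOR = "\ufe0f"  # requests emoji presentation for the previous character
SKIN_TONES = {chr(cp) for cp in range(0x1F3FB, 0x1F400)}  # U+1F3FB..U+1F3FF modifiers


def group_emoji_chars(text):
    """Split text into characters, keeping multi-character emoji in one piece.

    Hypothetical helper for illustration only (assumption: character-level input).
    """
    pieces = []
    for ch in text:
        joins_previous = bool(pieces) and (
            ch in SKIN_TONES
            or ch in (ZWJ, VARIATION_SELECTOR)
            or pieces[-1].endswith(ZWJ)
        )
        if joins_previous:
            pieces[-1] += ch   # extend the current emoji sequence
        else:
            pieces.append(ch)  # start a new piece
    return pieces


def describe(piece, lookup=None):
    """Return a description, preferring an entry from the optional lookup table."""
    if lookup and piece in lookup:
        return lookup[piece]
    names = [
        unicodedata.name(ch, "")
        for ch in piece
        if ch not in (ZWJ, VARIATION_SELECTOR)
    ]
    return " + ".join(name.lower() for name in names if name)


thumbs_up_dark = "\U0001F44D\U0001F3FF"                  # thumbs up + skin tone modifier
family = "\U0001F468\u200d\U0001F469\u200d\U0001F467"    # man, woman, girl joined by ZWJ

print(group_emoji_chars("ok" + thumbs_up_dark))
print(describe(family))
print(describe("\U0001F44D", lookup={"\U0001F44D": "thumbs up"}))
```

In the extension itself the merge happens on spaCy tokens rather than on raw characters (the v2.0.0 notes above mention the Doc.retokenize API for this), so treat the sketch only as a picture of which character sequences count as a single emoji.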