Colibri Core Versions

Colibri Core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e. patterns with one or more gaps, either of fixed or dynamic size) in a quick and memory-efficient way. At the core is the tool ``colibri-patternmodeller``, which allows you to build, view, manipulate and query pattern models.
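
To make the terms concrete, here is a minimal plain-Python sketch of what n-grams and fixed-size skipgrams look like. It does not use Colibri Core and is not how the library represents patterns internally; the helper names `ngrams` and `skipgrams` and the `{*}` gap marker are assumptions made purely for illustration.

```python
from itertools import combinations

GAP = "{*}"  # hypothetical gap marker, chosen only for this illustration


def ngrams(tokens, n):
    """Return all contiguous n-grams of length n as tuples."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def skipgrams(tokens, n):
    """Return skipgrams of total length n.

    Each skipgram is an n-gram in which one or more inner positions are
    replaced by a gap; the first and last tokens are always kept.
    """
    results = []
    inner = range(1, n - 1)  # positions that may become gaps
    for gram in ngrams(tokens, n):
        for k in range(1, len(inner) + 1):
            for positions in combinations(inner, k):
                results.append(
                    tuple(GAP if i in positions else tok
                          for i, tok in enumerate(gram))
                )
    return results


tokens = "to be or not".split()
bigrams = ngrams(tokens, 2)
skips = skipgrams(tokens, 4)
```

For the four-token input above, `skips` contains the three patterns `to {*} or not`, `to be {*} not` and `to {*} {*} not`. Gaps of dynamic size are not covered by this sketch.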

v2.5.9

10 months ago

[Ko van der Sloot]

  • Major code cleanup: range-based for loops, override statement, convert most stream pointers to references, more const parameters

[Maarten van Gompel]

  • cleanup: remove last traces of python2 support, refactored exceptions
  • Set up wheel building

This release does not provide a shared library; use static linking instead.

v2.5.8

10 months ago
  • Python setup.py no longer attempts to build the Colibri Core C++ library; this must now be done manually beforehand.
  • Set up continuous integration and wheel building
  • Python test fix

v2.5.7

10 months ago

[Maarten van Gompel]

  • Fixed long option parsing
  • Fixed column length mismatch in TSV header/data output
  • Fixed a build problem
  • Updated installation instructions

[Ko van der Sloot]

  • Significant cleanup of the code
  • Updated for newer autoconf versions

v2.5.6

1 year ago

[Maarten van Gompel]

  • codemeta.json: updating according to (proposed) CLARIAH requirements (CLARIAH/clariah-plus#38)
  • Dockerfile: added

[Ko van der Sloot]

  • Code cleanup
    • added some exceptions for unwanted cases detected by scan-build
    • commented out the DOFLEXFROMCOOC and cached_DOFLEXFROMCOOC variables, as they seem useless
    • removed unused assignments

v2.5.5

4 years ago

Thanks to @kosloot, various warnings on clang were fixed in this minor release.

v2.5.4

4 years ago

Implemented the ability to prune subsumed n-grams (retaining only the longer, non-subsumed versions). This introduces a new PRUNESUBSUMED variable for PatternModelOptions. Note: this is an aggressive form of pruning that should also work for unordered models. Matching is based on types rather than individual tokens (all subsumed types are pruned).
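
The idea of type-based pruning can be sketched in plain Python as follows. This is an illustration under stated assumptions, not the library's implementation: the model is assumed to be a simple dict mapping n-gram tuples to counts, and `is_subsumed` and `prune_subsumed` are hypothetical helper names.

```python
def is_subsumed(short, long):
    """True if `short` occurs as a contiguous sub-sequence of the longer `long`."""
    n = len(short)
    return n < len(long) and any(
        long[i:i + n] == short for i in range(len(long) - n + 1)
    )


def prune_subsumed(model):
    """Drop every n-gram type that is contained in some longer n-gram type.

    `model` maps n-gram tuples to counts (an assumed, simplified structure).
    Counts are ignored on purpose: pruning looks at types, not tokens.
    """
    patterns = list(model)
    return {
        p: count
        for p, count in model.items()
        if not any(is_subsumed(p, q) for q in patterns)
    }


model = {
    ("a", "b"): 3,
    ("b", "c"): 2,
    ("a", "b", "c"): 2,
    ("x", "y"): 5,
}
pruned = prune_subsumed(model)
```

Here `("a", "b")` is removed even though it occurs three times and the trigram containing it only twice, which is what makes this form of pruning aggressive: one of its occurrences lies outside the longer pattern, yet the whole type is dropped.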

v2.5.3

4 years ago

Bugfix release: certain options from PatternModelOptions were not yet available in the Python binding.

v2.5.2

4 years ago

Bugfix release: pattern size and category constraints were not working for several methods (getcooc/getleftcooc/getrightcooc/getleftneighbours/getrightneighbours) (#44)

v2.5.1

4 years ago

Very minor update release:

  • Updated codemeta metadata
  • Added ClassEncoder.find()

v2.5.0

5 years ago

Better handling of large patterns: the PatternPointer size descriptor is now 64 bits (fixes #42), at the cost of a small increase in memory consumption in various computations.

(The experimental and relatively unused PatternPointerModels are not backwards compatible; contact me if this is a problem.)