PopPUNK Versions Save

PopPUNK 👨‍🎤 (POPulation Partitioning Using Nucleotide Kmers)

v2.6.1

7 months ago

Many small bug fixes

What's Changed

New Contributors

Full Changelog: https://github.com/bacpop/PopPUNK/compare/v2.6.0...v2.6.1

v2.6.0

1 year ago

Main changes:

  • Lineage fits now use reciprocal best match with --reciprocal-only, --count-unique-distances and --max-search-depth, which gives better results.
  • Fixes for threshold model assignment

What's Changed

New Contributors

Full Changelog: https://github.com/bacpop/PopPUNK/compare/v2.5.0...v2.6.0

v2.5.0

1 year ago

Minimum sketchlib version for this release is v2.0.0

New features:

  • Dendropy replaced with faster & more reliable alternatives #203
  • A new logo #202
  • Improve iterative PopPUNK code
  • Documentation update and improvements #191
  • Deal better with name clash when querying #190
  • Make manual start a bit easier to use #174
  • Replace t-SNE with mandrake
  • Output .microreact files, and allow direct creation of Microreact instances with an API key
  • Various QC additions to help with multi-cluster merges #194

Bug fixes:

  • Various fixes to cytoscape visualisation #185 #196 #210
  • Hide progress bars when using --plot-fit
  • Stop always checking query-query dists when clustering (and potential bug adding them to network twice)
  • Fix N QC when working with reads #207

What's Changed

New Contributors

Full Changelog: https://github.com/bacpop/PopPUNK/compare/v2.4.0...v2.5.0

v2.4.0

3 years ago

Minimum sketchlib version for this release is v1.7.0

To use --gpu-graph requires cudf and cugraph to be installed from the nvidia conda channel, which is not part of the standard installation)

New features:

  • Adds minimum spanning tree computation and visualisation #141 #148
  • Add two new network scores based on betweenness #146
  • Move boundary code into a C++ extension in this package #146 #158
  • Adds GPU accelerated graphs #87 #148
  • Adds a docker container which is used for web.poppunk.net #151 #162
  • New github actions for testing and building the web API #151
  • Add progress bars in for model assignment #155
  • Parallelise model assignment #155
  • Adds the VLKC terminology, and 'unword' cluster names #161

Bug fixes:

  • Correctly specify thread count with rapidnj #139
  • Regenerate random match changes after --update-db #149
  • Fix issue with label order when using --update-db more than once #152
  • Update some scripts/ to work with newer versions of numpy and scikit-learn #160
  • Keep hyphens in sample names in trees #159
  • Fix a plot name #158
  • Pin some package versions #140 #142

v2.3.0

3 years ago

This is a major (API-breaking) update which moves the assign and visualisation functions into their own programs, to make the program more modular. The minimum version of pp-sketchlib required is 1.6.0.

New features:

  • Lineage assign mode uses matrix code in pp-sketchlib #108
  • New algorithm for clique pruning #110
  • Visualisation and query moved out of main, and into their own programs #112 #115 #129
  • Simpler CLI defaults #125
  • Updated documentation #122
  • Add edge weights to graph #123
  • Add API for use of poppunk_assign with a http server #124 #131
  • Add corrected/uncorrected distances when plotting k-mer fits #136

Bug fixes:

  • More stable generation of documentation #132
  • Fixes continue mode for QC function #134
  • Fixes long length QC fail #137

v2.2.0

3 years ago

The first bug fix will affect many results, and all users are encouraged to upgrade

New features:

  • More thorough sample QC using pp-sketchlib features (#101)
  • Update to pp-sketchlib v1.5.1 (#104)

Bug fixes:

  • Misordered labels with older versions of pp-sketchlib (#95)
  • TypeError with visualisations (#99)
  • networkx still used in reference prune program (#97)

v2.1.1

3 years ago

NB python >=3.8 is now required (#81, #76)

v2.0.2

4 years ago

This accommodates updates to pp-sketchlib introduced since v1.1.0, mostly passing through new command line parameters (e.g. --use-gpu). See https://github.com/johnlees/pp-sketchlib/releases/tag/v1.3.0 for more info.

v2.0.0

4 years ago

This is a major new release of PopPUNK, which uses a new 'backend' to do sketching and distance calculation, pp-sketchlib

This changes the input format and the API somewhat, will be incompatible with previous database versions, and generates slightly different distance results. If you need backwards compatibility the previous version can still be run by specifying --use-mash.

New features:

  • Use pp-sketchlib as the backend. This is ~2x faster for sketching and 50-100x faster for distance calculations. Databases are ~1/4 of the size.
  • Input data is now formatted as a tab separated file with name followed by any associated sequence. Sample names no longer have to be filenames. (Closes #43, #46)
  • Read data can now be handled, including a filter to remove k-mers containing sequencing errors
  • Faster database edits with prune_db and reference_pick
  • Ability to use the previous sketching method and databases by specifying --use-mash

Bug fixes:

  • Better error handling when creating visualisations, so output files are still produced if this fails

v1.2.2

4 years ago

This adds:

  • The ability to use a simple core threshold (#49)
  • Use external clusters in generate viz mode (#55)

Other:

  • Add scripts into package (#56)