Kmer-db is a fast and memory-efficient tool for large-scale k-mer analyses (indexing, querying, estimating evolutionary relationships, etc.).
Changes from the previous release:
-multisample-fasta
mode,distance
mode,all2all
, new2all
, and distance
modes (-sparse
switch) with optional filtering (-above
/-below
),Changes from the last release:
-sparse
switch).new2all
mode (few-fold speed improvement).-multisample-fasta
mode.-multisample-fasta
mode.-extend
switch which allows extending existing kmer database.-multisample-fasta
mode which caused incorrect kmers counting.Added:
-mash-query
which is a mash distance calculated w.r.t. a query length (use if the query is much shorter than database sequences).Fixed:
Note: Starting from this release version numbering conforms to major.minor.patch scheme.
Added:
-phylip-out
in distance
mode which allows storing distance/similarity matrices in Phylip format.Fixed several bugs from 1.51 release:
minhash
mode).build
mode caused by the changes of the data structures. E.g., when tested on full k-mer spectrum of 40715 pathogen genomes, time and memory footprint decreased by 1/3 (1h30 to 1h, 60 to 40GB).Changes:
new2all
mode added,all2all
, new2all
, and one2all
modes,Bugs fixed:
Support of no-AVX2 build.