Straightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript.
metrics/distance/guth
namespace.clustering/k-means
.metrics/distance/jaro-winkler
.metrics/distance/overlap
performance.structures
namespace in favor of mnemonist.clustering/record-linkage
callbacks.merge
option to clustering/record-linkage/key-collision
.keyers/fingerprint
namespace back.phonetics/omission
& phonetics/skeleton
back to the keyers
namespace.metrics/distance/levenshtein
performance.hash/crc32
namespace.hash/minhash
namespace.helpers/random#createRandomIndex
function.helpers/random#createChoice
function.helpers/random#createDangerousButPerformantSample
function.helpers/random#createSuffleInPlace
function.clustering/record-linkage/blocking
namespace.clustering/record-linkage/canopy
namespace.distance/metrics/bag
namespace.distance/metrics/lcs
namespace.distance/metrics/length
namespace.distance/metrics/minhash
namespace.distance/metrics/mlipns
namespace.distance/metrics/prefix
namespace.distance/metrics/ratcliff-obershelp
namespace.distance/metrics/sift4
namespace.distance/metrics/smith-waterman
namespace.distance/metrics/suffix
namespace.phonetics/french/fonem
namespace.regexp
namespace.stemmers/french/carry
namespace.stemmers/french/eda
namespace.tokenizers/fingerprint
namespace.asymmetric
option to clustering/naive
.minClusterSize
option to clusterers.metrics/distance/damerau-levenshtein
.metrics/distance/levenshtein
.metrics/distance/hamming
.metrics/distance/hamming
.stats/ngrams
to tokenizers/ngrams
.keyers/omission
to phonetics/omission
.keyers/skeleton
to phonetics/skeleton
.clustering/record-linkage
namespace.stats/tfidf
namespace.keyers
namespace.phonetics/spanish/fonetico
namespace (should use phonogram now).VPTree
performance by building the tree iteratively..default
.clustering/key-collision
namespace.clustering/naive
namespace.clustering/sorted-neighborhood
namespace.clustering/vp-tree
namespace.metrics/distance/identity
namespace.metrics/distance/monge-elkan
namespace.structures/bk-tree#search
arguments.helpers/random
namespace.inflectors/spanish/noun
namespace.keyers/fingerprint
namespace.keyers/ngram-fingerprint
namespace.keyers/omission
namespace.keyers/skeleton
namespace.tag/averaged-perceptron
namespace.parsers/brown
namespace.parsers/conll
namespace.phonetics/onca
namespace.stemmers/spanish/unine
namespace.structures/bk-tree
namespace.structures/symspell
namespace.structures/vp-tree
namespace.sampler
options to clustering/k-means
.stats/descriptive#.quantile
function.stats/descriptive#.median
function.clustering/k-means
where k would be superior to the number of vectors.clustering/k-means
initialCentroids
options.clustering/k-means
where a vector could end up in several clusters.regex/classes
namespace.hasher
option of the ngrams
functions.set-functions
dependency.clustering/k-means
API.tokenizers/syllable/sonoripy
hierarchy definition.metrics/distance/eudex
namespace.phonetics/eudex
namespace.tokenizers/syllables/sonoripy
namespace.tokenizers/syllables/legalipy
API.tokenizers/sentences/naive
API.tokenizers/syllables/legalipy
to correctly handle capitalized words.phonetics/alpha-sis
namespace.phonetics/fuzzy-soundex
namespace.phonetics/phonex
namespace.stemmers/uea-lite
namespace.stats/inferential#sampleCovariance
function.stats/inferential#sampleCorrelation
function.metrics
namespace to metrics/distance
.