Yomguithereal Talisman Versions Save

Straightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript.

0.18.0

7 years ago
  • Adding the metrics/distance/guth namespace.
  • Fixing a bug related to Levenshtein distance prefix trimming.
  • Fixing a bug related to clustering/k-means.

0.17.0

7 years ago
  • Fixing metrics/distance/jaro-winkler.
  • Improving metrics/distance/overlap performance.
  • Dropping the structures namespace in favor of mnemonist.

0.16.0

7 years ago
  • Changing the way the fingerprint API.
  • Providing index of item in some clustering/record-linkage callbacks.
  • Adding merge option to clustering/record-linkage/key-collision.
  • Adding the keyers/fingerprint namespace back.
  • Moving phonetics/omission & phonetics/skeleton back to the keyers namespace.
  • Improving metrics/distance/levenshtein performance.

0.15.0

7 years ago
  • Adding the hash/crc32 namespace.
  • Adding the hash/minhash namespace.
  • Adding the helpers/random#createRandomIndex function.
  • Adding the helpers/random#createChoice function.
  • Adding the helpers/random#createDangerousButPerformantSample function.
  • Adding the helpers/random#createSuffleInPlace function.
  • Adding the clustering/record-linkage/blocking namespace.
  • Adding the clustering/record-linkage/canopy namespace.
  • Adding the distance/metrics/bag namespace.
  • Adding the distance/metrics/lcs namespace.
  • Adding the distance/metrics/length namespace.
  • Adding the distance/metrics/minhash namespace.
  • Adding the distance/metrics/mlipns namespace.
  • Adding the distance/metrics/prefix namespace.
  • Adding the distance/metrics/ratcliff-obershelp namespace.
  • Adding the distance/metrics/sift4 namespace.
  • Adding the distance/metrics/smith-waterman namespace.
  • Adding the distance/metrics/suffix namespace.
  • Adding the phonetics/french/fonem namespace.
  • Adding the regexp namespace.
  • Adding the stemmers/french/carry namespace.
  • Adding the stemmers/french/eda namespace.
  • Adding the tokenizers/fingerprint namespace.
  • Adding the asymmetric option to clustering/naive.
  • Adding the minClusterSize option to clusterers.
  • Adding limited version of metrics/distance/damerau-levenshtein.
  • Adding limited version of metrics/distance/levenshtein.
  • Adding bitwise version of metrics/distance/hamming.
  • Adding normalized version of metrics/distance/hamming.
  • Moving stats/ngrams to tokenizers/ngrams.
  • Moving keyers/omission to phonetics/omission.
  • Moving keyers/skeleton to phonetics/skeleton.
  • Moving similarity clusterers to the clustering/record-linkage namespace.
  • Dropping the stats/tfidf namespace.
  • Dropping the keyers namespace.

0.14.0

7 years ago
  • Dropping the phonetics/spanish/fonetico namespace (should use phonogram now).
  • Improving VPTree performance by building the tree iteratively.
  • Found a way to ease CommonJS by getting rid of pesky .default.

0.13.0

7 years ago
  • Adding the clustering/key-collision namespace.
  • Adding the clustering/naive namespace.
  • Adding the clustering/sorted-neighborhood namespace.
  • Adding the clustering/vp-tree namespace.
  • Adding the metrics/distance/identity namespace.
  • Adding the metrics/distance/monge-elkan namespace.
  • Reversing structures/bk-tree#search arguments.

0.12.0

7 years ago
  • Adding the helpers/random namespace.
  • Adding the inflectors/spanish/noun namespace.
  • Adding the keyers/fingerprint namespace.
  • Adding the keyers/ngram-fingerprint namespace.
  • Adding the keyers/omission namespace.
  • Adding the keyers/skeleton namespace.
  • Adding the tag/averaged-perceptron namespace.
  • Adding the parsers/brown namespace.
  • Adding the parsers/conll namespace.
  • Adding the phonetics/onca namespace.
  • Adding the stemmers/spanish/unine namespace.
  • Adding the structures/bk-tree namespace.
  • Adding the structures/symspell namespace.
  • Adding the structures/vp-tree namespace.
  • Adding the sampler options to clustering/k-means.
  • Adding the stats/descriptive#.quantile function.
  • Adding the stats/descriptive#.median function.
  • Fixing a bug with clustering/k-means where k would be superior to the number of vectors.
  • Fixing a bug with clustering/k-means initialCentroids options.
  • Fixing a bug with clustering/k-means where a vector could end up in several clusters.
  • Dropping the internal regex/classes namespace.
  • Dropping the hasher option of the ngrams functions.
  • Dropping the set-functions dependency.

0.11.0

7 years ago
  • Improving clustering/k-means API.
  • Improving tokenizers/syllable/sonoripy hierarchy definition.

0.10.0

7 years ago
  • Adding the metrics/distance/eudex namespace.
  • Adding the phonetics/eudex namespace.
  • Adding the tokenizers/syllables/sonoripy namespace.
  • Adding some import shortcuts for naives tokenizers.
  • Improving the tokenizers/syllables/legalipy API.
  • Improving the tokenizers/sentences/naive API.
  • Fixing tokenizers/syllables/legalipy to correctly handle capitalized words.

0.9.0

7 years ago
  • Adding the phonetics/alpha-sis namespace.
  • Adding the phonetics/fuzzy-soundex namespace.
  • Adding the phonetics/phonex namespace.
  • Adding the stemmers/uea-lite namespace.
  • Adding the stats/inferential#sampleCovariance function.
  • Adding the stats/inferential#sampleCorrelation function.
  • Moving the metrics namespace to metrics/distance.