Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOT...
Four word embedding models implemented in Python. Supporting arbitrary c...
A Lite Bert For Self-Supervised Learning Language Representations
Touch typing trainer using N-grams as data source, with options to custo...
datagrand 2019 information extraction competition rank9
Colibri core is an NLP tool as well as a C++ and Python library for work...
Cluster and merge similar string values: an R implementation of Open Ref...
A fuzzy matching string distance library for Scala and Java that include...
Get n-grams from text
Fast n-Gram Tokenization
Top-k Approximate String Matching.
Mirror of SRILM
natural language processing
multiprocess unsupervised chinese_detect_words ngram_combination