A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA...
A bidirectional recurrent neural network model with an attention mechanism ...
A sentence segmenter that actually works!
A TensorFlow implementation of Neural Sequence Labeling model, which is ...
Text normalization library for Python
Punctuation restoration and spell correction experiments.
Pre-process Arabic text (remove diacritics, punctuation and repeating c...
A PyTorch implementation of a punctuation prediction system using (B)LST...
Unicode tokeniser. Ucto tokenizes text files: it separates words from pu...
Apache OpenNLP wrapper for Node.js