Unsupervised Word Segmentation for Neural Machine Translation and Text G...
Unsupervised text tokenizer focused on computational efficiency
JavaScript BPE Tokenizer Encoder Decoder for OpenAI's GPT-2 / GPT-3 / GP...
Fast and customizable text tokenization library with BPE and SentencePie...
Explains nlp building blocks in a simple manner.
Byte Pair Encoding for Python!
nfelib - bindings Python para e ler e gerir XML de NF-e, NFS-e nacional,...
Go BPE tokenizer (Encoder+Decoder) for GPT2 and GPT3
Subword Encoding in Lattice LSTM for Chinese Word Segmentation
Machine Learning for Phishing Website Detection
GPT3 encoder & decoder tool written in Swift