High-performance Chinese tokenizer with both GBK and UTF-8 charset suppo...
A natural language processing library based on deep learning
A tiny Chinese word segmentation engine with comprehensive algorithm coverage | A micro tokenizer for Chinese