Chatbot in 200 lines of code using TensorLayer
An R package for the Quantitative Analysis of Textual Data
高质量中文预训练模型集合:最先进大模型、最快小模型、相似度专门模型
Crawl BookCorpus
Curated List of Persian Natural Language Processing and Information Retr...
中文医疗信息处理基准CBLUE: A Chinese Biomedical Language Understanding E...
An Integrated Corpus Tool With Multilingual Support for the Study of Lan...
Korean corpus repository
微信公众号语料库
❤️Emotional First Aid Dataset, 心理咨询问答、聊天机器人语料库
Some useful Chinese corpus datasets 中文语料小数据
Fuzzing resources for feeding various fuzzers with input. 🔧
A Curated List of Dataset and Usable Library Resources for NLP in Bahasa...
A dataset of millions of news articles scraped from a curated list of da...
ChatGPT 中文语料库 对话语料 小说语料 客服语料 用于训练大模型