:orange_book: Chinese Xinhua Dictionary database. Includes xiehouyu (two-part allegorical sayings), idioms, words, and Chinese characters.
Large Scale Chinese Corpus for NLP
A curated list of resources for Chinese NLP
BELLE: Be Everyone's Large Language model Engine (open-source Chinese conversational large language model)
Language Technology Platform
Firefly: a large-model training tool that supports training Gemma, MiniCPM, Yi, Deepseek, Orion, Xv...
Fengshenbang-LM (Fengshenbang large models), led by the Cognitive Computing and Natural Language Research Center of the IDEA Research Institute...
Baidu NLP: word segmentation, part-of-speech tagging, named entity recognition, and word importance
fastNLP: A Modularized and Extensible NLP Framework. Currently still in ...
Chinese-LLaMA 1&2 and Chinese-Falcon base models; ChatFlow Chinese dialogue model; Chinese Op...
MNBVC (Massive Never-ending BT Vast Chinese corpus): an ultra-large-scale Chinese corpus...
33B Chinese LLM, DPO QLORA, 100K context, AirLLM 70B inference with sing...
Chinese Named Entity Recognition with IDCNN/biLSTM+CRF, and Relation Ext...
An Efficient Lexical Analyzer for Chinese