MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对...
:books:中文突发事件语料库(Chinese Emergency Corpus)-上海大学-语义智能...
A curated list of Open Information Extraction (OIE) resources: papers, c...
ChatGPT 中文语料库 对话语料 小说语料 客服语料 用于训练大模型
UA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainia...
DANeS is an open-source E-newspaper dataset by collaboration between DAT...
Utilities for Processing the Switchboard Dialogue Act Corpus
Biomedical NLP Corpus or Datasets.
A Public Corpus for Machine Learning
:books:中文环境突发事件语料库(Chinese Environment Emergency Corpus)-上...
Reading the data from OPIEC - an Open Information Extraction corpus
Utilities for Processing the Meeting Recorder Dialogue Act Corpus
Korean ASR Corpus generated from TEDx talks