Home
Projects
Resources
Alternatives
Blog
Sign In
Nlp Public Dataset
Save
Chinese, English NER, English-Chinese machine translation dataset. 中英文实体识别数据集,中英文机器翻译数据集, 中文分词数据集
Overview
Reviews
Resources
Project README
NLP-dataset (General)
Huggingface, datasets
Awesome-Chinese-NLP, Chinese
CLUEDatasetSearch, Chinese
funNLP, Chinese
ChineseNLPCorpus1, Chinese
ChineseNLPCorpus2, Chinese
CLUE, Chinese
Chinese NLP data by ShannonAI, Chinese
nlp-datasets, Multilingual
awesome-nlp, Multilingual
Word Segmentation (Chinese)
SIGHAN2005
multi-criteria-cws
Chinese NLP data by ShannonAI, Chinese
NER dataset (English)
various NER dataset
CoNLL-2003, Offical
,
CoNLL-2003, other link
WNUT-2016, Twitter
OntoNotes-5.0, broadcase news, braodcase conversation, weblogs, magzine genre
Wikigold
Twitter
kaggle
MUC6
MUC7
NER dataset (Chinese)
MSRA, OntoNotes 4.0, Resume, Weibo
CLUENER
RenMinRiBao
MSRA
Boson
Weibo
Others
Machine Translation (Chinese-English)
WMT 2020
AI challenger
(英中翻译规模最大的口语领域英中双语对照数据集)
UM-Corpus: A Large English-Chinese Parallel Corpus
OpenSubtitles2016
MultiUN
Open Source Agenda is not affiliated with "Nlp Public Dataset" Project. README Source:
quincyliang/nlp-public-dataset
Stars
340
Open Issues
2
Last Commit
3 years ago
Repository
quincyliang/nlp-public-dataset
Tags
Machine Learning Dataset
Nlp Datasets
Open Source Agenda Badge
Submit Review
Review Your Favorite Project
Submit Resource
Articles, Courses, Videos
Submit Article
Submit a post to our blog
From the blog
Dec 11, 2022
How to Choose Which Programming Language to Learn First?
From the blog
Dec 11, 2022
How to Choose Which Programming Language to Learn First?
Home
Projects
Resources
Alternatives
Blog
Sign In
Sign In to OSA
I agree with
Terms of Service
and
Privacy Policy
Sign In with Github