A curated list of Human Preference Datasets for LLM fine-tuning, RLHF, and eval.
A curated list of open source Human Preference datasets for LLM instruction-tuning, RLHF and evaluation.
For general NLP datasets and text corpora, check out this awesome list.
Anthropic Helpfulness and Harmlessness Dataset (HH-RLHF)
OpenAssistant Conversations Dataset (OASST1)
Stanford Human Preferences Dataset (SHP)
Human ChatGPT Comparison Corpus (HC3)
HuggingFace H4 StackExchange Preference Dataset