[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences fo...
A curated list of Human Preference Datasets for LLM fine-tuning, RLHF, a...