Implementation of RLHF (Reinforcement Learning with Human Feedback) on t...
A curated list of reinforcement learning with human feedback resources (...
Open-source pre-training implementation of Google's LaMDA in PyTorch. Ad...
The ParroT framework to enhance and regulate the Translation Abilities d...
Let's build better datasets, together!
[CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffus...