Baseline implementation of recurrent PPO using truncated BPTT
Official PyTorch Implementation for the "Distilling Datasets Into Less T...