Clean baseline implementation of PPO using an episodic TransformerXL memory