PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy ...
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo ...
PyTorch C++ Reinforcement Learning
PyTorch implementation of Soft Actor-Critic (SAC)
PyTorch implementation of REINFORCE for both discrete and continuous control
Code for the paper "Evolved Policy Gradients"
TensorFlow implementation of generative adversarial imitation learning
End-to-end motion planner using Deep Deterministic Policy Gradient (DDPG...
Implementation of A3C for MuJoCo Gym environments
PyTorch implementation of the RDPG (Recurrent Deterministic Policy Gradi...
Catalyst.RL: A Distributed Framework for Reproducible RL Research
A workbench for online model-free Reinforcement Learning on continuous c...