Implementation of Generatve Adversarial Imitation Learning (GAIL) for classic environments from OpenAI Gym.
This repository provides a TensorFlow implementation of Generatve Adversarial Imitation Learning (GAIL) and Behavioural Cloning (BC) for classic cartpole-v0 environment from OpenAI Gym. (based on Generative Adversarial Imitation Learning, Jonathan Ho & Stefano Ermon.)
Reinforcement Learning algorithm: PPO, is used for generating the expert trajectory data for the CartPole-v0 environment.
python3 run_ppo.py
python3 sample_trajectory.py
python3 run_gail.py
python3 run_behavior_clone.py
python3 test_policy.py
Training and Testing results for GAIL |
For example to test behavioral cloning:
python3 test_policy.py --alg=bc --model=1000
This repository provides a Pytorch implementation of Generatve Adversarial Imitation Learning (GAIL) for bipedwalker-v2 environment from OpenAI Gym.
Bipedwalker-v2
State space (Continuous): (1) hull angle, (2) angular velocity, (3) horizontal speed, (4) vertical speed, (5) position of joints (6) joints angular speed, (7) legs contact with ground, and (8) lidar rangefinder measurements
Action: joint motor torques