Khrylx PyTorch RL Save

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

Project README

PyTorch implementation of reinforcement learning algorithms

This repository contains:

policy gradient methods (TRPO, PPO, A2C)
Generative Adversarial Imitation Learning (GAIL)

Important notes

The code now works for PyTorch 0.4. For PyTorch 0.3, please check out the 0.3 branch.
To run mujoco environments, first install mujoco-py and gym.
If you have a GPU, I recommend setting the OMP_NUM_THREADS to 1 (PyTorch will create additional threads when performing computations which can damage the performance of multiprocessing. This problem is most serious with Linux, where multiprocessing can be even slower than a single thread):

export OMP_NUM_THREADS=1

Features

Support discrete and continous action space.
Support multiprocessing for agent to collect samples in multiple environments simultaneously. (x8 faster than single thread)
Fast Fisher vector product calculation. For this part, Ankur kindly wrote a blog explaining the implementation details.

Policy gradient methods

Example

python examples/ppo_gym.py --env-name Hopper-v2

Reference

Generative Adversarial Imitation Learning (GAIL)

To save trajectory

python gail/save_expert_traj.py --model-path assets/learned_models/Hopper-v2_ppo.p

To do imitation learning

python gail/gail_gym.py --env-name Hopper-v2 --expert-traj-path assets/expert_traj/Hopper-v2_expert_traj.p

Open Source Agenda is not affiliated with "Khrylx PyTorch RL" Project. README Source: Khrylx/PyTorch-RL

Stars

1,046

Open Issues

Last Commit

3 years ago

Repository

Khrylx/PyTorch-RL

License

MIT

Open Source Agenda Badge

<a href="https://www.opensourceagenda.com/projects/khrylx-pytorch-rl"><img src="https://www.opensourceagenda.com/projects/khrylx-pytorch-rl/reviews/badge.svg" alt="Open Source Agenda"></a>

Submit Review Review Your Favorite Project

Submit Resource Articles, Courses, Videos

Submit Article Submit a post to our blog

From the blog

Dec 11, 2022

How to Choose Which Programming Language to Learn First?

From the blog

Dec 11, 2022