PyTorch CPO Save

PyTorch implementation of Constrained Policy Optimization

Project README

PyTorch implementation of Constrained Policy Optimization (CPO)

This repository has a simple to understand and use implementation of CPO in PyTorch. A dummy constraint function is included and can be adapted based on your needs.

Pre-requisites

PyTorch (The code is tested on PyTorch 1.2.0.)
OpenAI Gym.
MuJoCo (mujoco-py)
If working with a GPU, set OMP_NUM_THREADS to 1 using:

export OMP_NUM_THREADS=1

Features

Tensorboard integration to track learning.
Best model is tracked and saved using the value and standard deviation of average reward.

Usage

python algos/main.py --env-name CartPole-v1 --algo-name=CPO --exp-num=1 --exp-name=CPO/CartPole --save-intermediate-model=10 --gpu-index=0 --max-iter=500

Code Reference

Khrylx/PyTorch-RL

Technical Details on CPO

main feasible infeasible

Open Source Agenda is not affiliated with "PyTorch CPO" Project. README Source: SapanaChaudhary/PyTorch-CPO

Stars

Open Issues

Last Commit

2 years ago

Repository

SapanaChaudhary/PyTorch-CPO

License

MIT

Open Source Agenda Badge

<a href="https://www.opensourceagenda.com/projects/pytorch-cpo"><img src="https://www.opensourceagenda.com/projects/pytorch-cpo/reviews/badge.svg" alt="Open Source Agenda"></a>

Submit Review Review Your Favorite Project

Submit Resource Articles, Courses, Videos

Submit Article Submit a post to our blog

From the blog

Dec 11, 2022

How to Choose Which Programming Language to Learn First?

From the blog

Dec 11, 2022