Pytorch Nfsp Save

Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016)

Project README

pytorch-nfsp

An implementation of Deepmind's Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016) with LaserTag-v0. The paper introduces Neural Fictitious Self-Play(NFSP) which is a deep-learning version of FSP in Fictitious Self-Play in Extensive-Form Games (Heinrich et al. 2015).

Requirements

pytorch 0.4
gym
lasertag

Examples

python main.py --env 'LaserTag-small4-v0' for training

python main.py --env 'LaserTag-small2-v0' --render if you want to watch rendered game on screen.

python main.py --env 'LaserTag-small2-v0' --render --evaluate if you want to evaluate/enjoy the model which is already trained. I included models/LaserTag-small*-v0-dqn-model.pth so you can see how trained agents play against each other.

For more details, See arguments.py

LaserTag-small2-v0

small2.gif

LaserTag-small3-v0

small3.gif

LaserTag-small4-v0

small4.gif

Agents are trained with NFSP. Agents get a unit reward for touching other agent with laser beam. If an agent is hit twice then that agent will be sent to random respawn place. An episode consists of 1000 frames.

Open Source Agenda is not affiliated with "Pytorch Nfsp" Project. README Source: younggyoseo/pytorch-nfsp
Stars
40
Open Issues
1
Last Commit
5 years ago
Tags

Open Source Agenda Badge

Open Source Agenda Rating