Mgym Save

A collection of multi-agent reinforcement learning OpenAI gym environments

Project README

Multi-agent gym environments

snakeclip

This repository has a collection of multi-agent OpenAI gym environments.

DISCLAIMER: This project is still a work in progress.

Dependencies

gym
numpy

Installation

git clone https://github.com/cjm715/mgym.git
cd mgym/
pip install -e .

Environments

TicTacToe-v0
RockPaperScissors-v0
PrisonersDilemma-v0
BattleOfTheSexes-v0
StagHunt-v0
MatchingPennies-v0
ElFarol-v0
Snake-v0 (See OpenAI's Request-for-research)

Examples

import gym
import mgym
import random

env = gym.make('TicTacToe-v0')
fullobs = env.reset()
while True:
    print('Player O ') if fullobs[0] else print('Player X')
    a = random.choice(env.get_available_actions())
    fullobs,rewards,done,_ = env.step(a)
    env.render()
    if done:
        break

import gym
import mgym

env = gym.make('MatchingPennies-v0')
env.reset(3)
while True:
    a = env.action_space.sample()
    _,r,done,_ = env.step(a)
    env.render()
    if done:
        break

See further examples in mgym/examples/examples.ipynb.

How are multi-agent environments different than single-agent environments?

When dealing with multiple agents, the environment must communicate which agent(s) can act at each time step. This information must be incorporated into observation space. Conversely, the environment must know which agents are performing actions. Therefore this must be communicated in the action passed to the environment. The form of the API used for passing this information depends on the type of game. The two types are

one-at-a-time play (like TicTacToe, Go, Monopoly, etc) or
simultaneous play (like Soccer, Basketball, Rock-Paper-Scissors, etc).

In the TicTacToe example above, this is an instance of one-at-a-time play. The fullobs is a tuple (next_agent, obs). The variable next_agent indicates which agent will act next. obs is the typical observation of the environment state. The action a is also a tuple given by a = (acting_agent, action) where the acting_agent is the agent acting with the action given by variable action.

Testing

To run tests, install pytest with pip install pytest and run python -m pytest

Open Source Agenda is not affiliated with "Mgym" Project. README Source: cjm715/mgym

Stars

Open Issues

Last Commit

3 years ago

Repository

cjm715/mgym

Open Source Agenda Badge

<a href="https://www.opensourceagenda.com/projects/mgym"><img src="https://www.opensourceagenda.com/projects/mgym/reviews/badge.svg" alt="Open Source Agenda"></a>

Submit Review Review Your Favorite Project

Submit Resource Articles, Courses, Videos

Submit Article Submit a post to our blog

From the blog

Dec 11, 2022

How to Choose Which Programming Language to Learn First?

From the blog

Dec 11, 2022