Distributed MADDPG Save

Distributed Multi-Agent Cooperation Algorithm based on MADDPG with prioritized batch data.

Project README

Distributed-MADDPG

Distributed Multi-Agent Cooperation Algorithm based on MADDPG with prioritized batch data.

Distributed Multi-Agent Architecture

Introduction

This work focus on Multi-Agent Cooperation Problem. We proposed a method which consists 3 components:

Related research - MADDPG This algorithm comes from Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments
Prioritized Batch Data To optimize one-step update without losing diversity, we divide batch data into several parts and prioritize these batches. Using the batch data with maximal loss to do one-step update.
Distributed Multi-Agent Architecture Similar to A3C algorithm, we adopt this Master and Multi-Worker architecture in our work.

Experiment

Implementation

Keras 2.1.2 （tensorflow 1.4 as backend）
mpi4py
Python 3.6
CUDA 8.0 + cuDNN 6.0

Environment

Modified original environment (you can find in my repo) from OpenAI
- Fixed landmark
- Border

Neural Network

Result

Learning Progress

DDPG & MADDPG & PROPOSED

How to run this program

For program using MPI:

mpiexec -np [worker_number] python mpi-xxx.py

mpiexec -np 4 python mpirun_main.py

For others:

python xxx.py

Future Work (4 vs 2)

Thanks to

Open Source Agenda is not affiliated with "Distributed MADDPG" Project. README Source: namidairo777/Distributed-MADDPG

Stars

Open Issues

Last Commit

3 years ago

Repository

namidairo777/Distributed-MADDPG

License

MIT

Open Source Agenda Badge

<a href="https://www.opensourceagenda.com/projects/distributed-maddpg"><img src="https://www.opensourceagenda.com/projects/distributed-maddpg/reviews/badge.svg" alt="Open Source Agenda"></a>

Submit Review Review Your Favorite Project

Submit Resource Articles, Courses, Videos

Submit Article Submit a post to our blog

From the blog

Dec 11, 2022

How to Choose Which Programming Language to Learn First?

From the blog

Dec 11, 2022