Project README

Super Mario Bros RL

Alt text

Advantage Actor critic [1]
Parallel Advantage Actor critic [2]
Noisy Networks for Exploration [3]
Proximal Policy Optimization Algorithms [4]
Curiosity-driven Exploration by Self-supervised Prediction [5] (WIP)
'Random Network Distillation' pytorch model
'Curiosity-driven Exploration' pytorch model

1. Setup

Requirements

2. How to Train

Modify the parameters in mario_a2c.py as you like.

python3 mario_a2c.py

python3 mario_ppo.py

3. How to Eval

Modify the is_load_model, is_render parameters in mario_a2c.py as you like.

python3 mario_a2c.py

python3 mario_ppo.py

4. Loss/Reward Graph

It use just A2C(PAAC)

It use just ICM and no ext reward.(Curiosity-driven)

References

[1] Actor-Critic Algorithms
[2] Efficient Parallel Methods for Deep Reinforcement Learning
[3] Noisy Networks for Exploration
[4] Proximal Policy Optimization Algorithms
[5] Curiosity-driven Exploration by Self-supervised Prediction

Open Source Agenda is not affiliated with "Mario Rl" Project. README Source: jcwleo/mario_rl

Stars

Open Issues

Last Commit

5 years ago

Repository

jcwleo/mario_rl

License

MIT

Open Source Agenda Badge

<a href="https://www.opensourceagenda.com/projects/mario-rl"><img src="https://www.opensourceagenda.com/projects/mario-rl/reviews/badge.svg" alt="Open Source Agenda"></a>

Submit Review Review Your Favorite Project

Submit Resource Articles, Courses, Videos

Submit Article Submit a post to our blog

From the blog

Dec 11, 2022

Mario Rl Save

Super Mario Bros RL

1. Setup

Requirements

2. How to Train

3. How to Eval

4. Loss/Reward Graph

References

Open Source Agenda Badge

From the blog

How to Choose Which Programming Language to Learn First?

From the blog

How to Choose Which Programming Language to Learn First?