Coach Versions

Reinforcement Learning Coach by Intel AI Lab enables easy experimentation with state-of-the-art reinforcement learning algorithms.

v1.0.0

4 years ago

TD3
New APIs for Coach usage as a library
Updated Getting Started tutorial
Batch RL tutorial

v0.12.1

4 years ago

Fixes for breaking API changes (OpenAI Gym, SciPy)
OPE: Weighted Importance Sampling
Creating a dataset using an agent
Printing input size as part of network summary

v0.12.0

5 years ago

ACER
Soft Actor-Critic
BCQ
Batch RL
Off-policy evaluation (estimators: DM, DR, Sequential DR, IPS)

v0.11.2

5 years ago

Intel TensorFlow fix

v0.11.1

5 years ago

Rollout worker memory leak fix
wxPython dependency removal

v0.11.0

5 years ago

Horizontal scaling
MXNet support
ONNX export
New documentation

v0.10.0

5 years ago

A complete redesign that is not backward compatible. The new design enables multi-agent support.

New features -

PIP package
Benchmarks
Hierarchical Reinforcement Learning (demonstrated by Hierarchical Actor-Critic)
Tutorials
Shared memory (e.g. Replay Buffer) between workers
Tests (unit-tests, reward-based tests, trace-based tests)
Using Coach as a library (see example here)

New Environments -

Toy Environments (Exploration Chain, BitFlip)
DeepMind PySC2 support (StarCraft II)
DeepMind Control Suite

New Algorithms -

Hindsight Experience Replay
Prioritized Experience Replay
Hierarchical Actor-Critic
UCB with Q-Ensembles

v0.9.0

6 years ago

New features -

CARLA 0.7 simulator integration
Human control of the game play
Recording of human game play and storing / loading the replay buffer
Behavioral cloning agent and presets
Golden tests for several presets
Selecting between deep / shallow image embedders
Rendering through pygame (with some boost in performance)

API changes -

Improved environment wrapper API
Added an evaluate flag to allow convenient evaluation of existing checkpoints
Improved frameskip definition in Gym

Bug fixes -

Fixed loading of checkpoints for agents with more than one network
Fixed Python 3 compatibility of the N-Step Q-Learning agent

v0.8.0

6 years ago

Initial public release