Acme Versions

A library of reinforcement learning components and agents

0.4.0

2 years ago

0.3.0

2 years ago

0.2.4

2 years ago

0.2.3

2 years ago

0.2.2

2 years ago

Highlights

This is a minor release to update the dependencies to new stable versions of TensorFlow and Reverb (14cb62f).
New SVG-0 agent (1ecabdb).

Minor changes and fixes

Fixes to docs and testing.
Minor updates to tf-DQN.
Updates to LP tear down.
Update to how signals are handled.
Update to setup/requirements.
Reorganize Builder and integrate jax-specific functionality.
Improvements to typing.

0.2.1

2 years ago

Highlights

Released distributed versions of some agents https://github.com/deepmind/acme/commit/3bc0426bb17797f066a6afe223b563385a2fe839.
Adders use Reverb's TrajectoryWriter (instead of Writer).
Introduce the Builder API https://github.com/deepmind/acme/commit/329aae50695ca0290d6c151b7bbd4cefb1ccf83d.
Add JAX behavior cloning agent https://github.com/deepmind/acme/commit/eb7073fc7bcbd5b2c26ae5e52ab37e7212a5a1d4.
Add Python 3.8 support.

Minor changes and fixes

Use nightly versions of TensorFlow, Reverb, and TensorFlow Probability.
Remove pandas dependency.
Skip tests that depend on Atari ROMs (atari_py no longer provides them as part of its pypi package).
Fix links to ipynb pages.

0.2.0

3 years ago

Highlights

Using stable releases for TensorFlow (>=2.3.0), Reverb, and TensorFlow Probability.
Added Critic Regularized Regression (code, paper)
Added Discrete Batch-Constrained Deep Q-learning (code, paper)
Added EnvironmentLoop.run_episode() for running a single episode.
Update EnvironmentLoop.run() to take num_steps, allowing the control of step count rather than just episode count.
Add more distribution types (e.g. GaussianMixture) which can be used by policies.
Added an environment wrapper for action repeats.
Improvements/tuning to datasets exposed by make_dataset.
Add support for nested / multidimensional rewards and discounts.
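
The difference between controlling a loop by episode count and by step count can be sketched with a small toy. This is not Acme's implementation: ToyEnvironment and ToyLoop are hypothetical stand-ins written for illustration, and the choices that exactly one of num_episodes or num_steps must be given and that an episode always runs to completion before the step budget is checked are assumptions of this sketch, not something these release notes state.

```python
from typing import Callable, Optional


class ToyEnvironment:
    """Hypothetical environment whose episodes last exactly `episode_length` steps."""

    def __init__(self, episode_length: int = 5):
        self._episode_length = episode_length
        self._t = 0

    def reset(self) -> int:
        self._t = 0
        return self._t

    def step(self, action: int):
        # Returns (observation, reward, done); the reward is always 1.0.
        self._t += 1
        done = self._t >= self._episode_length
        return self._t, 1.0, done


class ToyLoop:
    """Toy stand-in for an environment loop (assumption: not Acme's code)."""

    def __init__(self, environment: ToyEnvironment, policy: Callable[[int], int]):
        self._environment = environment
        self._policy = policy

    def run_episode(self) -> dict:
        """Runs a single episode and returns its length and return."""
        observation = self._environment.reset()
        episode_return, episode_length, done = 0.0, 0, False
        while not done:
            action = self._policy(observation)
            observation, reward, done = self._environment.step(action)
            episode_return += reward
            episode_length += 1
        return {"episode_length": episode_length, "episode_return": episode_return}

    def run(self, num_episodes: Optional[int] = None,
            num_steps: Optional[int] = None) -> dict:
        """Runs whole episodes until the episode or step budget is reached."""
        # Sketch-level choice: require exactly one of the two budgets.
        if (num_episodes is None) == (num_steps is None):
            raise ValueError("Pass exactly one of num_episodes or num_steps.")
        episodes, steps = 0, 0
        while True:
            if num_episodes is not None and episodes >= num_episodes:
                break
            if num_steps is not None and steps >= num_steps:
                break
            result = self.run_episode()
            episodes += 1
            steps += result["episode_length"]
        return {"episodes": episodes, "steps": steps}


loop = ToyLoop(ToyEnvironment(episode_length=5), policy=lambda observation: 0)
by_episodes = loop.run(num_episodes=3)
by_steps = loop.run(num_steps=12)
```

In this toy, a step budget of 12 with 5-step episodes ends after three full episodes (15 steps), because the budget is only checked between episodes.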

Minor changes and fixes

Added a ConstantInfo logger for logging constant information.
Added a should_update parameter to the EnvironmentLoop.
Various modifications and optimizations to the make_reverb_dataset() function.
Improvements to typing and pytype usage.
Other minor bug and documentation fixes.

0.1.8

3 years ago

Minor release to fix a version mismatch between TensorFlow and TensorFlow Probability (tf/tfp).

Other changes include:

A JAX R2D2 agent.
Moved various utility/helper functions for additional clarity.
Doc fixes.
Bug fix for the MCTS agent.

0.1.7

3 years ago

0.1.6

3 years ago