Acme Versions Save

A library of reinforcement learning components and agents

0.4.0

2 years ago

0.3.0

2 years ago

0.2.4

2 years ago

0.2.3

2 years ago

0.2.2

2 years ago

Highlights

  • This is a minor release to update the dependencies to new stable versions of Tensorflow and Reverb (14cb62f).
  • New SVG-0 agent (1ecabdb).

Minor changes and fixes

  • Fixes to docs and testing.
  • Minor updates to tf-DQN.
  • Updates to LP tear down.
  • Update to how signals are handled.
  • Update to setup/requirements.
  • Reorganize Builder and integrate jax-specific functionality.
  • Improvements to typing.

0.2.1

2 years ago

Highlights

Minor changes and fixes

  • Use nightly versions of TensorFlow, Reverb, and TensorFlow Probability.
  • Remove pandas dependency.
  • Skip tests that depend on Atari ROMs (atari_py no longer provides them as part of its pypi package).
  • Fix links to ipynb pages.

0.2.0

3 years ago

Highlights

  • Using stable releases for TensorFlow (>=2.3.0), Reverb, and TensorFlow Probability.
  • Added Critic Regularized Regression (code, paper)
  • Added Discrete Batch-Constrained Deep Q-learning (code, paper)
  • Added EnvironmentLoop.run_episode() for running a single episode.
  • Update EnvironmentLoop.run() to take num_steps, allowing the control of step count rather than just episode count.
  • Add more distribution types (e.g. GaussianMixture) which can be used by policies.
  • Added a environment wrapper for action repeats.
  • Improvements/tuning to datasets exposed by make_dataset.
  • Add support for nested / multidimensional rewards and discounts.

Minor changes and fixes

  • ConstantInfo logger for logging constant information.
  • Added a should_update parameter to the EnvironmentLoop.
  • Various modifications and optimizations to the make_reverb_dataset() function.
  • Improvements to typing and pytype usage.
  • Other minor bug and documentation fixes.

0.1.8

3 years ago

Minor version to fix a mismatch inversions between tf/tfp.

Other changes include:

  • a jax R2D2 agent.
  • movement of various utility/helper functions for additional clarity.
  • doc fixes.
  • bugfix for the mcts agent.

0.1.7

3 years ago

0.1.6

3 years ago