Autonomous Learning Library Versions Save

A PyTorch library for building deep reinforcement learning agents.

v0.9.1

2 months ago

Version 0.9.1. Includes the following updates:

  • Adds Gymnasium support
  • Adds Mujoco support
  • Adds hyperparameters logging after running experiments
  • Some other minor improvements to logging
  • Tweaks to SAC/DDPG hyperparameters and implementation
  • Minor workflow improvements
  • Some other dependency upgrades, including upgrading to torch~=2.2
  • Some minor bugfixes

What's Changed

Full Changelog: https://github.com/cpnota/autonomous-learning-library/compare/v0.8.0...v0.9.1

v0.9.1-alpha.3

2 months ago

Publish workflow testing

v0.8.0

1 year ago

This release includes several enhancements:

  • Dependencies have been updated.
  • FeatureNetwork logic was simplified.
  • Soft Actor-Critic was upgraded to a newer version which does not use a separate state-value function.
  • The logging interface was improved.

v0.7.2

2 years ago
  • Update Pytorch to 1.9.0 (#255)
  • Raise RuntimeError if clip_grad is enabled and norm is non-finite (#255)
  • Fix a bug with scaling of log_prob in soft policy (#256)

v0.7.1

2 years ago

A few minor under-the-hood tweaks and fixes:

  • Created a VectorEnvironment class and refactored the way parallel environments work #239
  • Added the parallel_test_agent method to ParallelPreset #240
  • Fixed a bug where using n_envs=1 on Atari presets used Body instead of ParallelBody #241
  • Fix DeepmindAtariBody to only use FrameStackBody if frame_stack > 1 #245
  • Fix bug where test_exploration was not being respected by GreedyPolicy #246
  • Improve stability of SAC by preventing the temperature from dropping below 0 #247
  • Update version of PettingZoo and use new environments in CI #351
  • Fix store_device for prioritized replay #249

v0.7.0

3 years ago

This release contains several new features, refactors, and bugfixes.

Features

  • Save/load agents. #185
  • (Experimental) Multiagent Atari support using PettingZoo. #201
  • Optional to store replay buffer on different device. #187
  • Better environment duplication using cloudpickle. #200
  • Built-in Identity feature network #202
  • Support for Comet.ml #215

Refactoring

  • Agents split into three types: Agent, ParallelAgent, and Multiagent #221
  • Changed the way Presets work to facilitate Save/Load #185
  • Switched from Travis to GitHub Actions for workflow/CI #235
  • Improvements to Environment imports #236
  • Support for latest PyTorch version #235

Bugfixes

  • Fixes a bug in ParallelGreedyPolicy #233
  • Fixes an issue with the Atari life_lost #
  • Documentation clarifications #192 #216

v0.6.0

3 years ago

This release contains some under-the-hood enhancements and bugfixes, most notably, a refactoring of the State class. State now supports adding arbitrary key/value pairs, allowing for more complex state spaces. Additionally, a StateArray class was added that automatically handles stacking/slicing states in various ways which making handling batches of data, multiple timesteps, and many other aspects easier. Here's a full list of the changes:

  • Refacted State and added StateArray object ( #160 and #167)
  • Added support for specifying custom models under all existing presets. Thanks to @michalgregor for the contribution! (#163)
  • Fixed a bug involving SAC evaluation mode. Thanks @michalgregor for identifying and proposing a fix! (#169)
  • Fixed a bug where the name of preconstructed Gym environments were not being handled correctly. Thanks @mctigger for reporting! (#169) Thanks @michalgregor for the fix! (#165, merged by #170)
  • Fixed a bug where the Atari FireReset wrapper was being applied to games with no Fire action, causing them to not run. Thanks @andrewsmike for reporting and fixing the bug! (#168)

v0.5.3

3 years ago

This release contains a hotfix, #155 , which improves the performance of the PPO continuous preset.

v0.5.2

3 years ago

Just some minor bug fixes and documentation improvements:

  • Datetime compatibility for Windows #137 #142
  • Continuous Integration fixes #138
  • SoftDeterministicPolicy scaling fix #140
  • Fix incorrect counting of test trials in parallel experiments #143
  • Remove trailing commas #146
  • First action was being selected using act() instead of eval() in test mode #150
  • Documentation improvements #151

v0.5.1

4 years ago

The previous release was missing the changes from #132 with correct the computation of the frames per second for parallel envs.