EnvPool Versions

C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.

v0.6.2

1 year ago

Bug Fix

  1. Fix action ref counting error (#154, @Trinkle23897)

New Env

  1. dm_control humanoid_CMU (#145, @Benjamin-eecs)

Enhancement

  1. XLA CustomCall interface, now we can use jax.jit on EnvPool (#149, @mavenlin)
  2. Remove Atari wrong bin (#147, @Trinkle23897)
  3. Add dm_control speed test script and single env performance benchmark (#139, @Trinkle23897)
  4. Upgrade mujoco to 2.2.0 (#142, @Trinkle23897)

v0.6.1

2 years ago

Note: Compared with dm_control, EnvPool gives roughly a 2x speedup for free, even in a single-environment setting (#139, @Trinkle23897). For the next release, we are going to use the mujoco source code (#141, @Trinkle23897) to make everything faster!

New dm_control Env

  1. Humanoid (#132, @Benjamin-eecs)
  2. Manipulator (#128, @yufansong, @Trinkle23897)
  3. Fish (#134, @Benjamin-eecs)
  4. CartPole (#137, @Benjamin-eecs)
  5. Swimmer (#140, @Benjamin-eecs)
  6. Misc (#138, @Trinkle23897)

v0.6.0

2 years ago

Note: we are still in the process of adding more environments from the dm_control suite and box2d.

New Environment

DeepMind Control Suite

  1. Hopper (#107, @Trinkle23897)
  2. Cheetah (#110, @yufansong)
  3. Walker (#114, @yufansong)
  4. Reacher (#115, @yufansong)
  5. Finger (#118, @yufansong)
  6. Pendulum (#122, @yufansong)
  7. BallInCup (#124, @Benjamin-eecs)
  8. PointMass (#123, @yufansong)
  9. Acrobot (#129, @Benjamin-eecs)
  10. Misc (#106, #125, #130, @Trinkle23897)

Box2D

  1. LunarLander-v2, LunarLanderContinuous-v2 (#111, @Alicia1529)

Others

  1. Pendulum-v1 (#126, @Trinkle23897)

Bug Fix

  1. Fix episodic_life bug in atari env (#121, @Trinkle23897)

Enhancement

  1. Support dynamic shaped arrays (#98, @mavenlin)
  2. Update Mujoco benchmark result (#99, @Trinkle23897)
  3. Fix code linter (#109, #116, #117, @Trinkle23897)

v0.5.3

2 years ago

This release provides a stable version for benchmarking. We will update the benchmark results for both Atari (Pong-v5) and Mujoco (Ant-v3) soon. We observe that Ant-v3 can achieve 2M+ FPS on a 192-core machine.
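To put the Ant-v3 figure in per-core terms, the arithmetic can be sketched as below. The two input numbers come from the note above; since the note says "2M+", the result is a lower bound, and the variable names are only illustrative.

```python
# Figures quoted in the release note (2M+ FPS is a lower bound).
total_fps = 2_000_000
num_cores = 192

# Lower bound on frames per second handled per core.
fps_per_core = total_fps / num_cores

print(f"at least {fps_per_core:,.0f} FPS per core")
```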

Enhancement

  1. Add gym_reset_return_info option for returning (obs, info) in gym.Env.reset (#97)
  2. Add clang-tidy check (#85, #95, #100)
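The effect of gym_reset_return_info can be illustrated with a toy class. This is a minimal sketch and not EnvPool code: ToyEnv, its observation, and its info contents are hypothetical, and only the option name comes from the release note.

```python
from typing import Any, Dict, List, Tuple, Union

Obs = List[float]
Info = Dict[str, Any]


class ToyEnv:
    """Hypothetical stand-in for an environment; not part of EnvPool."""

    def __init__(self, gym_reset_return_info: bool = False) -> None:
        # The option name is from the release note; the rest is assumed.
        self.gym_reset_return_info = gym_reset_return_info

    def reset(self) -> Union[Obs, Tuple[Obs, Info]]:
        obs = [0.0, 0.0]
        info = {"elapsed_step": 0}
        if self.gym_reset_return_info:
            # Return the (obs, info) pair described in the note.
            return obs, info
        # Otherwise return the observation only.
        return obs


old_style = ToyEnv(gym_reset_return_info=False).reset()
new_style = ToyEnv(gym_reset_return_info=True).reset()
```

Code that calls reset therefore has to unpack one value or two, depending on how the option is set.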

v0.5.2

2 years ago

Bug Fix

  1. Fix max_episode_steps for Pusher and Reacher (#93)

Enhancement

  1. add testing macro ENVPOOL_TEST (#93, #94)
  2. add terminate_when_unhealthy and exclude_current_positions_from_observation for most of mujoco envs (#93)
  3. eliminate info["qpos0"] and info["qvel0"] in mujoco env when generating wheel (#93)
  4. add instructions for building this project for mainland China users (#92)
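The two mujoco options in item 2 can be sketched with plain Python, assuming they carry the same meaning as the identically named Gym MuJoCo arguments. The helper functions, the sample values, and the num_position_coords parameter are hypothetical and are not EnvPool's implementation.

```python
from typing import List, Sequence


def build_observation(
    qpos: Sequence[float],
    qvel: Sequence[float],
    exclude_current_positions_from_observation: bool = True,
    num_position_coords: int = 1,  # assumed: how many leading coords to drop
) -> List[float]:
    """Concatenate positions and velocities, optionally dropping the
    leading global position coordinate(s) from the observation."""
    start = num_position_coords if exclude_current_positions_from_observation else 0
    return list(qpos[start:]) + list(qvel)


def is_done(healthy: bool, terminate_when_unhealthy: bool = True) -> bool:
    """End the episode on an unhealthy state only if the flag is set."""
    return terminate_when_unhealthy and not healthy


qpos = [1.5, 0.2, 0.3]  # first entry plays the role of the x position
qvel = [0.1, 0.0, -0.1]

obs_excluded = build_observation(qpos, qvel, True)
obs_full = build_observation(qpos, qvel, False)
done_strict = is_done(healthy=False, terminate_when_unhealthy=True)
done_lenient = is_done(healthy=False, terminate_when_unhealthy=False)
```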

v0.5.1

2 years ago
  • Ant-v4 (with @yufansong #74)
  • HalfCheetah-v4 (#75)
  • Hopper-v4 (#76)
  • Humanoid-v4 (#77)
  • HumanoidStandup-v4 (#78)
  • InvertedDoublePendulum-v4 (@Benjamin-eecs, #83)
  • InvertedPendulum-v4 (#79)
  • Pusher-v4 (#82)
  • Reacher-v4 (#81)
  • Swimmer-v4 (#80)
  • Walker2d-v4 (@Benjamin-eecs, #86)

v0.5.0

2 years ago

We have successfully integrated the Ant-v4 environment based on the newest deepmind/mujoco package! (#74) More environments are coming soon.

Other enhancements: support element-wise bounds and fix the classic_control action space (#67)
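An element-wise bound means each dimension of a space has its own lower and upper limit instead of sharing one scalar pair. A minimal sketch of the idea follows; clip_elementwise and the sample bounds are hypothetical and do not show how EnvPool declares its specs.

```python
from typing import List, Sequence


def clip_elementwise(
    action: Sequence[float],
    low: Sequence[float],
    high: Sequence[float],
) -> List[float]:
    """Clip each action component to its own [low, high] interval."""
    if not (len(action) == len(low) == len(high)):
        raise ValueError("action, low and high must have the same length")
    return [min(max(a, lo), hi) for a, lo, hi in zip(action, low, high)]


# Each dimension carries a different bound.
low = [-1.0, -2.0, 0.0]
high = [1.0, 2.0, 0.5]

clipped = clip_elementwise([1.5, -3.0, 0.25], low, high)
```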

v0.4.5

2 years ago

Enhancement

  1. Add ViZDoom single-player environment (#47, #54)

v0.4.4

2 years ago

Enhancement

  1. add cleanrl example (#48, #53)
  2. add appo example (#51)
  3. export raw reward in atari env with info["reward"] (#50)
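Item 3 matters when the reward returned by the environment has been transformed. The sketch below assumes the transformation is sign clipping, a common Atari preprocessing step. The function process_reward and its reward_clip parameter are hypothetical here, and only the info["reward"] key is taken from the release note.

```python
from typing import Dict, Tuple


def sign(x: float) -> float:
    """Return -1.0, 0.0 or 1.0 according to the sign of x."""
    return float((x > 0) - (x < 0))


def process_reward(raw_reward: float, reward_clip: bool) -> Tuple[float, Dict[str, float]]:
    """Return the (possibly clipped) reward and keep the raw value in info."""
    reward = sign(raw_reward) if reward_clip else raw_reward
    info = {"reward": raw_reward}  # raw reward stays available for logging
    return reward, info


clipped_reward, info = process_reward(200.0, reward_clip=True)
```

With this layout a training loop can learn from the clipped value while reporting the true game score from info["reward"].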

v0.4.3

2 years ago

New Env

Add toy_text (#41, #42, #43, #44, #45, #46)

Enhancement

  1. Add SB3 example (#39)
  2. Add gray_scale option in atari (#37)
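What a gray_scale option does to the observation shape can be sketched as follows: three color channels collapse into one intensity value per pixel. The ITU-R BT.601 luma weights used here are an assumption for illustration, so EnvPool may compute gray values differently, and to_gray is a hypothetical helper.

```python
from typing import List, Tuple

Pixel = Tuple[int, int, int]


def to_gray(frame: List[List[Pixel]]) -> List[List[int]]:
    """Convert rows of (r, g, b) pixels to rows of single gray values."""
    return [
        # Assumed BT.601 weights; not necessarily what EnvPool uses.
        [int(round(0.299 * r + 0.587 * g + 0.114 * b)) for (r, g, b) in row]
        for row in frame
    ]


# A 2x2 RGB frame: black, white, pure red, pure green.
frame = [
    [(0, 0, 0), (255, 255, 255)],
    [(255, 0, 0), (0, 255, 0)],
]

gray = to_gray(frame)
```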