SAC-N JAX

SAC with Q-Ensemble for Offline RL

Single-file SAC-N [1] implementation in JAX, with both Flax and Equinox variants. ~10x faster than the PyTorch SAC-N implementation from CORL [2].
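The core idea of SAC-N is to replace SAC's twin critics with an ensemble of N critics and use the minimum Q-value over the ensemble as the target. A minimal sketch of that idea in JAX (toy linear critics for illustration; the names here are hypothetical, not taken from the repo, where the critics are batched neural networks):

```python
import jax
import jax.numpy as jnp

def q_value(params, state_action):
    # One toy linear "critic": Q(s, a) = w . [s, a] + b
    w, b = params
    return state_action @ w + b

def ensemble_min_q(ensemble_params, state_action):
    # vmap evaluates all N critics in parallel on the same input,
    # then we take the pessimistic minimum over the ensemble
    qs = jax.vmap(q_value, in_axes=(0, None))(ensemble_params, state_action)
    return qs.min(axis=0)

num_critics, dim = 10, 4
key = jax.random.PRNGKey(0)
w = jax.random.normal(key, (num_critics, dim))  # per-critic weights, stacked
b = jnp.zeros((num_critics,))
sa = jnp.ones((dim,))
target = ensemble_min_q((w, b), sa)
```

Stacking the per-critic parameters along a leading axis and `vmap`-ing over them is what makes `--num_critics` cheap to scale.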

And still easy to use and understand! To run:

```bash
python sac_n_jax_flax.py --env_name="halfcheetah-medium-v2" --num_critics=10 --batch_size=256
python sac_n_jax_eqx.py --env_name="halfcheetah-medium-v2" --num_critics=10 --batch_size=256
```

Optionally, you can pass --config_path pointing to a YAML config file; for more details, see the pyrallis docs.
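As an assumption based on the CLI flags above, such a YAML config might look like the following (the field names are hypothetical; check the config dataclass in the scripts for the actual ones):

```yaml
# Hypothetical config; field names mirror the CLI flags shown above
env_name: "halfcheetah-medium-v2"
num_critics: 10
batch_size: 256
```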

Speed comparison

The main insight here is to jit the entire epoch loop with jax.lax.fori_loop or jax.lax.scan, not just a single network update, as is usually done (in jaxrl2, for instance). Jitting only the update step yields roughly a 1.5x speedup here.
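A minimal, self-contained sketch of this pattern (the update here is a toy stand-in for one SAC-N gradient step, not the repo's actual update function):

```python
import jax
import jax.numpy as jnp

def update_step(params, batch):
    # Toy "update": move a scalar parameter toward the batch mean.
    # Returns (new_params, per-step metric), the shape scan expects.
    m = batch.mean()
    return params + 0.01 * (m - params), jnp.abs(m - params)

@jax.jit
def train_epoch(params, batches):
    # jax.lax.scan compiles the whole loop over batches into one XLA
    # program, avoiding a Python <-> device round-trip on every update.
    def body(carry, batch):
        new_params, loss = update_step(carry, batch)
        return new_params, loss
    params, losses = jax.lax.scan(body, params, batches)
    return params, losses

params = jnp.zeros(())
batches = jnp.ones((100, 256))  # 100 updates per epoch, batch size 256
params, losses = train_epoch(params, batches)
```

Wrapping only `update_step` in `jax.jit` would still dispatch each update from Python; scanning over the batch axis inside a single jitted function is what removes that per-step overhead.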

Both runs were trained on the same V100 GPU.

(Figures: evaluation return vs. training epochs, and return vs. wall-clock time.)

References

  1. Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble [code]
  2. Research-oriented Deep Offline Reinforcement Learning Library [code]
README source: Howuhh/sac-n-jax (MIT license).