Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations
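To make "exact calculations" concrete, here is a minimal sketch of exact policy evaluation when the transition dynamics are available as an array. It uses plain NumPy rather than emdp's own API. The three-state chain, the array names `P`, `R`, and `pi`, and the discount `gamma` are all assumptions made for illustration.

```python
import numpy as np

# Hypothetical 3-state chain with 2 actions (0 = left, 1 = right).
# This is an illustrative setup, not emdp's API.
n_states, n_actions, gamma = 3, 2, 0.9

# P[s, a, t] = probability of landing in state t after taking action a in state s.
P = np.zeros((n_states, n_actions, n_states))
for s in range(n_states):
    P[s, 0, max(s - 1, 0)] = 1.0             # move left, clipped at the wall
    P[s, 1, min(s + 1, n_states - 1)] = 1.0  # move right, clipped at the wall

# R[s, a]: reward 1 for taking "right" in the last state, 0 otherwise.
R = np.zeros((n_states, n_actions))
R[n_states - 1, 1] = 1.0

# Deterministic policy "always go right", stored as pi[s, a].
pi = np.zeros((n_states, n_actions))
pi[:, 1] = 1.0

# Policy-averaged dynamics and rewards.
P_pi = np.einsum("sa,sat->st", pi, P)
r_pi = np.einsum("sa,sa->s", pi, R)

# Exact value function: solve (I - gamma * P_pi) V = r_pi.
V = np.linalg.solve(np.eye(n_states) - gamma * P_pi, r_pi)
print(V)
```

Because the dynamics are known, the value function comes from a single linear solve rather than from sampled rollouts.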
- Fixed a bug in utils.convert_one_hot_to_int that caused the downcasting of an action. This function now returns a normal int as opposed to a np.int8 (commit 59c20641eecfc5d23076f8bee6805f6101ba0a2d).
- gym support, so that emdp works out of the box with algorithms that are compatible with OpenAI gym. #12
- torch-compatible analytic functions, to make them differentiable. #14
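
The int versus np.int8 distinction noted above can be illustrated with a small sketch. The helper `one_hot_to_int` below is a hypothetical stand-in, not emdp's implementation. It only shows one way to guarantee that a built-in `int` comes back from a one-hot action vector, whatever the dtype of the input.

```python
import numpy as np

def one_hot_to_int(one_hot):
    """Hypothetical stand-in (not emdp's code): index of the single
    non-zero entry of a one-hot vector, returned as a built-in int."""
    arr = np.asarray(one_hot)
    if arr.ndim != 1 or np.count_nonzero(arr) != 1:
        raise ValueError("expected a 1-D vector with exactly one non-zero entry")
    # np.argmax returns a NumPy integer scalar; int() converts it to a Python int.
    return int(np.argmax(arr))

# Even with an int8 input, the result is a plain Python int.
action = one_hot_to_int(np.array([0, 0, 1, 0], dtype=np.int8))
print(action, type(action))
```

Returning a built-in `int` avoids carrying a fixed-width type such as `np.int8`, which cannot represent values above 127, into downstream code.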