Reinforcement Learning for WeChat Jump
End-to-end training of a WeChat Jump AI using the DDPG algorithm
Actor
Critic
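As a rough illustration of how the actor and critic interact in DDPG, the sketch below shows two standard update rules: the TD target the critic regresses toward, and the soft update that makes the target networks slowly track the online ones. The function names and the values of `gamma` and `tau` are assumptions for illustration and are not taken from this repository's train.py.

```python
def td_target(reward, next_q, done, gamma=0.99):
    """Critic regression target: r + gamma * Q_target(s', actor_target(s')).

    next_q is the target critic's value for the next state, evaluated at the
    target actor's action. gamma=0.99 is an assumed discount factor.
    """
    # No bootstrapping from the next state once the episode has ended.
    not_done = 0.0 if done else 1.0
    return reward + gamma * not_done * next_q


def soft_update(target_weights, online_weights, tau=0.005):
    """Polyak averaging: theta_target <- tau * theta + (1 - tau) * theta_target.

    Weights are plain lists of floats here; tau=0.005 is an assumed value.
    """
    return [tau * w + (1.0 - tau) * t
            for t, w in zip(target_weights, online_weights)]
```

In a real training loop the same two rules would be applied to the network tensors of whatever framework the project uses, once per gradient step.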
std=0.2
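The source does not say what std=0.2 refers to. If it is the standard deviation of Gaussian exploration noise added to the actor's output (a common choice in DDPG, but an assumption here), the step might look like the minimal sketch below. The function name `noisy_action` and the action range [0, 1] are hypothetical.

```python
import random


def noisy_action(action, std=0.2, low=0.0, high=1.0, rng=random):
    """Add zero-mean Gaussian noise to a scalar action, then clip to [low, high].

    std=0.2 mirrors the value listed above; low/high are an assumed
    normalized action range (for example, a scaled press duration).
    """
    noisy = action + rng.gauss(0.0, std)
    # Clip so the perturbed action stays inside the valid range.
    return max(low, min(high, noisy))
```

Noise of this kind is normally applied only while collecting training experience; at inference time the actor's output would be used directly.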
python train.py
python infer.py