🔬 Research Framework for Single and Multi-Players 🎰 Multi-Arms Bandits...
A hyperparameter optimization framework, inspired by Optuna.
Yahoo! news article recommendation system by linUCB
My solutions to Yandex Practical Reinforcement Learning course in PyTorc...