Gitstar Ranking
Users
Organizations
Repositories
Rankings
Users
Organizations
Repositories
Sign in with GitHub
iyaja
Fetched on 2026/03/14 10:11
iyaja
/
Nonstationary-k-arm-Bandits
Python code for a basic RL solution for the Non-stationary (action value function changes with time) k-arm bandit problem. Based on the book "Reinforcement learning: An introduction" by S.Sutton and Andrew G. Barto -
View it on GitHub
Star
3
Rank
3279731