iyaja/Nonstationary-k-arm-Bandits

iyaja

Fetched on 2026/07/13 21:45

Python code for a basic RL solution to the non-stationary k-armed bandit problem, in which the action values change over time. Based on the book "Reinforcement Learning: An Introduction" by Richard S. Sutton and Andrew G. Barto.
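The description does not show the repository's code, so the following is only a minimal sketch of one common approach to this problem: epsilon-greedy action selection with a constant step-size update, which weights recent rewards more heavily and can therefore track drifting action values. The function name `run_bandit`, all parameter names and defaults, and the Gaussian random-walk drift model are assumptions made for illustration, not taken from the repository.

```python
import random


def run_bandit(k=10, steps=10_000, epsilon=0.1, alpha=0.1, walk_std=0.01, seed=0):
    """Illustrative epsilon-greedy agent on a drifting k-armed bandit.

    All names and defaults here are hypothetical; this is a sketch,
    not the repository's implementation.
    """
    rng = random.Random(seed)
    q_true = [0.0] * k  # true action values, which drift over time
    q_est = [0.0] * k   # the agent's current estimates
    optimal_picks = 0

    for _ in range(steps):
        # Explore with probability epsilon, otherwise act greedily.
        if rng.random() < epsilon:
            action = rng.randrange(k)
        else:
            action = max(range(k), key=lambda a: q_est[a])

        # Count how often the agent picked the currently best arm.
        best = max(range(k), key=lambda a: q_true[a])
        optimal_picks += int(action == best)

        # Reward is the true value plus unit-variance Gaussian noise.
        reward = rng.gauss(q_true[action], 1.0)

        # Constant step size: an exponential recency-weighted average.
        q_est[action] += alpha * (reward - q_est[action])

        # Non-stationarity (assumed model): each true value takes an
        # independent random-walk step after every pull.
        for a in range(k):
            q_true[a] += rng.gauss(0.0, walk_std)

    return q_est, q_true, optimal_picks / steps


if __name__ == "__main__":
    estimates, truth, frac_optimal = run_bandit()
    print(f"fraction of optimal actions: {frac_optimal:.3f}")
```

Replacing the constant `alpha` with a sample-average step size (1 divided by the number of times the arm was pulled) gives every past reward equal weight, which is generally a poor fit once the true values start to drift.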

Rank

3388333
