Gitstar Ranking
Users
Organizations
Repositories
Rankings
Users
Organizations
Repositories
Sign in with GitHub
iyaja
Fetched on 2025/01/07 20:32
iyaja
/
Nonstationary-k-arm-Bandits
Python code for a basic RL solution for the Non-stationary (action value function changes with time) k-arm bandit problem. Based on the book "Reinforcement learning: An introduction" by S.Sutton and Andrew G. Barto -
View it on GitHub
Star
3
Rank
2834093