A minimal PPO (Proximal Policy Optimization) implementation that trains a two-headed neural network to solve the CartPole problem.
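For readers new to PPO, its core is the clipped surrogate objective, which limits how far a single update can move the policy away from the one that collected the data. The sketch below is an illustration only and is not taken from this repository: the function name `ppo_clip_loss`, its argument names, and the default `clip_eps=0.2` are assumptions chosen for the example. It uses plain Python lists so it runs without any deep-learning library.

```python
import math


def ppo_clip_loss(new_logp, old_logp, advantages, clip_eps=0.2):
    """Return the mean negative clipped surrogate objective for a batch.

    new_logp / old_logp: log-probabilities of the taken actions under the
    current policy and under the policy that collected the data.
    advantages: advantage estimates for the same actions.
    clip_eps: clipping range (hypothetical default, not from the repo).
    """
    total = 0.0
    for nl, ol, adv in zip(new_logp, old_logp, advantages):
        # Probability ratio pi_new(a|s) / pi_old(a|s), computed in log space.
        ratio = math.exp(nl - ol)
        # Restrict the ratio to the interval [1 - eps, 1 + eps].
        clipped = max(1.0 - clip_eps, min(1.0 + clip_eps, ratio))
        # Take the more pessimistic of the unclipped and clipped terms.
        total += min(ratio * adv, clipped * adv)
    # Negate so that minimizing the loss maximizes the objective.
    return -total / len(advantages)
```

In a two-headed setup, one head commonly outputs action probabilities (the source of the log-probabilities above) and the other a state-value estimate used to compute advantages; whether this repository splits its heads that way is an assumption here.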