dunovank/pytorch-a2c-ppo-acktr-gail

dunovank

Fetched on 2026/07/13 18:34

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL). - View it on GitHub

Star

Rank

14117806

dunovank

dunovank / pytorch-a2c-ppo-acktr-gail