The Wayback Machine - https://web.archive.org/web/20200530102843/https://github.com/topics/proximal-policy-optimization
Skip to content
#

proximal-policy-optimization

Here are 67 public repositories matching this topic...

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

  • Updated Mar 3, 2020
  • Python

This repository contains most of pytorch implementation based classic deep reinforcement learning algorithms, including - DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, TRPO. (More algorithms are still in progress)

  • Updated Nov 15, 2019
  • Python

Improve this page

Add a description, image, and links to the proximal-policy-optimization topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the proximal-policy-optimization topic, visit your repo's landing page and select "manage topics."

Learn more

You can’t perform that action at this time.
Morty Proxy This is a proxified and sanitized view of the page, visit original site.