-
Updated
May 29, 2020 - Python
#
proximal-policy-optimization
Here are 67 public repositories matching this topic...
Simple Reinforcement learning tutorials
machine-learning
tutorial
reinforcement-learning
q-learning
dqn
policy-gradient
sarsa
tensorflow-tutorials
a3c
deep-q-network
ddpg
actor-critic
asynchronous-advantage-actor-critic
double-dqn
prioritized-replay
sarsa-lambda
dueling-dqn
deep-deterministic-policy-gradient
proximal-policy-optimization
ppo
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
reinforcement-learning
deep-learning
deep-reinforcement-learning
pytorch
atari
hessian
second-order
continuous-control
actor-critic
ale
mujoco
proximal-policy-optimization
ppo
advantage-actor-critic
a2c
acktr
natural-gradients
roboschool
kfac
kronecker-factored-approximation
-
Updated
Mar 3, 2020 - Python
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
reinforcement-learning
deep-reinforcement-learning
pytorch
generative-adversarial-network
policy-gradient
trpo
fisher-vectors
pytorch-rl
proximal-policy-optimization
ppo
a2c
-
Updated
Apr 23, 2020 - Python
lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.
python
machine-learning
research
reinforcement-learning
deep-learning
deep-reinforcement-learning
pytorch
artificial-intelligence
policy-gradient
ddpg
sac
cem
cmaes
evolution-strategies
mujoco
deep-deterministic-policy-gradient
proximal-policy-optimization
ppo
td3
soft-actor-critic
-
Updated
Feb 26, 2020 - Jupyter Notebook
This repository contains most of pytorch implementation based classic deep reinforcement learning algorithms, including - DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, TRPO. (More algorithms are still in progress)
algorithm
deep-learning
atari2600
flappy-bird
deep-reinforcement-learning
pytorch
dqn
ddpg
sac
actor-critic
trpo
dueling-dqn
trust-region-policy-optimization
proximal-policy-optimization
ppo
a2c
soft-actor-critic
-
Updated
Nov 15, 2019 - Python
Omegastick
commented
Oct 18, 2019
Some time around ae030395f56efca50a51335fe4f3367caf950066 we regressed and the example code in gym_client.cpp doesn't converge any more. Presumably because of some difference in our observation normalization compared to the OpenAI Baselines one.
I'll look in more detail this weekend and confirm if it's that exact commit causing the problem.
PyTorch implementations of various Deep Reinforcement Learning (DRL) algorithms for both single agent and multi-agent.
deep-reinforcement-learning
pytorch
multi-agent
deep-q-network
ddpg
actor-critic
deep-deterministic-policy-gradient
proximal-policy-optimization
ppo
advantage-actor-critic
a2c
acktr
-
Updated
Nov 11, 2017 - Python
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
deep-reinforcement-learning
pytorch
policy-gradient
reinforcement-learning-algorithms
pytorch-tutorial
proximal-policy-optimization
ppo
pytorch-implmention
-
Updated
Jan 25, 2020 - Python
Implementing reinforcement-learning algorithms for pysc2 -environment
python
reinforcement-learning
tensorflow
deepmind
proximal-policy-optimization
ppo
starcraft2
a2c
pysc2
-
Updated
Dec 12, 2017 - Python
Trading Environment(OpenAI Gym) + PPO(TensorForce)
-
Updated
Mar 30, 2020 - Python
Curiosity-driven Exploration by Self-supervised Prediction
reinforcement-learning
pytorch
icm
proximal-policy-optimization
advantage-actor-critic
curiosity-driven
-
Updated
Apr 28, 2020 - Python
PyTorch implementation of Proximal Policy Optimization
machine-learning
reinforcement-learning
deep-learning
cuda
openai-gym
pytorch
proximal-policy-optimization
-
Updated
Dec 20, 2017 - Python
reinforcement-learning
tensorflow
relational-networks
proximal-policy-optimization
ppo
explainable-ai
self-attention
-
Updated
Apr 15, 2019 - Python
Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)
reinforcement-learning
deep-learning
pytorch
icm
proximal-policy-optimization
ppo
mountaincar-v0
cartpole-v1
intrinsic-curiosity-module
generalized-advantage-estimation
pendulum-v0
-
Updated
Jan 12, 2019 - Python
Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms: A2C and PPO
reinforcement-learning
deep-reinforcement-learning
pytorch
recurrent-neural-networks
multi-process
a3c
minigrid
recurrent
actor-critic
proximal-policy-optimization
ppo
advantage-actor-critic
a2c
reward-shaping
-
Updated
Jul 25, 2019 - Python
Implementation of Generatve Adversarial Imitation Learning (GAIL) for classic environments from OpenAI Gym.
reinforcement-learning
tensorflow
openai-gym
pytorch
behavioral-cloning
generative-adversarial-networks
imitation-learning
biped
proximal-policy-optimization
ppo
gail
gym-biped
-
Updated
Dec 26, 2018 - Python
Policy Optimization with Penalized Point Probability Distance: an Alternative to Proximal Policy Optimization
-
Updated
Nov 8, 2018 - Python
This is an pytorch implementation of Distributed Proximal Policy Optimization(DPPO).
-
Updated
Jul 30, 2018 - Python
Implementation of Scheduled Policy Optimization for task-oriented language grouding
-
Updated
Jul 16, 2018 - ASP
It's the pytorch implementation of google research football.
-
Updated
Jun 14, 2019 - Python
deep-learning
deep-q-learning
proximal-policy-optimization
ppo
advantage-actor-critic
a2c
pysc2
pysc2-mini-games
reinfrocement-learning
-
Updated
Jul 14, 2018 - Python
OpenAI Gym compatible reinforcement learning environment for Space Fortress https://arxiv.org/abs/1809.02206
benchmark
reinforcement-learning
deep-learning
deep-reinforcement-learning
openai-gym
rainbow
pytorch
dqn
gym-environment
proximal-policy-optimization
ppo
a2c
space-fortress
gym-application
-
Updated
Apr 29, 2019 - Python
RLbox: Solving OpenAI Gym with TensorFlow
tensorflow
deep-reinforcement-learning
openai-gym
dqn
atari
continuous-control
mujoco
deep-rl
proximal-policy-optimization
ppo
-
Updated
Apr 19, 2018 - Python
Proximal Policy Optimization with Tensorflow 2.0
reinforcement-learning
policy-gradient
reinforcement-learning-algorithms
proximal-policy-optimization
ppo
ppo2
tensorflow2
-
Updated
Oct 14, 2019 - Python
Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some explanation
reinforcement-learning
deep-reinforcement-learning
pytorch
gym
frozenlake-v0
proximal-policy-optimization
cartpole-v0
lunar-lander
random-network-distillation
bipedalwalker
ppo-rnd
frozenlake-not-slippery
-
Updated
Dec 9, 2019 - Python
Udacity Deep Reinforcement Learning Nanodegree Program
machine-learning
reinforcement-learning
deep-reinforcement-learning
pytorch
proximal-policy-optimization
-
Updated
Jul 12, 2019 - ASP
reinforcement-learning
generative-adversarial-network
imitation-learning
proximal-policy-optimization
gail
-
Updated
Sep 3, 2019 - Python
Reinforcement learning agent using Proximal Policy Optimization (PPO) and Unity
python
machine-learning
reinforcement-learning
deep-learning
unity
tensorflow
machine-learning-algorithms
game-development
neural-networks
proximal-policy-optimization
-
Updated
Jan 26, 2019 - C#
Improve this page
Add a description, image, and links to the proximal-policy-optimization topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the proximal-policy-optimization topic, visit your repo's landing page and select "manage topics."


Hi, I am trying to use the PPO algorithm; however, it's not clear how to construct the stochastic policy. Should I use the Gaussian policy network?
Cool library by the way; I like the modularity!