sungsulim/RLControl


RLControl

Implementation of Continuous Control RL Algorithms.

Repository used for our paper Actor-Expert: A Framework for using Q-learning in Continuous Action Spaces.

webpage: https://sites.google.com/ualberta.ca/actorexpert

Available Algorithms

Installation

Create a virtual environment and install the necessary packages with "pip3 install -r requirements.txt"

Usage

Settings for the available environments and agents are provided in the jsonfiles/ directory.

Example:

ENV=Pendulum-v0 (must match jsonfiles/environment/*.json name)

AGENT=ddpg (must match jsonfiles/agent/*.json name)

INDEX=0 (useful for running sweeps over different settings and doing multiple runs)

Run: python3 main.py --env_json jsonfiles/environment/$ENV.json --agent_json jsonfiles/agent/$AGENT.json --index $INDEX

(--render and --monitor are optional flags for visualizing or monitoring the agent's training, and are only available for OpenAI Gym or MuJoCo environments. --write_plot is also available to plot the learned action-values and policy on the Bimodal1DEnv domain.)
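When running several indices (for sweeps or repeated runs), it can help to generate the command lines rather than type them. The following is a minimal sketch, not part of the repository: `build_command` is a hypothetical helper, and it only assembles the arguments shown in the Run line above.

```python
import shlex


def build_command(env, agent, index, render=False):
    """Assemble the main.py invocation from the Run line as an argument list.

    `build_command` is a hypothetical helper, not something the repository provides.
    """
    cmd = [
        "python3", "main.py",
        "--env_json", f"jsonfiles/environment/{env}.json",
        "--agent_json", f"jsonfiles/agent/{agent}.json",
        "--index", str(index),
    ]
    if render:
        # --render is optional and only meaningful for Gym or MuJoCo environments.
        cmd.append("--render")
    return cmd


if __name__ == "__main__":
    # Print one command per index; the range of indices here is an arbitrary example.
    for index in range(3):
        print(shlex.join(build_command("Pendulum-v0", "ddpg", index)))
```

Each returned list can be passed directly to `subprocess.run` if the commands should be launched from Python instead of printed.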

  • ENV.json is used to specify evaluation settings:

    • TotalMilSteps: Total number of training steps to run (in millions)
    • EpisodeSteps: Number of steps in an episode (use -1 for the environment's default setting)
    • EvalIntervalMilSteps: Number of training steps between evaluations (in millions)
    • EvalEpisodes: Number of episodes to run in a single evaluation
  • AGENT.json is used to specify agent hyperparameter settings:

    • norm: type of normalization to use
    • exploration_policy: "ou_noise" or "none" (use "none" if the algorithm has its own exploration mechanism)
    • actor/critic l1_dim, l2_dim: layer dimensions
    • learning rate
    • other algorithm specific settings
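To make the units of the evaluation settings concrete, here is a small sketch that converts the "million step" fields into raw step counts. The key names come from the list above; the flat JSON layout, the example values, and the `default_episode_steps` argument are assumptions for illustration, and the actual files in jsonfiles/environment/ may be structured differently.

```python
import json

# Hypothetical settings text: key names are from the README, values are made up.
env_settings_text = """
{
  "TotalMilSteps": 1.0,
  "EpisodeSteps": -1,
  "EvalIntervalMilSteps": 0.01,
  "EvalEpisodes": 5
}
"""


def summarize(settings, default_episode_steps):
    """Convert million-step settings into raw step counts.

    `default_episode_steps` stands in for whatever default the environment
    defines; it is an assumption of this sketch.
    """
    total_steps = int(round(settings["TotalMilSteps"] * 1_000_000))
    eval_interval = int(round(settings["EvalIntervalMilSteps"] * 1_000_000))
    episode_steps = settings["EpisodeSteps"]
    if episode_steps == -1:
        # -1 means "use the default setting".
        episode_steps = default_episode_steps
    return {
        "total_steps": total_steps,
        "eval_interval_steps": eval_interval,
        "episode_steps": episode_steps,
        "eval_intervals": total_steps // eval_interval,
        "eval_episodes": settings["EvalEpisodes"],
    }


if __name__ == "__main__":
    settings = json.loads(env_settings_text)
    print(summarize(settings, default_episode_steps=200))
```

With these example values, one million training steps are split into intervals of 10,000 steps, and each evaluation runs 5 episodes.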
