Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Appearance settings

NickKaparinos/Stanford-CS-234-RL-2022

Open more actions menu

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

19 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Stanford-CS-234-RL-2022

Solutions to the Stanford CS:234 Reinforcement Learning 2022 course assignments.

Course website: https://web.stanford.edu/class/cs234/

Assignment 1

Frozen Lake Markov Decision Process using Value Iteration and Policy Iterasion

         Policy Iteration                   Value Iteration          

Assignment 2

Tabular Q Learning and Deep Q Learning

Learning Curve on the test environment:

Assignment 3

Policy Gradient Methods and REINFORCE

Learning Curve of the REINFORCE algorithm on CartPole-v0:

Assignment 4

Aplication of Bandit Algorithms in the medical setting

Comparison of different Bandit Algorithms:

Assignment 5

Aplication of Upper Confidence Bandit in personalized Recomendation Systems

Comparison of different arm update strategies:

About

Solutions to the Stanford CS:234 Reinforcement Learning 2022 course assignments.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

Morty Proxy This is a proxified and sanitized view of the page, visit original site.