Skip to content

Implementations of RL algorithms using tensorflow based off of OpenAI spinningup tutorials

Notifications You must be signed in to change notification settings

Jjschwartz/rlalgs

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

55 Commits
 
 
 
 
 
 
 
 

Repository files navigation

RL Algorithms

A collection of implementations of RL algorithms in Python. Developed for my own personal learning as I work through papers and tutorials.

Algorithms implemented

  1. Simple Policy gradient:
  • using only a policy network and no advantage function
  • also implemented using reward-to-go
  1. Vanilla Policy Gradient
  • using reward-to-go, simple advantage function (Q(s, a) - V(s)) and GAE
  1. Deep Q-network with experience replay
  • Based off of the original DQN paper (Mnih et al (2013))
  1. Synchronous Actor Critic (A2C)

Resources used

About

Implementations of RL algorithms using tensorflow based off of OpenAI spinningup tutorials

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages