Jjschwartz / rlalgs Public

Implementations of RL algorithms in TensorFlow, based on the OpenAI Spinning Up tutorials

RL Algorithms

A collection of RL algorithm implementations in Python, developed for my own learning as I work through papers and tutorials.

Algorithms implemented

Simple Policy Gradient

uses only a policy network, with no advantage function
also implemented with reward-to-go weighting
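
The difference between the two variants is only in how each log-probability term is weighted. A minimal sketch in plain Python, not the code from this repository: the function names `full_return_weights` and `reward_to_go`, and the `gamma` discount parameter, are assumptions made for illustration.

```python
def full_return_weights(rewards):
    """Weight every step by the total episode return (plain simple policy gradient)."""
    total = float(sum(rewards))
    return [total] * len(rewards)


def reward_to_go(rewards, gamma=1.0):
    """Weight each step only by the (optionally discounted) rewards that follow it."""
    weights = [0.0] * len(rewards)
    running = 0.0
    # Walk backwards so each step reuses the sum already computed for the next step.
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        weights[t] = running
    return weights
```

For an episode with rewards `[1.0, 1.0, 1.0]`, `full_return_weights` gives `[3.0, 3.0, 3.0]` while `reward_to_go` gives `[3.0, 2.0, 1.0]`, so later actions are no longer credited for rewards collected before they were taken.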

Vanilla Policy Gradient

uses reward-to-go, a simple advantage estimate (Q(s, a) - V(s)), and GAE (Generalized Advantage Estimation)
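
As a rough illustration of the GAE option, the sketch below computes advantages for a single trajectory from rewards and value estimates. It is a plain-Python sketch under stated assumptions, not the repository's implementation: the function name `gae_advantages`, the `last_value` bootstrap argument, and the default `gamma` and `lam` values are all hypothetical.

```python
def gae_advantages(rewards, values, last_value=0.0, gamma=0.99, lam=0.95):
    """Generalized Advantage Estimation for one trajectory.

    rewards[t] and values[t] are the reward and the estimate V(s_t) at step t.
    last_value is V of the state after the final step (0.0 if the episode ended).
    """
    advantages = [0.0] * len(rewards)
    next_value = last_value
    running = 0.0
    for t in reversed(range(len(rewards))):
        # One-step TD residual: r_t + gamma * V(s_{t+1}) - V(s_t)
        delta = rewards[t] + gamma * next_value - values[t]
        # Exponentially weighted sum of the residuals from step t onwards
        running = delta + gamma * lam * running
        advantages[t] = running
        next_value = values[t]
    return advantages
```

With `lam=0` this reduces to the one-step TD residual, and with `lam=1` it becomes the discounted reward-to-go minus V(s), so `lam` trades bias against variance between those two estimates.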

Deep Q-network with experience replay

Based on the original DQN paper (Mnih et al., 2013)
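
To make the experience replay idea concrete, here is a minimal sketch of a fixed-size buffer with uniform sampling, plus the one-step Q-learning target that sampled transitions are trained against. The class name `ReplayBuffer`, its methods, and the helper `td_target` are assumptions for illustration and are not taken from this repository.

```python
import random
from collections import deque


class ReplayBuffer:
    """Fixed-size store of transitions; the oldest entries are dropped when full."""

    def __init__(self, capacity, seed=None):
        self._buffer = deque(maxlen=capacity)
        self._rng = random.Random(seed)

    def store(self, obs, action, reward, next_obs, done):
        self._buffer.append((obs, action, reward, next_obs, done))

    def sample(self, batch_size):
        # Uniform sampling without replacement breaks the correlation
        # between consecutive transitions.
        batch = self._rng.sample(list(self._buffer), batch_size)
        # Regroup into (obs, actions, rewards, next_obs, dones) tuples.
        return tuple(zip(*batch))

    def __len__(self):
        return len(self._buffer)


def td_target(reward, next_q_values, done, gamma=0.99):
    """One-step target: r if the episode ended, else r + gamma * max_a' Q(s', a')."""
    if done:
        return reward
    return reward + gamma * max(next_q_values)
```

In a training loop, the agent would call `store` after every environment step and, once the buffer holds at least `batch_size` transitions, call `sample` to build each minibatch.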

Synchronous Actor Critic (A2C)

Resources used

OpenAI Spinning Up
