Model-based Policy Gradients
reinforcement-learning openai-gym pytorch computation-graph gym policy-gradient finite-difference backpropagation computational-graphs mujoco model-based ilqg ilqr ilqg-mujoco mujoco-py policy-gradients policy-optimization direct-policy-search mujoco-dynamics
-
Updated
Mar 12, 2020 - Python