S. L. Brunton (2022, January 21), Overview of Deep Reinforcement Learning Methods. Disponível em: https://doi.org/10.52843/cassyni.kfnzpy. Acesso em 14 de março de 2024.
Andrej Karpathy. Deep Reinforcement Learning: Pong from Pixels. Andrej Karpathy blog, 2016.
Sanyam Kapoor. Police Gradients in a Nutshell. Towards Data Science, 2018.
Intro to Policy Optimization. OpenAI Spinning Up.