Let
and
then
Construct
Bellman, Richard. “The Theory of Dynamic Programming.” Bulletin of the American Mathematical Society 60 (1954): 503-515.
Brunton, Steve. “Model Based Reinforcement Learning: Policy Iteration, Value Iteration, and Dynamic Programming.”
---

<!-- _paginate: false -->
---

<style scoped>
h1 { /* text-align: center; */ color: #ffffff }
h3 { /* text-align: center; */ color: #dddddd }
</style>

![bg](styles/bg_inteli_01.png)

### Reflection

# The interest on knowledge
---

<style scoped>
h1 { /* text-align: center; */ color: #ffffff }
</style>

![bg](styles/bg_inteli_01.png)

# What if the value cannot be computed directly?