Tutorial by Examples

Minimal Example

Q-learning is a model-free reinforcement learning method. The agent estimates how good each (state, action) pair is so that it can choose a good action in every state. It does this by approximating an action-value function (Q) that satisfies the equation below: Where s and a are ...
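To make the idea of estimating the value of (state, action) pairs concrete, here is a minimal tabular sketch in plain Python. It uses the standard textbook update Q(s, a) <- Q(s, a) + alpha * (r + gamma * max_a' Q(s', a') - Q(s, a)), which may differ in notation from the equation this excerpt refers to. The five-state chain environment, the function names (step, train, greedy_policy), and the hyperparameter values are all assumptions made up for illustration; none of them come from the original tutorial.

```python
import random

# Hypothetical toy problem: a chain of states 0..4, where state 4 is terminal.
N_STATES = 5
ACTIONS = (0, 1)   # 0 = move left, 1 = move right
ALPHA = 0.1        # learning rate (assumed value)
GAMMA = 0.9        # discount factor (assumed value)
EPSILON = 0.1      # exploration probability (assumed value)


def step(state, action):
    """Toy environment: reward 1.0 only when the terminal state is reached."""
    next_state = max(0, state - 1) if action == 0 else state + 1
    done = next_state == N_STATES - 1
    reward = 1.0 if done else 0.0
    return next_state, reward, done


def train(episodes=500, seed=0):
    """Run tabular Q-learning and return the table q[state][action]."""
    rng = random.Random(seed)
    q = [[0.0 for _ in ACTIONS] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            # Epsilon-greedy: explore with probability EPSILON, else act greedily
            # (ties between equally valued actions are broken at random).
            if rng.random() < EPSILON:
                action = rng.choice(ACTIONS)
            else:
                best = max(q[state])
                action = rng.choice([a for a in ACTIONS if q[state][a] == best])
            next_state, reward, done = step(state, action)
            # A terminal state contributes no future value to the target.
            target = reward if done else reward + GAMMA * max(q[next_state])
            # Move the current estimate a fraction ALPHA toward the target.
            q[state][action] += ALPHA * (target - q[state][action])
            state = next_state
    return q


def greedy_policy(q):
    """Best-known action for every non-terminal state."""
    return [max(ACTIONS, key=lambda a: q[s][a]) for s in range(N_STATES - 1)]


if __name__ == "__main__":
    table = train()
    print(greedy_policy(table))
```

On this toy chain the learned greedy policy should choose "move right" in every non-terminal state, since that is the only way to reach the reward. A table works here because the state space is tiny; with larger or continuous state spaces the table would typically be replaced by a function approximator such as a neural network.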

tensorflow • Q-learning

