General Info
What
A policy is a function that takes a state and returns the action to take in that state.
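As a minimal sketch of this definition, a policy can be written as an ordinary function from states to actions, or equivalently as a lookup table when the state set is small. The 1-D track, the `GOAL` position, and the action names below are hypothetical and chosen only for illustration.

```python
from typing import Callable, Dict

# Hypothetical toy setting: states are positions 0..4 on a 1-D track,
# actions are the strings "left", "right", and "stay".
State = int
Action = str
Policy = Callable[[State], Action]

GOAL = 3  # hypothetical goal position


def toward_goal(state: State) -> Action:
    """Deterministic policy: always move toward GOAL, stay once there."""
    if state < GOAL:
        return "right"
    if state > GOAL:
        return "left"
    return "stay"


# The same policy written as a table: one action per state.
table_policy: Dict[State, Action] = {
    0: "right",
    1: "right",
    2: "right",
    3: "stay",
    4: "left",
}
```

Both forms answer the same question, "which action in this state?". The function form scales to large or continuous state spaces, while the table form is convenient when every state can be enumerated.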
Used For
Policies are used in Markov decision processes (MDPs) to specify which action the agent should take in each state.
Optimal Policy
What
An optimal policy is a policy that maximizes the expected utility the agent obtains by following it.
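One common way to find such a policy on a small MDP is value iteration followed by a greedy action choice. The sketch below is illustrative only and rests on several assumptions not stated above: utility is taken to be the discounted sum of rewards, the four-state track, the reward of 1 for reaching the terminal state, and the discount `GAMMA = 0.9` are hypothetical, and transitions are deterministic, so the expectation is trivial here.

```python
GAMMA = 0.9                  # hypothetical discount factor
STATES = [0, 1, 2, 3]        # hypothetical positions on a 1-D track
ACTIONS = ["left", "right"]
TERMINAL = 3                 # the episode ends on reaching this state


def step(state, action):
    """Deterministic transition model: returns (next_state, reward)."""
    if action == "left":
        nxt = max(0, state - 1)
    else:
        nxt = min(TERMINAL, state + 1)
    reward = 1.0 if nxt == TERMINAL else 0.0
    return nxt, reward


def q_value(state, action, values):
    """Utility of taking `action` in `state`, then following `values`."""
    nxt, reward = step(state, action)
    return reward + GAMMA * values[nxt]


def value_iteration(tol=1e-9):
    """Repeat Bellman optimality updates until the values stop changing."""
    values = {s: 0.0 for s in STATES}
    while True:
        delta = 0.0
        for s in STATES:
            if s == TERMINAL:
                continue  # the terminal state keeps value 0
            best = max(q_value(s, a, values) for a in ACTIONS)
            delta = max(delta, abs(best - values[s]))
            values[s] = best
        if delta < tol:
            return values


def greedy_policy(values):
    """Pick, in each non-terminal state, the action with the highest utility."""
    return {
        s: max(ACTIONS, key=lambda a: q_value(s, a, values))
        for s in STATES
        if s != TERMINAL
    }


values = value_iteration()
optimal_policy = greedy_policy(values)
```

In this toy problem the resulting policy moves right from every non-terminal state, since that reaches the reward in the fewest steps and discounting penalizes delay. More than one policy can be optimal when actions tie in utility.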