Cs188AI Wiki

General Info

MDP search tree

What

Markov decision process (MDP) search is an example of nondeterministic search. It computes the expected value of any given state, that is, the expected utility of acting optimally from that state onward.

How

  1. Run ordinary expectimax search, with one addition: chance nodes, also called q-nodes. A q-node represents the point where we have committed to an action a from state s but do not yet know which successor state s' we will end up in.
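The step above can be sketched as a depth-limited expectimax over a small MDP. This is only an illustration under stated assumptions: the toy MDP, the transition-table layout, and the names `TOY_MDP`, `state_value`, and `q_value` are hypothetical and do not come from this page. State nodes take a max over actions, and q-nodes take a probability-weighted average over successor states.

```python
from typing import Dict, List, Tuple

# Hypothetical layout: transitions[state][action] = [(prob, next_state, reward), ...]
Transitions = Dict[str, Dict[str, List[Tuple[float, str, float]]]]

# Hypothetical toy MDP used only for illustration.
TOY_MDP: Transitions = {
    "cool": {
        "slow": [(1.0, "cool", 1.0)],
        "fast": [(0.5, "cool", 2.0), (0.5, "warm", 2.0)],
    },
    "warm": {
        "slow": [(0.5, "cool", 1.0), (0.5, "warm", 1.0)],
        "fast": [(1.0, "overheated", -10.0)],
    },
    "overheated": {},  # terminal state: no actions available
}


def q_value(mdp: Transitions, state: str, action: str,
            depth: int, gamma: float = 1.0) -> float:
    """Chance (q) node: the action is fixed, so average over outcomes s'."""
    return sum(
        prob * (reward + gamma * state_value(mdp, next_state, depth - 1, gamma))
        for prob, next_state, reward in mdp[state][action]
    )


def state_value(mdp: Transitions, state: str,
                depth: int, gamma: float = 1.0) -> float:
    """State (max) node: pick the action whose q-node has the highest value."""
    actions = mdp[state]
    if depth == 0 or not actions:
        return 0.0  # depth limit reached or terminal state
    return max(q_value(mdp, state, action, depth, gamma) for action in actions)


if __name__ == "__main__":
    for s in TOY_MDP:
        print(s, state_value(TOY_MDP, s, depth=2))
```

The depth limit stands in for a time horizon; without it, the recursion on an MDP with cycles would not terminate.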

Mathematical Definitions

The following mathematical definitions mirror the expectimax computation: a max over actions at state nodes and an expectation over outcomes at q-nodes. They are also known as the Bellman equations.
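In their standard textbook form, the equations can be written as below. The notation is an assumption, since this page does not define its symbols: T(s, a, s') is the probability of reaching s' after taking action a in state s, R(s, a, s') is the reward for that transition, and γ is the discount factor.

```latex
% Value of a state: the best q-value over available actions (max node)
V^*(s) = \max_{a} Q^*(s, a)

% Value of a q-node: expectation over successor states s' (chance node)
Q^*(s, a) = \sum_{s'} T(s, a, s') \left[ R(s, a, s') + \gamma V^*(s') \right]

% Substituting the second equation into the first
V^*(s) = \max_{a} \sum_{s'} T(s, a, s') \left[ R(s, a, s') + \gamma V^*(s') \right]
```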
