MDP planning tutorial pointers

MDP background

The paper that introduced POMDPs(?).

Local planning; tree building/lookahead search

Local planning in smooth MDPs

Policy search

Policy gradients

Global convergence of policy gradients

Check bibliography in these papers and citations! New papers every day!

Value function approximation

Other notable sources