Solve the final V* values for the given grid world. All given values are exit states. Fill in all missing cells. Don’t use value iteration. Show all your work/logic/steps, etc.
Assume: Gamma = 0.9. Noise Model: [0.8, 0.1, 0.1], that is, 80% chance of going in the intended direction and 10% in each of perpendicular ones.
Using pen and paper is preferable.