Solve this GridWorld

Question

Solve this GridWorld

Calculate the final V* values for the given grid world. Fill in all missing cells. All given values are terminal states. Use value iteration/policy iteration, any method that you like.

Assume 50% discount (that is, gamma = 0.5). Assume 0.8, 0.1, 0.1 noise, that is, probability of going to the intended direction is 0.8, and probability of going left/right is 0.1 each.

GridWorld
10	10	10	10	10
10				10
10				10
10				10
10	10	10	10	10

asked May 13, 2019 in Informed Search by Amrinder Arora AlgoMeister (1.7k points)

2 Answers

yuemolei · Answer 1 · 2020-03-24T06:34:37+0000

10	10	10	10	10
10	a	b	a	10
10	b	c	b	10
10	a	b	a	10
10	10	10	10	10

a=0.5*(0.9*10+0.1*b) (1

b=0.5*(0.8*10+0.2*a) (2

c=0.5*b

because of (1) and (2)

//a=4.5+0.05*b

//b=4+0.1*a

a=4.7+0.005a

a≈4.72361809

b≈4.47236181

c≈2.2361809

Divya Sree Vadlamudi · Answer 2 · 2024-05-06T20:21:15+0000

10    10    10    10    10
10    a       b       a    10
10    b       c       b    10
10    a       b       a    10
10    10    10    10    10

a = 0.5*(0.8*10 + 0.1*b + 0.1*10) = 0.5*(9+0.1*b)
b = 0.5*(0.8*10 + 0.1*a + 0.1*b) = 0.5*(8 + 0.1*a + 0.1*b)
c = 0.5*(0.8*b + 0.1*b + 0.1*b) = 0.5*(b)
On solving, a = 4.72 b = 4.459 c = 2.23

Categories

Most popular tags

Solve this GridWorld

Please log in or register to add a comment.

Please log in or register to answer this question.

2 Answers

Please log in or register to add a comment.

Please log in or register to add a comment.

Related questions