user5311361
user5311361

Reputation: 47

How do I describe optimal policy (pi*) of bellman's equation?

I tried to find what is pi* in many resources like this link. But, I can not find what is pi*. Is V* is same as V_pi*?

Screenshot of the question

Upvotes: 0

Views: 147

Answers (1)

suat
suat

Reputation: 4289

π* is used to represent the "optimal policy". V* and Q* are optimal value functions. Optimal value functions lead to optimal policies.

Have a look at the Section 4.6 at https://web.fe.up.pt/~eol/schaefer/diplom/ReinforcementLearning.htm

Upvotes: 2

Related Questions