Reputation: 47
I tried to find out what π* means in many resources, like this link, but I cannot find a definition. Is V* the same as V_π*?
Upvotes: 0
Views: 147
Reputation: 4289
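π* is used to represent the "optimal policy". V* and Q* are the optimal value functions, i.e. the value functions obtained by following π*, so V* is the same as V_π*. Acting greedily with respect to an optimal value function gives an optimal policy.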
Have a look at Section 4.6 at https://web.fe.up.pt/~eol/schaefer/diplom/ReinforcementLearning.htm
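To make the relationship concrete, here is a minimal sketch on a made-up two-state, two-action deterministic MDP. The transition table `P`, the discount `GAMMA`, and all function names are assumptions for illustration and are not taken from the linked text. It computes V* by value iteration, reads off π* greedily, then evaluates π* on its own to check that V_π* matches V*.

```python
# Hypothetical toy MDP (illustration only): P[state][action] = (next_state, reward)
P = {
    0: {0: (0, 0.0), 1: (1, 1.0)},
    1: {0: (0, 0.0), 1: (1, 2.0)},
}
GAMMA = 0.9  # assumed discount factor


def value_iteration(P, gamma, tol=1e-10):
    """Approximate V* by repeatedly applying the Bellman optimality backup."""
    V = {s: 0.0 for s in P}
    while True:
        new_V = {
            s: max(r + gamma * V[s2] for (s2, r) in P[s].values())
            for s in P
        }
        if max(abs(new_V[s] - V[s]) for s in P) < tol:
            return new_V
        V = new_V


def greedy_policy(P, V, gamma):
    """Pick, in each state, the action with the best one-step lookahead value."""
    return {
        s: max(P[s], key=lambda a: P[s][a][1] + gamma * V[P[s][a][0]])
        for s in P
    }


def evaluate_policy(P, policy, gamma, tol=1e-10):
    """Approximate V_pi for a fixed policy by iterating its Bellman equation."""
    V = {s: 0.0 for s in P}
    while True:
        new_V = {}
        for s in P:
            s2, r = P[s][policy[s]]
            new_V[s] = r + gamma * V[s2]
        if max(abs(new_V[s] - V[s]) for s in P) < tol:
            return new_V
        V = new_V


V_star = value_iteration(P, GAMMA)
pi_star = greedy_policy(P, V_star, GAMMA)
V_pi_star = evaluate_policy(P, pi_star, GAMMA)

print("pi*    :", pi_star)
print("V*     :", V_star)
print("V_pi*  :", V_pi_star)
```

In this toy example the two value functions agree up to the numerical tolerance, which is what V* = V_π* means in practice.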
Upvotes: 2