Reputation: 666
I understand epsilon-greedy algorithm, but there is one point of confusion.
Upvotes: 1
Views: 1856
Reputation: 5402
Epsilon-greedy is a policy, not an algorithm. It is exclusive of discrete action problems: you select the action according to
argmax Q(s,a) with probability 1-epsilon
random otherwise
You can use with Q-learning, SARSA, DDPG, policy gradient, ...
Upvotes: 3