Kagaratsch
Kagaratsch

Reputation: 1023

Proximal Policy Optimization Algorithms paper - definition of "KL" operation?

In the original paper on Proximal Policy Optimization Algorithms

https://arxiv.org/pdf/1707.06347.pdf

in equation (4) the authors use an operation denoted by KL[]. Unfortunately, they never give a definition for it.

My question:

What does the KL[] operation stand for?

Upvotes: 1

Views: 83

Answers (1)

Mark Ma
Mark Ma

Reputation: 1362

Maybe it's KL divergence?

KL divergence is used to compare differences between two probability distribution.

Upvotes: 3

Related Questions