Thanks for pointing out the erroneous statement, cases with positive advantage are indeed not…

The explanation of the PPO appears to be needlessly convoluted and the quoted text describing the…
5
1
Arjun Rao
Wouter van Heeswijk, PhD
·Follow
Aug 16, 2023
--
Thanks for pointing out the erroneous statement, cases with positive advantage are indeed not always clipped. I've revised the explanation.
--
--
Written by Wouter van Heeswijk, PhD1.7K Followers
·36 Following
Assistant professor in Financial Engineering and Operations Research. Writing about reinforcement learning, optimization problems, and data science.
No responses yet
Help
Status
About
Careers
Press
Blog
Privacy
Terms
Text to speech
Teams