--

Thanks for pointing out the erroneous statement, cases with positive advantage are indeed not always clipped. I've revised the explanation.

--

--

Wouter van Heeswijk, PhD
Wouter van Heeswijk, PhD

Written by Wouter van Heeswijk, PhD

Assistant professor in Financial Engineering and Operations Research. Writing about reinforcement learning, optimization problems, and data science.

No responses yet