Aug 16, 2023
Thanks for pointing out the erroneous statement; cases with positive advantage are indeed not always clipped. I've revised the explanation.
Assistant professor in Financial Engineering and Operations Research. Writing about reinforcement learning, optimization problems, and data science.