Double Clipping: Less-Biased Variance Reduction in Off-Policy Evaluation (2309.01120v1)

Published 3 Sep 2023 in cs.LG

Abstract: "Clipping" (a.k.a. importance weight truncation) is a widely used variance-reduction technique for counterfactual off-policy estimators. Like other variance-reduction techniques, clipping reduces variance at the cost of increased bias. However, unlike other techniques, the bias introduced by clipping is always a downward bias (assuming non-negative rewards), yielding a lower bound on the true expected reward. In this work we propose a simple extension, called $\textit{double clipping}$, which aims to compensate this downward bias and thus reduce the overall bias, while maintaining the variance reduction properties of the original estimator.

Citations (2)

View on Semantic Scholar

Collections

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Follow-up Questions

We haven't generated follow-up questions for this paper yet.

Generate Now

Double Clipping: Less-Biased Variance Reduction in Off-Policy Evaluation (2309.01120v1)

Collections

Summary

Follow-up Questions

Authors (5)

Don't miss out on important new AI/ML research

Double Clipping: Less-Biased Variance Reduction in Off-Policy Evaluation (2309.01120v1)

Collections

Summary

Follow-up Questions

Related Papers

Authors (5)

Don't miss out on important new AI/ML research