Off-Policy Evaluation with Out-of-Sample Guarantees

Sofia Ek, Dave Zachariah, Fredrik D. Johansson, Peter Stoica

Action editor: Alain Oliviero Durmus.

https://openreview.net/forum?id=XnYtGPgG9p

#coverage #policy #inferences

Off-Policy Evaluation with Out-of-Sample Guarantees

We consider the problem of evaluating the performance of a decision policy using past observational data. The outcome of a policy is measured in terms of a loss (aka. disutility or negative reward)...

OpenReview