Heavy-tailed distributions present a tough setting for inference. They are also common in industrial applications, particularly with internet transaction datasets, and machine learners often analyze such data without considering the biases and risks associated with the misuse of standard tools. This chapter outlines a procedure for inference about the mean of a (possibly conditional) heavy-tailed distribution that combines nonparametric analysis for the bulk of the support with Bayesian parametric modeling – motivated from extreme value theory – for the heavy tail. The procedure is fast and massively scalable. The work should find application in settings wherever correct inference is important and reward tails are heavy; we illustrate the framework in causal inference for A/B experiments involving hundreds of millions of users of eBay.com.
Lopes, H., Taddy, M. and Gardner, M. (2019), "Scalable Semiparametric Inference for the Means of Heavy-tailed Distributions", Topics in Identification, Limited Dependent Variables, Partial Observability, Experimentation, and Flexible Modeling: Part B (Advances in Econometrics, Vol. 40B), Emerald Publishing Limited, pp. 141-156. https://doi.org/10.1108/S0731-90532019000040B008Download as .RIS
Emerald Publishing Limited
Copyright © 2019 Emerald Publishing Limited