A Fitting Pareto Tails to Wealth Survey Data: A Practioners’ Guide
Keywords:Pareto distribution, complex survey data, wealth distribution
Taking survey data of household wealth as our major example, this short article discusses some of the issues applied researchers are facing when fitting (Type I) Pareto distributions to complex survey data. The contribution of this article is threefold. First, we show how the ordering of the data vector is related to alternative definitions of the empirical CCDF. Second, we provide an intuitive reinterpretation of the bias-corrected estimator developed by Gabaix and Ibragimov (2011), in terms of the alternative definitions of the empirical CCDF, which allows us to generalize their result to the case of complex survey data. Third, we provide computational formulas for standard Kolmogorov-Smirnov (KS) and Cramer-von Mises (CvM) goodness- of-fit tests for complex survey data. Taken together the article provides a concise and hopefully useful presentation of the fundamentals of Pareto tail- fitting with complex survey data.
Aigner, D.J. and A.S. Goldberger 1970 “Estimation of Pareto’s law from grouped observation”, Journal of the American Statistical Association 65(330): 712-723.
Bach, S., A. Thiemann, and A. Zucco 2018 “Looking for the missing rich: Tracing the top tail of the wealth distribution”, DIW Berlin Discussion Paper.
Clauset, A., C.R. Shalizi and M.E.J. Newman 2009 “Power-law distributions in empirical data”, SIAM Review 51(4): 661-703.
D’Agostino, R.B. and M.A. Stephens (eds) 1986 Goodness-of-Fit Techniques in D.B. Owen (ed.), Statistics: Textbooks and Monographs Vol. 68. New York: Marcel Dekker, Inc.
Dalitz, C. 2018 “Estimating wealth distribution: Top tail and inequality”, Cornell University arXiv, preprint arXiv 1807.03592.
Gabaix, X. and R. Ibragimov 2011 “Rank - 1/2: A simple way to improve the OLS estimation oftail exponent”, Journal of Business and Economic Statistics 29(1): 24-39.
Jayadev, A. 2008 “A power law tail in India’s wealth distribution: Evidence from survey data”, Physica A: Statistical Mechanics and its Applications 387(1): 270-276.
Kratz, M. and S.I. Resnick 1996 “The qq-estimator and heavy tail”, Communications in Statistics. Stochastic Models 12(4): 699-724.
Monahan, J. F. 2011 Numerical Methods of Statistics 2nd ed. New York: Cambridge University Press.
Survey of Consumer Finances 2016. The study is sponsored by the Federal Reserve Board in cooperation with the Department of the Treasury. Data have been collected by the National Opinion Research Center (NORC) at the University of Chicago.
Vermeulen, P. 2018 “How fat is the top tail of the wealth distribution?”, Review of Income and Wealth 64(2): 357-387.
Wildauer, R. and J. Kapeller 2019 “Rank correction: A new approach to differential nonresponse in wealth survey data”, Greenwich Papers in Political Economy73.