journal article Jan 28, 2014

Predictive mean matching imputation of semicontinuous variables

Statistica Neerlandica Vol. 68 No. 1 pp. 61-90 · Wiley
DOI: 10.1111/stan.12023
Abstract
Multiple imputation methods properly account for the uncertainty of missing data. One of those methods for creating multiple imputations is predictive mean matching (PMM), a general purpose method. Little is known about the performance of PMM in imputing non‐normal semicontinuous data (skewed data with a point mass at a certain value and otherwise continuously distributed). We investigate the performance of PMM as well as dedicated methods for imputing semicontinuous data by performing simulation studies under univariate and multivariate missingness mechanisms. We also investigate the performance on real‐life datasets. We conclude that PMM performance is at least as good as the investigated dedicated methods for imputing semicontinuous data and, in contrast to other methods, is the only method that yields plausible imputations and preserves the original data distributions.
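The matching idea behind PMM can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `pmm_impute`, the donor-pool size `k=5`, the single-predictor linear model, and the simulated data are all assumptions made for illustration. The sketch fits a linear model on the observed cases, draws perturbed coefficients, and fills each missing value with the observed value of a randomly chosen donor whose predicted mean is close. Because every imputation is an actually observed value, the point mass at zero and the non-negativity of a semicontinuous variable carry over to the imputed data.

```python
import numpy as np

def pmm_impute(x, y, k=5, rng=None):
    """One PMM imputation of y (NaN = missing) given predictor(s) x. Illustrative sketch."""
    rng = np.random.default_rng() if rng is None else rng
    y = np.asarray(y, dtype=float)
    X = np.column_stack([np.ones(len(y)), np.asarray(x, dtype=float)])
    mis = np.isnan(y)
    X_obs, y_obs = X[~mis], y[~mis]

    # Least-squares fit on the observed cases.
    xtx_inv = np.linalg.inv(X_obs.T @ X_obs)
    beta_hat = xtx_inv @ X_obs.T @ y_obs
    resid = y_obs - X_obs @ beta_hat

    # Draw sigma^2 and beta to reflect parameter uncertainty.
    df = len(y_obs) - X.shape[1]
    sigma2_star = resid @ resid / rng.chisquare(df)
    beta_star = rng.multivariate_normal(beta_hat, sigma2_star * xtx_inv)

    # Predicted means: fitted coefficients for donors, drawn coefficients for recipients.
    pred_obs = X_obs @ beta_hat
    pred_mis = X[mis] @ beta_star

    # For each missing case, pick one of the k donors with the closest predicted mean.
    k = min(k, len(y_obs))
    dist = np.abs(pred_mis[:, None] - pred_obs[None, :])
    donors = np.argsort(dist, axis=1)[:, :k]
    pick = rng.integers(0, k, size=donors.shape[0])

    imputed = y.copy()
    imputed[mis] = y_obs[donors[np.arange(donors.shape[0]), pick]]
    return imputed

# Simulated semicontinuous variable: point mass at zero, skewed positive part.
rng = np.random.default_rng(0)
n = 500
x = rng.normal(size=n)
y = np.where(rng.random(n) < 0.3, 0.0,
             np.exp(0.5 * x + rng.normal(scale=0.5, size=n)))
y_mis = y.copy()
y_mis[rng.random(n) < 0.2] = np.nan  # assumed missingness: 20% completely at random

y_imp = pmm_impute(x, y_mis, k=5, rng=rng)
```

Calling `pmm_impute` repeatedly with fresh random draws would give several completed datasets, which is the multiple-imputation setting the abstract refers to.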
