journal article Oct 13, 2009

Balance diagnostics for comparing the distribution of baseline covariates between treatment groups in propensity‐score matched samples

Statistics in Medicine Vol. 28 No. 25 pp. 3083-3107 · Wiley
Abstract
AbstractThe propensity score is a subject's probability of treatment, conditional on observed baseline covariates. Conditional on the true propensity score, treated and untreated subjects have similar distributions of observed baseline covariates. Propensity‐score matching is a popular method of using the propensity score in the medical literature. Using this approach, matched sets of treated and untreated subjects with similar values of the propensity score are formed. Inferences about treatment effect made using propensity‐score matching are valid only if, in the matched sample, treated and untreated subjects have similar distributions of measured baseline covariates. In this paper we discuss the following methods for assessing whether the propensity score model has been correctly specified: comparing means and prevalences of baseline characteristics using standardized differences; ratios comparing the variance of continuous covariates between treated and untreated subjects; comparison of higher order moments and interactions; five‐number summaries; and graphical methods such as quantile–quantile plots, side‐by‐side boxplots, and non‐parametric density plots for comparing the distribution of baseline covariates between treatment groups. We describe methods to determine the sampling distribution of the standardized difference when the true standardized difference is equal to zero, thereby allowing one to determine the range of standardized differences that are plausible with the propensity score model having been correctly specified. We highlight the limitations of some previously used methods for assessing the adequacy of the specification of the propensity‐score model. In particular, methods based on comparing the distribution of the estimated propensity score between treated and untreated subjects are uninformative. Copyright © 2009 John Wiley & Sons, Ltd.
Topics

No keywords indexed for this article. Browse by subject →

References
47
[1]
The central role of the propensity score in observational studies for causal effects

Paul R. Rosenbaum, Donald B. Rubin

Biometrika 10.1093/biomet/70.1.41
[16]
Tu JV (2004)
[19]
Moher D "The CONSORT statement: revised recommendations for improving the quality of reports of parallel‐group randomized trials" Journal of the American Medical Association (2001)
[26]
Cohen J (1988)
[27]
Hedges LV (1985)
[30]
Rosner B (1995)
[32]
Hoaglin DC (1983)
[33]
Casella G (1990)
[41]
Sackett DL "Down with odds ratios! for publication" Evidence‐Based Medicine (1996)
[44]
The number needed to treat: a clinically useful measure of treatment effect

Richard J Cook, David L Sackett

BMJ 10.1136/bmj.310.6977.452
[45]
Jaeschke R "Basis statistics for clinicians 3: assessing the effects of treatment: measures of association" Canadian Medical Association Journal (1995)
Cited By
5,504
Annals of Emergency Medicine
New England Journal of Medicine
Global Change Biology
Nature Medicine
Metrics
5,504
Citations
47
References
Details
Published
Oct 13, 2009
Vol/Issue
28(25)
Pages
3083-3107
License
View
Cite This Article
Peter C. Austin (2009). Balance diagnostics for comparing the distribution of baseline covariates between treatment groups in propensity‐score matched samples. Statistics in Medicine, 28(25), 3083-3107. https://doi.org/10.1002/sim.3697
Related

You May Also Like