COURSE OVERVIEW
The course begins by offering an intuitive explanation of Rubin’s classification scheme for missingness mechanisms (MCAR, MAR, MNAR). It then moves on to a discussion of the deficiencies in a number of commonly used ad-hoc approaches to handling missing data, before introducing the method of multiple imputation, a principled approach for handling missing data under the MAR assumption. An overview of Stata 12’s multiple imputation capabilities is provided, together with a comparison of the different imputation approaches. The course concludes by offering a brief introduction to an alternative approach to handling missingness, that of inverse probability weighting. Throughout we emphasize that the choice of statistical methods for handling missing data should be made based on the assumptions believed to be reasonable for the study in hand.
TARGET AUDIENCE
Researchers with strong quantitative skills and experience in statistical analysis in Stata.
COURSE REQUISITES
It is also assumed that participants will have followed the Invited Speaker session of the conference, in which an introduction to the missing data problem will be discussed.
PROGRAM
SESSION I
The missingness mechanism and Rubin’s taxonomy (MCAR, MAR, MNAR)
A critique of ad-hoc approaches
SESSION II
An introduction to parametric multiple imputation
SESSION II: MULTIPLE IMPUTATION IN STATA
Setting up Stata for multiple imputation
mputation with the multivariate normal model
Imputation by chained equations
Analysing multiply imputed datasets
SESSION IV
A brief introduction to inverse probability weighting
Untestable assumptions and the importance of sensitivity analyses
Conclusions