Missing Data in Randomized Control Trials John W

  • Slides: 82
Download presentation
Missing Data in Randomized Control Trials John W. Graham The Prevention Research Center and

Missing Data in Randomized Control Trials John W. Graham The Prevention Research Center and Department of Biobehavioral Health Penn State University jgraham@psu. edu IES/NCER Summer Research Training Institute, July 2008

Sessions in Four Parts n n n (1) Introduction: Missing Data Theory (2) Attrition:

Sessions in Four Parts n n n (1) Introduction: Missing Data Theory (2) Attrition: Bias and Lost Power (3) A brief analysis demonstration n Multiple Imputation with n n n NORM and Proc MI (4) Hands-on Intro to Multiple Imputation

Recent Papers n Graham, J. W. , Cumsille, P. E. , & Elek-Fisk, E.

Recent Papers n Graham, J. W. , Cumsille, P. E. , & Elek-Fisk, E. (2003). Methods for handling missing data. In J. A. Schinka & W. F. Velicer (Eds. ). Research Methods in Psychology (pp. 87_114). Volume 2 of Handbook of Psychology (I. B. Weiner, Editor-in-Chief). New York: John Wiley & Sons. n Graham, J. W. , (2009, in press). Missing data analysis: making it work in the real world. Annual Review of Psychology, 60. n Collins, L. M. , Schafer, J. L. , & Kam, C. M. (2001). A comparison of inclusive and restrictive strategies in modern missing data procedures. Psychological Methods, 6, 330_351. n Schafer, J. L. , & Graham, J. W. (2002). Missing data: our view of the state of the art. Psychological Methods, 7, 147177.

Part 1: A Brief Introduction to Analysis with Missing Data

Part 1: A Brief Introduction to Analysis with Missing Data

Problem with Missing Data n Analysis procedures were designed for complete data . .

Problem with Missing Data n Analysis procedures were designed for complete data . . .

Solution 1 n n n Design new model-based procedures Missing Data + Parameter Estimation

Solution 1 n n n Design new model-based procedures Missing Data + Parameter Estimation in One Step Full Information Maximum Likelihood (FIML) SEM and Other Latent Variable Programs (Amos, LISREL, Mplus, Mx, LTA)

Solution 2 n Data based procedures n n e. g. , Multiple Imputation (MI)

Solution 2 n Data based procedures n n e. g. , Multiple Imputation (MI) Two Steps n Step 1: Deal with the missing data n n n (e. g. , replace missing values with plausible values Produce a product Step 2: Analyze the product as if there were no missing data

FAQ n Aren't you somehow with imputation? . . . helping yourself

FAQ n Aren't you somehow with imputation? . . . helping yourself

NO. Missing data imputation. . . n does NOT give you something for nothing

NO. Missing data imputation. . . n does NOT give you something for nothing n DOES let you make use of all data you have. . .

FAQ n Is the imputed value what the person would have given?

FAQ n Is the imputed value what the person would have given?

NO. When we impute a value. . n We do not impute for the

NO. When we impute a value. . n We do not impute for the sake of the value itself n We impute to preserve important characteristics of the whole data set . . .

We want. . . n unbiased parameter estimation n n Good estimate of variability

We want. . . n unbiased parameter estimation n n Good estimate of variability n n e. g. , b-weights e. g. , standard errors best statistical power

Causes of Missingness n Ignorable MCAR: Missing Completely At Random n MAR: Missing At

Causes of Missingness n Ignorable MCAR: Missing Completely At Random n MAR: Missing At Random n n Non-Ignorable n MNAR: Missing Not At Random

MCAR (Missing Completely At Random) n MCAR 1: Cause of missingness completely random process

MCAR (Missing Completely At Random) n MCAR 1: Cause of missingness completely random process (like coin flip) n MCAR 2: Cause uncorrelated with variables of interest n Example: parents move n n No bias if cause omitted

MAR (Missing At Random) n Missingness may be related to measured variables n But

MAR (Missing At Random) n Missingness may be related to measured variables n But no residual relationship with unmeasured variables n n Example: reading speed No bias if you control for measured variables

MNAR (Missing Not At Random) n Even after controlling for measured variables. . .

MNAR (Missing Not At Random) n Even after controlling for measured variables. . . n Residual relationship with unmeasured variables n Example: drug use reason for absence

MNAR Causes n The recommended methods assume missingness is MAR n But what if

MNAR Causes n The recommended methods assume missingness is MAR n But what if the cause of missingness is not MAR? n Should these methods be used when MAR assumptions not met?

YES! These Methods Work! n Suggested methods work better than “old” methods n Multiple

YES! These Methods Work! n Suggested methods work better than “old” methods n Multiple causes of missingness n n Only small part of missingness may be MNAR Suggested methods usually work very well

Methods: "Old" vs MAR vs MNAR n MAR methods (MI and ML) n n

Methods: "Old" vs MAR vs MNAR n MAR methods (MI and ML) n n n are ALWAYS at least as good as, usually better than "old" methods (e. g. , listwise deletion) Methods designed to handle MNAR missingness are NOT always better than MAR methods

Analysis: Old and New

Analysis: Old and New

Old Procedures: Analyze Complete Cases (listwise deletion) n may produce bias n you always

Old Procedures: Analyze Complete Cases (listwise deletion) n may produce bias n you always lose some power n (because you are throwing away data) n reasonable if you lose only 5% of cases n often lose substantial power

Analyze Complete Cases (listwise deletion) 1 0 1 1 1 n n n 1

Analyze Complete Cases (listwise deletion) 1 0 1 1 1 n n n 1 1 0 1 1 1 1 1 0 very common situation only 20% (4 of 20) data points missing but discard 80% of the cases

Other "Old" Procedures n Pairwise deletion n n Mean substitution n n May be

Other "Old" Procedures n Pairwise deletion n n Mean substitution n n May be of occasional use for preliminary analyses Never use it Regression-based single imputation n generally not recommended. . . except. . .

Recommended Model-Based Procedures n n Multiple Group SEM (Structural Equation Modeling) Latent Transition Analysis

Recommended Model-Based Procedures n n Multiple Group SEM (Structural Equation Modeling) Latent Transition Analysis (Collins et al. ) n A latent class procedure

Recommended Model-Based Procedures n Raw Data Maximum Likelihood SEM aka Full Information Maximum Likelihood

Recommended Model-Based Procedures n Raw Data Maximum Likelihood SEM aka Full Information Maximum Likelihood (FIML) n Amos (James Arbuckle) n LISREL 8. 5+ (Jöreskog & Sörbom) n Mplus (Bengt Muthén) n Mx (Michael Neale)

Amos 7, Mx, Mplus, LISREL 8. 8 n Structural Equation Modeling (SEM) Programs n

Amos 7, Mx, Mplus, LISREL 8. 8 n Structural Equation Modeling (SEM) Programs n In Single Analysis. . . n Good Estimation n Reasonable standard errors n Windows Graphical Interface

Limitation with Model-Based Procedures n That particular model must be what you want

Limitation with Model-Based Procedures n That particular model must be what you want

Recommended Data-Based Procedures EM Algorithm (ML parameter estimation) n Norm-Cat-Mix, EMcov, SAS, SPSS Multiple

Recommended Data-Based Procedures EM Algorithm (ML parameter estimation) n Norm-Cat-Mix, EMcov, SAS, SPSS Multiple Imputation n NORM, Cat, Mix, Pan (Joe Schafer) n SAS Proc MI n LISREL 8. 5+ n Amos 7

EM Algorithm Expectation - Maximization Alternate between n E-step: predict missing data M-step: estimate

EM Algorithm Expectation - Maximization Alternate between n E-step: predict missing data M-step: estimate parameters n Excellent (ML) parameter estimates n But no standard errors must use bootstrap n or multiple imputation n

Multiple Imputation n Problem with Single Imputation: Too Little Variability n n Because of

Multiple Imputation n Problem with Single Imputation: Too Little Variability n n Because of Error Variance Because covariance matrix is only one estimate

Too Little Error Variance n Imputed value lies on regression line

Too Little Error Variance n Imputed value lies on regression line

Imputed Values on Regression Line

Imputed Values on Regression Line

Restore Error. . . n Add random normal residual

Restore Error. . . n Add random normal residual

Regression Line only One Estimate

Regression Line only One Estimate

Covariance Matrix (Regression Line) only One Estimate n n n Obtain multiple plausible estimates

Covariance Matrix (Regression Line) only One Estimate n n n Obtain multiple plausible estimates of the covariance matrix ideally draw multiple covariance matrices from population Approximate this with Bootstrap n Data Augmentation (Norm) n MCMC (SAS 8. 2, 9) n

Data Augmentation n stochastic version of EM n EM E (expectation) step: predict missing

Data Augmentation n stochastic version of EM n EM E (expectation) step: predict missing data n M (maximization) step: estimate parameters n n Data Augmentation I (imputation) step: simulate missing data n P (posterior) step: simulate parameters n

Data Augmentation n Parameters from consecutive steps. . . too related n i. e.

Data Augmentation n Parameters from consecutive steps. . . too related n i. e. , not enough variability n n after 50 or 100 steps of DA. . . covariance matrices are like random draws from the population

Multiple Imputation Allows: n Unbiased Estimation n Good standard errors provided number of imputations

Multiple Imputation Allows: n Unbiased Estimation n Good standard errors provided number of imputations (m) is large enough n too few imputations reduced power with small effect sizes n

ρ From Graham, J. W. , Olchowski, A. E. , & Gilreath, T. D.

ρ From Graham, J. W. , Olchowski, A. E. , & Gilreath, T. D. (2007). needed? Some practical clarifications of multiple imputation theory. How many imputations are really Prevention Science, 8, 206 -213.

Part 2 Attrition: Bias and Loss of Power

Part 2 Attrition: Bias and Loss of Power

Relevant Papers n Graham, J. W. , (in press). n Collins, Missing data analysis:

Relevant Papers n Graham, J. W. , (in press). n Collins, Missing data analysis: making it work in the real world. Annual Review of Psychology, 60. L. M. , Schafer, J. L. , & Kam, C. M. (2001). A comparison of inclusive and restrictive strategies in modern missing data procedures. Psychological Methods, 6, 330_351. n Hedeker, D. , & Gibbons, R. D. (1997). n Graham, J. W. , & Collins, L. M. (2008). n Graham, J. W. , Palen, L. A. , et al. (2008). Application of random-effects pattern-mixture models for missing data in longitudinal studies, Psychological Methods, 2, 64 -78. Using Modern Missing Data Methods with Auxiliary Variables to Mitigate the Effects of Attrition on Statistical Power. Annual Meetings of the Society for Prevention Research, San Francisco, CA. (available upon request) Attrition: MAR & MNAR missingness, and estimation bias. Annual Meetings of the Society for Prevention Research, San Francisco, CA. (available upon request)

What if the cause of missingness is MNAR? Problems with this statement n n

What if the cause of missingness is MNAR? Problems with this statement n n n MAR & MNAR are widely misunderstood concepts I argue that the cause of missingness is never purely MNAR The cause of missingness is virtually never purely MAR either.

MAR vs MNAR n n "Pure" MCAR, MNAR never occur in field research Each

MAR vs MNAR n n "Pure" MCAR, MNAR never occur in field research Each requires untenable assumptions n e. g. , that all possible correlations and partial correlations are r = 0

MAR vs MNAR n n Better to think of MAR and MNAR as forming

MAR vs MNAR n n Better to think of MAR and MNAR as forming a continuum MAR vs MNAR NOT even the dimension of interest

MAR vs MNAR: What IS the Dimension of Interest? n How much estimation bias?

MAR vs MNAR: What IS the Dimension of Interest? n How much estimation bias? n when cause of missingness cannot be included in the model

Bottom Line. . . n n All missing data situations are partly MAR and

Bottom Line. . . n n All missing data situations are partly MAR and partly MNAR Sometimes it matters. . . n n bias affects statistical conclusions Often it does not matter n bias has tolerably little effect on statistical conclusions (Collins, Schafer, & Kam, Psych Methods, 2001)

Methods: "Old" vs MAR vs MNAR n MAR methods (MI and ML) n n

Methods: "Old" vs MAR vs MNAR n MAR methods (MI and ML) n n n are ALWAYS at least as good as, usually better than "old" methods (e. g. , listwise deletion) Methods designed to handle MNAR missingness are NOT always better than MAR methods

Yardstick for Measuring Bias Standardized Bias = (average parameter est) – (population value) ----------------------------

Yardstick for Measuring Bias Standardized Bias = (average parameter est) – (population value) ---------------------------- X 100 Standard Error (SE) n |bias| < 40 considered small enough to be tolerable

A little background for Collins, Schafer, & Kam (2001; CSK) n Example model of

A little background for Collins, Schafer, & Kam (2001; CSK) n Example model of interest: X Y X = Program (prog vs control) Y = Cigarette Smoking Z = Cause of missingness: say, Rebelliousness (or smoking itself) n Factors to be considered: n n n % Missing (e. g. , % attrition) r. YZ r ZR

r. YZ n Correlation between n cause of missingness (Z) n n e. g.

r. YZ n Correlation between n cause of missingness (Z) n n e. g. , rebelliousness (or smoking itself) and the variable of interest (Y) n e. g. , Cigarette Smoking

r ZR n Correlation between n cause of missingness (Z) n n and missingness

r ZR n Correlation between n cause of missingness (Z) n n and missingness on variable of interest n n e. g. , rebelliousness (or smoking itself) e. g. , Missingness on the Smoking variable Missingness on Smoking (often designated: R or RY) n Dichotomous variable: R = 1: Smoking variable not missing R = 0: Smoking variable missing

CSK Study Design n (partial) Simulations manipulated n n n amount of missingness (25%

CSK Study Design n (partial) Simulations manipulated n n n amount of missingness (25% vs 50%) r. ZY (r =. 40, r =. 90) r. ZR held constant n n r =. 45 with 50% missing (applies to "MNAR-Linear" missingness)

CSK Results (partial) (MNAR Missingness) n n 25% 50% missing, r. YZ = =

CSK Results (partial) (MNAR Missingness) n n 25% 50% missing, r. YZ = = . 40. 90 . . . no problem * "no problem" = bias does not interfere with inference These Results apply to the regression coefficient for X Y with "MNAR-Linear" missingness (see CSK, 2001, Table 2)

But Even CSK Results Too Conservative n Not considered by CSK: r. ZR In

But Even CSK Results Too Conservative n Not considered by CSK: r. ZR In their simulation r. ZR =. 45 n Even with 50% missing and r. YZ =. 90 n n n bias can be acceptably small Graham et al. (2008): n Bias acceptably small (standardized bias < 40) as long as r. ZR <. 24

r. ZR <. 24 Very Plausible Study _____ Health. Wise (Caldwell, Smith, et al.

r. ZR <. 24 Very Plausible Study _____ Health. Wise (Caldwell, Smith, et al. , 2004) AAPT (Hansen & Graham, 1991) Botvin 1 Botvin 2 Botvin 3 n r. ZR (estimated) _____. 106. 093. 044. 078. 104 All of these yield standardized bias < 10

CSK and Follow-up Simulations n Results very promising Suggest that even MNAR biases are

CSK and Follow-up Simulations n Results very promising Suggest that even MNAR biases are often tolerably small n But these simulations still too narrow n

Beginnings of a Taxonomy of Attrition Causes of Attrition on Y (main DV) n

Beginnings of a Taxonomy of Attrition Causes of Attrition on Y (main DV) n n Case 1: not Program (P), not Y, not PY interaction Case 2: P only Case 3: Y only. . . (CSK scenario) Case 4: P and Y only

Beginnings of a Taxonomy of Attrition Causes of Attrition on Y (main DV) n

Beginnings of a Taxonomy of Attrition Causes of Attrition on Y (main DV) n n Case 5: 6: 7: 8: PY interaction only P + PY interaction Y + PY interaction P, Y, and PY interaction

Taxonomy of Attrition n Cases 1 -4 n n often little or no problem

Taxonomy of Attrition n Cases 1 -4 n n often little or no problem Cases 5 -8 n n n Jury still out (more research needed) Very likely not as much of a problem as previously though Use diagnostics to shed light

Use of Missing Data Diagnostics n Diagnostics based on pretest data not much help

Use of Missing Data Diagnostics n Diagnostics based on pretest data not much help n n Hard to predict missing distal outcomes from differences on pretest scores Longitudinal Diagnostics can be much more helpful

Hedeker & Gibbons (1997) n Plot main DV over time for four groups: n

Hedeker & Gibbons (1997) n Plot main DV over time for four groups: n n n for Program and Control for those with and without last wave of data Much can be learned

Empirical Examples n Hedeker & Gibbons (1997) n n Drug treatment of psychiatric patients

Empirical Examples n Hedeker & Gibbons (1997) n n Drug treatment of psychiatric patients Hansen & Graham (1991) n n Adolescent Alcohol Prevention Trial (AAPT) Alcohol, smoking, other drug prevention among normal adolescents (7 th – 11 th grade)

Empirical Example Used by Hedeker & Gibbons (1997) n n IV: Drug Treatment vs.

Empirical Example Used by Hedeker & Gibbons (1997) n n IV: Drug Treatment vs. Placebo Control DV: Inpatient Multidimensional Psychiatric Scale (IMPS) n n n n 1 2 3 4 5 6 7 = = = = normal borderline mentally ill mildly ill moderately ill markedly ill severely ill among the most extremely ill

From Hedeker & Gibbons (1997) Placebo Control IMPS low = better outcomes Drug Treatment

From Hedeker & Gibbons (1997) Placebo Control IMPS low = better outcomes Drug Treatment Weeks of Treatment

Longitudinal Diagnostics Hedeker & Gibbons Example n n Treatment n droppers do BETTER than

Longitudinal Diagnostics Hedeker & Gibbons Example n n Treatment n droppers do BETTER than stayers Control n droppers do WORSE than stayers Example of Program X DV interaction But in this case, pattern would lead to suppression bias n Not as bad for internal validity in presence of significant program effect

AAPT (Hansen & Graham, 1991) n n IV: Normative Education Program vs Information Only

AAPT (Hansen & Graham, 1991) n n IV: Normative Education Program vs Information Only Control DV: Cigarette Smoking (3 -item scale) n n Measured at one-year intervals 7 th grade – 11 th grade

AAPT Control Program Control Cigarette Smoking (high = more smoking; arbitrary scale) Program th

AAPT Control Program Control Cigarette Smoking (high = more smoking; arbitrary scale) Program th th th

Longitudinal Diagnostics AAPT Example n Treatment n droppers do WORSE than stayers n n

Longitudinal Diagnostics AAPT Example n Treatment n droppers do WORSE than stayers n n Control n droppers do WORSE than stayers n n n little steeper increase Little evidence for Prog X DV interaction Very likely MAR methods allow good conclusions (CSK scenario holds)

Use of Auxiliary Variables n n Reduces attrition bias Restores some power lost due

Use of Auxiliary Variables n n Reduces attrition bias Restores some power lost due to attrition

What Is an Auxiliary Variable? n A variable correlated with the variables in your

What Is an Auxiliary Variable? n A variable correlated with the variables in your model n n but not part of the model not necessarily related to missingness used to "help" with missing data estimation Best auxiliary variables: n same variable as main DV, but measured at waves not used in analysis model

Model of Interest

Model of Interest

Benefit of Auxiliary Variables n Example from Graham & Collins (2008) X Y Z

Benefit of Auxiliary Variables n Example from Graham & Collins (2008) X Y Z 1 1 0 1 n n 500 complete cases 500 cases missing Y X, Y variables in the model (Y sometimes missing) Z is auxiliary variable

Benefit of Auxiliary Variables n Effective sample size (N') n n Analysis involving N

Benefit of Auxiliary Variables n Effective sample size (N') n n Analysis involving N cases, with auxiliary variable(s) gives statistical power equivalent to N' complete cases without auxiliary variables

Benefit of Auxiliary Variables n n It matters how highly Y and Z (the

Benefit of Auxiliary Variables n n It matters how highly Y and Z (the auxiliary variable) are correlated For example n n r. YZ = = . 40. 60. 80. 90 increase N N = = 500 500 gives power of of N' N' = = 542 608 733 839 ( 8%) (22%) (47%) (68%)

Effective Sample Size by r. YZ Effective Sample Size r. YZ

Effective Sample Size by r. YZ Effective Sample Size r. YZ

Conclusions n n n Attrition CAN be bad for internal validity But often it's

Conclusions n n n Attrition CAN be bad for internal validity But often it's NOT nearly as bad as often feared Don't rush to conclusions, even with rather substantial attrition Examine evidence (especially longitudinal diagnostics) before drawing conclusions Use MI and ML missing data procedures! Use good auxiliary variables to minimize impact of attrition

Part 3: Illustration of Missing Data Analysis: Multiple Imputation with NORM and Proc MI

Part 3: Illustration of Missing Data Analysis: Multiple Imputation with NORM and Proc MI

Multiple Imputation: Basic Steps n Impute n Analyze n Combine results

Multiple Imputation: Basic Steps n Impute n Analyze n Combine results

Imputation and Analysis n Impute 40 datasets n n Analyze each data set with

Imputation and Analysis n Impute 40 datasets n n Analyze each data set with USUAL procedures n n a missing value gets a different imputed value in each dataset e. g. , SAS, SPSS, LISREL, EQS, STATA Save parameter estimates and SE’s

Combine the Results Parameter Estimates to Report n Average of estimate (b-weight) over 40

Combine the Results Parameter Estimates to Report n Average of estimate (b-weight) over 40 imputed datasets

Combine the Results Standard Errors to Report Weighted sum of: n “within imputation” variance

Combine the Results Standard Errors to Report Weighted sum of: n “within imputation” variance average squared standard error n usual kind of variability n “between imputation” variance sample variance of parameter estimates over 40 datasets n variability due to missing data

Materials for SPSS Regression Starting place http: //methodology. psu. edu n downloads (you will

Materials for SPSS Regression Starting place http: //methodology. psu. edu n downloads (you will need to get a free user ID to download all our free software) missing data software Joe Schafer's Missing Data Programs John Graham's Additional NORM Utilities http: //mcgee. hhdev. psu. edu/missing/index. html