Methods for Dealing with Time Dependent Confounding Daniel

  • Slides: 28
Download presentation
Methods for Dealing with Time. Dependent Confounding Daniel, Sousens, De Stavola, Kenward, Sterne (2013)

Methods for Dealing with Time. Dependent Confounding Daniel, Sousens, De Stavola, Kenward, Sterne (2013)

This article is a tutorial that goes through much of the classic papers on

This article is a tutorial that goes through much of the classic papers on the subject by Jamie Robins and colleagues • Robins JM (1986). A new approach to causal inference in mortality studies with a sustained exposure period – application to control of the healthy worker survivor effect. Mathematical Modelling 7: 1393 -1512. • Robins JM, Hernan MA, Brumback B (2000). Marginal structural models and causal inference in epidemiology. Epidemiology 11: 550 -560. • Robins JM, Blevins D, Ritter G, Wulfsohn M (1992). G-estimation of the effect of prophylaxis therapy for Pneumocystis carinii pneumonia on the survival of AIDS patients. Epidemiology 3: 319336. • Robins JM (1992). Estimation of the time-dependent accelerated failure time model in the presence of confounding factors. Biometrika 79: 321 -334. • …

Classic Confounding

Classic Confounding

Time-dependent confounder not on causal pathway

Time-dependent confounder not on causal pathway

Time-dependent confounder on causal pathway

Time-dependent confounder on causal pathway

Time-dependent confounder on causal pathway plus unmeasured variable

Time-dependent confounder on causal pathway plus unmeasured variable

Causal Effects • • 2. 4. 1. The existence of a causal effect 2.

Causal Effects • • 2. 4. 1. The existence of a causal effect 2. 4. 2. Characterizing the causal effect 2. 4. 3. Joint, total and direct causal effects 2. 4. 4. More parsimonious characterizations – Marginal structural models

Identification • No unmeasured confounding – i. e. , Sequentially ignorable treatment assignment –

Identification • No unmeasured confounding – i. e. , Sequentially ignorable treatment assignment – i. e. , Conditional exchangeability

Static and Dynamic Regimes • Static – Interventions being considered do not depend on

Static and Dynamic Regimes • Static – Interventions being considered do not depend on the values of a time-varying covariate. – Impact of being on ART. • Dynamic – Interventions being considered depend on the values of a time-varying covariate. – Impact of waiting to start ART until CD 4 drops below a certain level. – We’ll talk more about dynamic regimes later in the course.

Simple time-dependent confounding on causal pathway with unmeasured variable

Simple time-dependent confounding on causal pathway with unmeasured variable

G-computation • Standardization • G-computation formula – Generalization of standardization

G-computation • Standardization • G-computation formula – Generalization of standardization

Example E(Y|A 0=0, A 1=0) = (2. 5*270+2. 5*135+ 1. 5*228+1. 5*18)/ (270+135+228+18)= 2.

Example E(Y|A 0=0, A 1=0) = (2. 5*270+2. 5*135+ 1. 5*228+1. 5*18)/ (270+135+228+18)= 2. 12

Forcing the regimen A 0=0, A 1=0 E[Y 0, 0]= (2. 5*900+2. 5*300+1. 5*760+1.

Forcing the regimen A 0=0, A 1=0 E[Y 0, 0]= (2. 5*900+2. 5*300+1. 5*760+1. 5*40)/ (900+300+760+40) = 2. 1

Some intuition • P(u 0, a 0, l 1, a 1, y) =P(u 0)

Some intuition • P(u 0, a 0, l 1, a 1, y) =P(u 0) P(a 0|u 0) P(l 1|u 0, a 0) P(a 1|u 0, a 0, l 1) x P(y|u 0, a 0, l 1, a 1) • Force everyone to have a 0=0 and a 1=0. – Pearl would say, do(a 0=0, a 1=0). • P(u 0, do(a 0=0), l 1, do(a 1=0), y) =P(u 0) P(l 1|u 0, do(a 0=0)) P(y|u 0, do{a 0=0}, l 1, do{a 1=0}) =P(u 0) P(l 1|u 0, a 0=0) P(y|u 0, a 0=0, l 1, a 1=0) by do-calculus • E[Y(0, 0)]=∑u 0 ∑l 1 E[y|u 0, a 0=0, l 1, a 1=0] P(l 1|u 0, a 0=0) P(u 0) = ∑l 1 E[y|a 0=0, l 1, a 1=0] P(l 1|a 0=0) (after some math based on no confounding assumptions implied by the graph)

Discuss students’ implementation of simple example

Discuss students’ implementation of simple example

Other g-computation stuff • Estimating parameters of marginal structural model (done post-hoc) • Confidence

Other g-computation stuff • Estimating parameters of marginal structural model (done post-hoc) • Confidence intervals with bootstrap • Monte Carlo methods needed as number of time points increases • Addressing loss to follow-up under MAR assumption • Software

Inverse probability weighted (IPW) marginal structural models (MSM) • MSM: • Naïve model: •

Inverse probability weighted (IPW) marginal structural models (MSM) • MSM: • Naïve model: • IP Weights: • MSM estimated by fitting the naïve model with IP Weights.

Experimental treatment assignment assumption • We usually call this the positivity assumption

Experimental treatment assignment assumption • We usually call this the positivity assumption

Example P(A 0=0)=1/2 P(A 1=0|L 1=0, A 0=0) =(270+228)/(450+380)=0. 6 Therefore, IP weights for

Example P(A 0=0)=1/2 P(A 1=0|L 1=0, A 0=0) =(270+228)/(450+380)=0. 6 Therefore, IP weights for someone with A 1=0, L 1=0, A 0=0 are 1 / [0. 5 * 0. 6] = 10/3. Similarly, P(A 0=0)=1/2 P(A 1=1|L 1=0, A 0=0) = (180+152)/(450+380)=0. 4 Therefore, IP weights for someone with A 1=1, L 1=0, A 0=0 are 1 / [0. 5 * 0. 4] = 5. Notice that this weighting creates a pseudo-population with 900 subjects with A 1=0, L 1=0, A 0=0 (10/3 * 270 = 900) and 900 with A 1=1, L 1=0, A 0=0 (5 * 180 = 900).

After IPW E[Y 00]= (2. 5*900+2. 5*300+1. 5*760+1. 5*40)/ (900+300+760+40) = 2. 1

After IPW E[Y 00]= (2. 5*900+2. 5*300+1. 5*760+1. 5*40)/ (900+300+760+40) = 2. 1

Stabilized Weights • Stabilized weights with baseline covariates – MSM must include baseline covariates.

Stabilized Weights • Stabilized weights with baseline covariates – MSM must include baseline covariates.

Example P(A 0=0)=1/2 P(A 1=0|L 1=0, A 0=0) =270/450 P(A 1=0|A 0=0) =(270+135+228+18)/1000 Therefore,

Example P(A 0=0)=1/2 P(A 1=0|L 1=0, A 0=0) =270/450 P(A 1=0|A 0=0) =(270+135+228+18)/1000 Therefore, stabilized IP weights for someone with A 1=0, L 1=0, A 0=0 are ((1/2)* 651/1000) / [(1/2) * (270/450)] = 1. 085. Similarly, P(A 0=0)=1/2 P(A 1=1|L 1=0, A 0=0) = 180/450 P(A 1=1|A 0=0) =(180+15+152+2)/1000 Therefore, stabilized IP weights for someone with A 1=1, L 1=0, A 0=0 are (1/2)*(349/1000) / [(1/2) * (180/450)] = 1. 745/2.

Discuss student’s implementation of simple example

Discuss student’s implementation of simple example

Other IPW-MSM stuff • Loss to follow-up

Other IPW-MSM stuff • Loss to follow-up

Comparison of g-computation and IPW

Comparison of g-computation and IPW

Parametric Extensions • Everything is essentially the same except you can no longer use

Parametric Extensions • Everything is essentially the same except you can no longer use saturated models so smoothing assumptions are made. • Model misspecification can be an issue. • Collapsibility can also be an issue.

The g-null paradox • Given enough data, we would reject the causal null hypothesis

The g-null paradox • Given enough data, we would reject the causal null hypothesis even when it is true. • When standard parsimonious models are chosen for the conditional distributions of Y and L, then when combined using the gcomputation formula, the implied distribution for Ya-bar is a complex function of a-bar, so that Ya-bar is not independent of a-bar without additional assumptions.

Strengths and Limitations G-computation Strengths Limitations 1. Extension of standardization 2. Can handle complex

Strengths and Limitations G-computation Strengths Limitations 1. Extension of standardization 2. Can handle complex interventions 3. Each hypothetical intervention can be compared with no intervention (i. e. , the status quo) 1. MSM parameters are post hoc estimated 2. Computationally intensive 3. G-null paradox 4. Model misspecification is a big concern because fitting lots of models. IPW Strengths Limitations 1. Easier to explain to subject-matter experts. (? ) 2. Analyses are more easily implemented. 3. Only models for treatment assignment and MSM need to be specified. 4. Clear set of parameters that are estimated that can be used for testing. 1. Unstable and inefficient if extreme weights. Generally less efficient than properly specified g-computation. 2. Continuous treatments hard to handle. 3. Requires positivity assumption. 4. Interactions with time-varying covariates cannot be studied.