Unit 10 CaseControl CaseCrossover and CrossSectional Studies 1

Unit 10 Learning Objectives: 1. Understand design features of case-control, case-crossover, and cross-sectional studies.

Case Control Studies PRIMARY STEPS: l Define and select the cases. l Assemble an

Case Control Studies l Typically, compare the proportions of exposure by means of a

Defining Cases l Ensure cases are as homogenous as possible. Establish strict diagnostic criteria

Prevalent vs. Incident (Newly Diagnosed) Cases To the extent possible, avoid including prevalent cases!

Prevalent vs. Incident (Newly Diagnosed) Cases Why? l l Determinants of disease duration may

Prevalent vs. Incident (Newly Diagnosed) Cases Why? l With prevalent cases, it is more

Prevalent vs. Incident (Newly Diagnosed) Cases ---However, case-control studies of congenital malformations are inevitably

Selecting Cases l Sources of Cases: – Hospitals, medical care facilities, etc. – General

Selecting Cases l Note: DO NOT compromise validity in the goal of generalization. l

Defining and Selecting Cases l Ascertainment of Disease Status: – Case registries (i. e.

Selecting Controls l Axiom: Selection of an appropriate comparison group is the most difficult

Selecting Controls l Controls are subjects free of the disease (or outcome of interest).

Selecting Controls 1. The prevalence of exposure among controls should reflect the prevalence of

Selecting Controls 3. The time during which a subject is eligible to be a

Selecting Controls Sources of Controls: ------ General population Random digit dialing Neighborhood Friends/relatives Hospital

Selecting Controls General Population Controls: --- Population defined by geographic boundaries (or specific characteristics).

Selecting Controls General Population Controls: --- If entire population is sampled for cases and

Selecting Controls Random Digit Dialing Controls: --- May approximate random sampling from the source

Selecting Controls Neighborhood Controls: --- May approximate random sampling from the source population. ---

Selecting Controls Friend/Relative Controls: --- Tend to be more cooperative than general population controls.

Selecting Controls Friend/Relative Controls: --- The list of potential friend/relative controls is often derived

Selecting Controls Hospital/Clinic-Based Controls: --- Source population refers to people who “feed” the hospital

Selecting Controls Hospital/Clinic-based Controls: --- Controls are ill; distribution of the exposure may not

Selecting Controls General Remarks: --- Often, there is no perfect control group; several groups

Selecting Controls For each control group, how many controls per case? -- the optimal

Selecting Controls AXIOM: The benefit of increased sample size is not as relevant past

Ascertaining Exposure l Sources of exposure data (cases and controls): ---Study subjects (self-report). Particularly

Ascertaining Exposure l How far back should exposure be assessed? ---Define a part of

Nested Case-Control Study Definition: Hybrid design in which a casecontrol study is nested in

Nested Case-Control Study Advantages: ---Exposure data are collected before disease development; eliminates recall bias.

Summary – Case Control Studies ---Selection of an appropriate comparison group is the most

Summary – Case Control Studies Advantages: ---Relatively quick and inexpensive. ---Well suited to evaluation

Summary – Case Control Studies Disadvantages: ---Inefficient for evaluation of rare exposures unless the

Summary – Case Control Studies Disadvantages (cont. ): ---May be difficult to establish the

Review of Recommended Reading PPA and Risk of Hemorrhagic Stroke --- Case control study

Discussion Question 1 The investigators excluded stroke victims who died or did not have

Discussion Question 2 The primary time in which exposure to PPA was assessed occurred

Discussion Question 3 On average, cases were required to recall PPA exposure status over

Discussion Question 4 Not stated in the article, the participation rate for eligible subjects

Discussion Question 5 Interpret the results in table 4. Are the findings similar among

Slides: 43

Download presentation

Unit 10: Case-Control, Case-Crossover, and Cross-Sectional Studies 1

Unit 10 Learning Objectives: 1. Understand design features of case-control, case-crossover, and cross-sectional studies. 2. Understand strengths and limitations of casecontrol, case-crossover, and cross-sectional studies. 3. Recognize potential biases from case-control, case -crossover, and cross-sectional studies. 4. Recognize the impact of using prevalent versus incident cases in case-control studies. 5. Recognize the difficulty in selecting an appropriate control group in case-control studies. 6. Understand the nested case-control study design. 7. Recognize the difference between fixed and timedependent variables, including confounding 2 variables.

Case-Control Studies 3

Case Control Studies PRIMARY STEPS: l Define and select the cases. l Assemble an appropriate comparison group (controls). l Determine and compare the proportion of cases who have experienced the exposure of interest -- with the proportion of controls who experienced the exposure. 4

Case Control Studies l Typically, compare the proportions of exposure by means of a ratio: ODDS RATIO Odds for exposure among cases OR = Odds for exposure among controls D+ D- E+ a b E- c d OR = (a / c) ------(b / d) 5

Defining Cases l Ensure cases are as homogenous as possible. Establish strict diagnostic criteria (e. g. certain histologic characteristics). l Sub-definitions of cases such as definite, probable or possible may be needed. l Analysis can be conducted for each sub-group. 6

Prevalent vs. Incident (Newly Diagnosed) Cases To the extent possible, avoid including prevalent cases! WHY? 7

Prevalent vs. Incident (Newly Diagnosed) Cases Why? l l Determinants of disease duration may be related to the exposure such that the magnitude of the exposure (e. g. low vs. high) may be inaccurate. Prevalent cases with long disease duration may not accurately recall antecedent events. 8

Prevalent vs. Incident (Newly Diagnosed) Cases Why? l With prevalent cases, it is more difficult to ensure that reported events preceded disease development rather than being a consequence of the disease process. 9

Prevalent vs. Incident (Newly Diagnosed) Cases ---However, case-control studies of congenital malformations are inevitably based on prevalent cases. ---Prevalent cases are commonly used in studies of chronic conditions with illdefined onset times (e. g. multiple sclerosis). 10

Selecting Cases l Sources of Cases: – Hospitals, medical care facilities, etc. – General population - locate and obtain data from all or a random sample of individuals from a defined population. 11

Selecting Cases l Note: DO NOT compromise validity in the goal of generalization. l Select cases from a defined population in whom complete and reliable information can be obtained, and where the exposure/disease relationship is presumed to be present. 12

Defining and Selecting Cases l Ascertainment of Disease Status: – Case registries (i. e. cancer) – Office records of physicians – Hospital admission or discharge records – Pathology department log books 13

Selecting Controls l Axiom: Selection of an appropriate comparison group is the most difficult and critical issue in the design of case-control studies. 14

Selecting Controls l Controls are subjects free of the disease (or outcome of interest). – Controls are seldom subjected to medical exam to rule out the disease of interest. – Usually, they are assumed disease free if they have not been diagnosed. 15

Selecting Controls 1. The prevalence of exposure among controls should reflect the prevalence of exposure in the source population. 2. Controls should come from the same source population as cases (e. g. would have been cases if diagnosed with the disease). 16

Selecting Controls 3. The time during which a subject is eligible to be a control should be the time in which the individual is also eligible to be a case. If #1, #2, or #3 are not met = Selection Bias 17

Selecting Controls Sources of Controls: ------ General population Random digit dialing Neighborhood Friends/relatives Hospital or clinic-based 18

Selecting Controls General Population Controls: --- Population defined by geographic boundaries (or specific characteristics). --- Cases may include all cases, or a random sample of all cases. --- Controls should be a random sample of non-diseased individuals eligible to be cases. 19

Selecting Controls General Population Controls: --- If entire population is sampled for cases and controls, can calculate incidence rates of disease in exposed and nonexposed. --- Selection of controls may be costly, time consuming, and exposure recall may not be as accurate as “sick” controls. --- Subjects in general population may be less motivated to participate than hospital-based controls. 20

Selecting Controls Random Digit Dialing Controls: --- May approximate random sampling from the source population. --- Controls are often matched to cases on area code and prefix (i. e. SES matching). --- Probability of contacting each eligible subject may differ due to time of day, number in household, answering machines, etc. 21

Selecting Controls Neighborhood Controls: --- May approximate random sampling from the source population. --- Controls are often matched to cases from the same neighborhood. --- If cases are from a particular hospital, neighborhood controls may include people who would not have been treated at the same hospital had they developed the disease (e. g. VA hospital). 22

Selecting Controls Friend/Relative Controls: --- Tend to be more cooperative than general population controls. --- Often similar to cases on factors such as SES, lifestyle, and ethnic background. --- However, being named as a friend by the case may be related to exposure status of the potential control. 23

Selecting Controls Friend/Relative Controls: --- The list of potential friend/relative controls is often derived from the case; this dependence may add a potential source of bias. --- Hence, friend/relative controls may be too similar to cases regarding the exposure of interest. 24

Selecting Controls Hospital/Clinic-Based Controls: --- Source population refers to people who “feed” the hospital or clinic. --- Usually easier and less expensive than general population controls. --- May be more aware of exposures and likely to cooperate than general population controls (healthier). 25

Selecting Controls Hospital/Clinic-based Controls: --- Controls are ill; distribution of the exposure may not reflect the distribution of exposure in the source population for cases. --- Controls should be limited to diagnoses for which there is no prior indication of a relation with exposure. --- Subjects may have changed their exposure status as a result of being sick. 26

Selecting Controls General Remarks: --- Often, there is no perfect control group; several groups can be selected, if feasible. --- If study results are consistent across control groups, may indicate a valid result, but also possibly similar net bias. --- If different effects are observed, may provide useful information as to nature of the association or potential biases. 27

Selecting Controls For each control group, how many controls per case? -- the optimal case-control ratio is 1: 1 -- when the number of cases is small, the sample size for the study can be increased by using more than one control e. g. 1: 2 1: 3 1: 4 28

Selecting Controls AXIOM: The benefit of increased sample size is not as relevant past the 1: 4 ratio (e. g increase in statistical power). 29

Ascertaining Exposure l Sources of exposure data (cases and controls): ---Study subjects (self-report). Particularly vulnerable to recall bias as cases may recall their exposure history more thoroughly than controls. ---Records (preferably completed before the occurrence of outcome events). ---Interviews with surrogates (spouses, siblings, etc. ). 30

Ascertaining Exposure l How far back should exposure be assessed? ---Define a part of the person’s exposure history considered relevant to the etiology of disease (e. g. the “empirical induction” period). ---Code the exposure data in an etiologically-relevant manner (e. g. magnitude of exposure, years of exposure, ever exposed, etc. ). 31

Nested Case-Control Study Definition: Hybrid design in which a casecontrol study is nested in a cohort study. Cohort Study Population Subjects Develop Disease CASES Exposure Status Ascertained Do Not Develop Disease CONTROLS 32

Nested Case-Control Study Advantages: ---Exposure data are collected before disease development; eliminates recall bias. ---Can be economical if complete exposure ascertainment is limited to only cases and controls nested in the total cohort. Often used in occupational epidemiology where the occupational cohort is the source population. 33

Summary – Case Control Studies ---Selection of an appropriate comparison group is the most challenging and important aspect of the study design. ---In population-based studies, incidence can be calculated when entire population is sampled. ---Hospital-based studies are often easiest and cheapest to conduct, but may be prone to biased exposure ascertainment. 34

Summary – Case Control Studies Advantages: ---Relatively quick and inexpensive. ---Well suited to evaluation of diseases with long induction periods. ---Optimal for evaluation of rare diseases. ---Can examine multiple etiologic factors for a single disease. 35

Summary – Case Control Studies Disadvantages: ---Inefficient for evaluation of rare exposures unless the disease is common among the exposed. ---If not population based, cannot compute incidence among the exposed and nonexposed. 36

Summary – Case Control Studies Disadvantages (cont. ): ---May be difficult to establish the temporal relationship between exposure and disease. ---Prone to bias compared to other analytic designs, in particular, selection and recall bias. 37

Review of Recommended Reading PPA and Risk of Hemorrhagic Stroke --- Case control study investigating exposure to products containing phenylpropanolamine (PPA) and risk of hemorrhagic stroke in persons 18 -49 years of age. --- 702 cases and 1, 376 matched control subjects (randomdigit dialing) from 43 hospitals in 4 states (1994 to 1999) --- Multiple definitions of exposure to PPA, including any use, first use, specific type of product (i. e. appetite suppressant). --- Primary focal time to assess prior exposure history was day/time that symptoms led subject to seek medical attention. --- Trained interviewers used structured instrument to document prior exposure history. --- Analyses conducted separately for men and women. 38

Discussion Question 1 The investigators excluded stroke victims who died or did not have the ability to communicate because they felt that proxy data (i. e. spouse) on exposure status would be unreliable. How might this exclusion of potential case subjects bias (if at all) the study results? Source: NEJM 2000; 343: 1826 -1832. 39

Discussion Question 2 The primary time in which exposure to PPA was assessed occurred immediately preceding the time in which medical attention was sought. What type of bias (if any) could this strategy have introduced? Source: NEJM 2000; 343: 1826 -1832. 40

Discussion Question 3 On average, cases were required to recall PPA exposure status over a more remote period than control subjects. Do you think this strategy offset the potential greater motivation for cases to recall exposures to over-thecounter medications than control subjects? Source: NEJM 2000; 343: 1826 -1832. 41

Discussion Question 4 Not stated in the article, the participation rate for eligible subjects was 75% for cases compared to 36% for controls. How might this differential rate of participation bias (if at all) the study results? Source: NEJM 2000; 343: 1826 -1832. 42

Discussion Question 5 Interpret the results in table 4. Are the findings similar among men and women? Source: NEJM 2000; 343: 1826 -1832. 43