2nd year case control studies 2011 moodle

44
Case-Control Studies Dr Zoë Fox Institute of Neurology and the Research Department of Primary Care and Population Health, UCL

Upload: hallixrap

Post on 24-Sep-2015

219 views

Category:

Documents


3 download

DESCRIPTION

ucl 2nd year case control studies

TRANSCRIPT

  • Case-Control Studies

    Dr Zo Fox

    Institute of Neurology and the Research Department of Primary Care and Population Health, UCL

  • Epidemiology Lectures in Year 1

    Measurement of disease (morbidity & mortality)

    Prevalence/incidence/relative risk

    Variations in disease

    Standardised Mortality Ratios (SMRs)

    Causation

    Observational/experimental studies

    Cohort studies

  • Objectives of todays lecture

    Basic features of case-control studies

    Measures of association between exposure and disease

    Strengths and limitations

    Sources of bias

  • Different purposes: descriptive (often routine data) versus analytical (often new data collected)

    Observational designs:

    Cross-sectional

    Longitudinal (cohort)

    Case-control

    Ecological

    Experimental/Interventional designs

    Randomised trials

    Epidemiological studies:

  • Observational study designs:

    Observational studies

    Cross-sectional

    Cohort

    Case-control

  • Type of analysis developed in the 1950s as an approach to the problem of investigating risk factors

    for diseases with long latent periods and rare

    diseases

    An observational design in which individuals are selected on the basis of whether they do (cases) or do

    not (controls) have the disease under investigation

    Case-control studies

  • Case-control studies

    Observational

    Analytical

    Retrospective

    Ideal for rare diseases

    Quick

    Cheap

    Can investigate aetiology

  • Basic steps in a case-control study

    Definition of a case (symptoms; duration)

    Selection of cases (patients with certain disease condition)

    Definition of controls (subjects without the condition)

    Selection of controls (hospital, community...)

    Measurement of exposure

    Comparing frequency of exposure in cases and controls

  • Cohort Start

    Unexposed

    Exposed

    All healthy Follow-up (wait)

    Disease

    assessment

    Controls

    Cases

    Start

    Look Back

    Case-Control

    E+ E-

    E+ E-

    D- D+

    D- D+

  • Disease

    (cases)

    No disease

    (controls)

    Exposed to factor

    Not

    exposed to factor

    Exposed to factor

    Not

    exposed to factor

    Compare cases

    and controls

    Past time Trace individuals

    Starting point

    A traditional case-control study

  • Study Population

    Ineligible Eligible

    Participation

    Outcome: Yes (cases)

    Exposed Unexposed

    Outcome: No (controls)

    Exposed Unexposed

    No participation

    Case-control study

  • Selection of cases (1)

    Set of standardised criteria used to identify cases

    Criteria should be precise and unambiguous and might be based on:

    Clinical

    Histological

    Specific category of diagnosis (e.g. CDC AIDS definition, Code on death certificates, etc.)

  • Selection of cases (2)

    Incident or prevalent cases

    Incident cases: all new cases identified in the study population over a specific time period

    Prevalent cases: cases within the study population who were already diagnosed with the disease at the time of

    designing the analysis

  • Selection of controls (1)

    Must not have the outcome in question

    Should provide an estimate of the level of exposure in those without the outcome

    Controls are selected from the same population group from which the cases are drawn

    General population (random digit dialling, registers/voting lists, etc)

    People using the same health service (hospital controls)

    Friends/relatives/neighbours of cases

  • Selection of controls (2)

    More than one control per case?

    Multiple control groups of different types?

    Matching?

  • Cases Controls Representative? Bias

    Patients admitted Controls more Spurious

    to Hospital with HIV likely to have reduction

    Patients low CD4+/CD8+ of OR

    with Hodgkin

    lymphoma

    in Hospital

    Patients admitted Controls more Spurious

    to Hospital for renal likely to have increase

    transplantation high CD4+/CD8+ of OR

    Selection bias from a poor choice of

    controls: looking at CD4+/CD8+ ratio in NHL

  • Example: Exposure to armadillos and leprosy Indian J Dermatol Venereol Leprol 2008;74:338-42, Deps PD, Alves BL et al.

    Objective: To see if exposure to armadillos is associated with leprosy

    Setting: Case-control study of 506 individuals with leprosy and 594

    individuals without leprosy in four clinics that participated in the national

    leprosy control programme in Brazil. Controls were patients with other

    chronic diseases from these clinics

    Risk factor of interest: Exposure to armadillos

    Outcome: Leprosy

    Follow-up: Followed back in time from a diagnosis of leprosy

    Conclusions: Direct exposure to armadillos was reported by 68% of

    leprosy cases and by 48% of controls. Exposure to armadillos was

    significantly higher among those with leprosy compared to those without

    leprosy

  • No Leprosy

    N=594

    Leprosy

    N=506

    Start

    Look Back

    Case-Control

    285

    (48%)

    160

    (32%)

    309

    (52%)

    346

    (68%)

  • Armadillo exposure in cases and controls (%)

    Pe

    rce

    nta

    ge

    (%

    )

  • Odds ratio of exposure

    An approximation of relative risk (risk ratio)

    A good estimate of the risk ratio when the disease is rare

    How much more likely are the cases to have been exposed as compared to the controls?

    So, in this example we want to know how much more likely

    is exposure to armadillos among those with leprosy

    compared to those without leprosy

  • Odds ratio = Odds of exposure among controls

    Odds of exposure among cases

    Calculating the odds ratio:

    Probability of being

    exposed among cases

    Probability of NOT being

    exposed among cases

    Odds of exposure among cases =

  • Odds of exposure

    among cases a

    = c

    a/(a+c)

    = c/(a+c)

    Disease +

    (Cases)

    Disease -

    (Controls)

    Exposure + a b

    Exposure - c d

    Calculating the odds ratio:

  • Odds Ratio = a/c ad

    b/d bc =

    Odds of exposure among cases = a/c

    Odds of exposure among controls = b/d

    Disease +

    (Cases)

    Disease -

    (Controls)

    Exposure + a b

    Exposure - c d

    Calculating the odds ratio:

  • Odds ratio for leprosy according to direct

    armadillo exposure:

    Leprosy Cases

    Controls

    Direct exposure to armadillos

    346

    285

    No exposure to armadillos

    160

    309

    Odds ratio = 346 x 309 / 285 x 160 = 2.34

  • Interpretation of OR

    People who have direct contact with armadillos have a 2.34 times higher odds of contracting leprosy than those

    with indirect or no exposure to armadillos

    Those with indirect or no exposure have less than half the odds (i.e. 1/2.34 = 0.43) of contracting leprosy compared

    to those with direct exposure

    You can convert the odds back to probabilities to work out the probability of getting the outcome depending on

    numerous factors

  • Confounding (1)

    Contact with

    armadillos Leprosy

    BCG vaccination

  • Confounding (2)

    No real relationship between contact with armadillos and risk of leprosy

    The relationship may be confounded by factors such as having a BCG vaccination

    Having a BCG vaccination is a confounder

    BCG vaccination needs to be adjusted for (controlled for) in the analysis or patients need to be matched on BCG

    vaccination

  • Matching

    Matching is done to increase the power of the study and to control for confounding factors

    It is important to restrict matching to potential confounding factors and not to match patients on the

    exposure under investigation

    It is not possible to evaluate the association between matching factors and outcome

  • Matched case-control studies

    Match on factors that are likely to be common causes for the exposure and the disease

    It may be impractical to match patients on some factors (e.g. if there are many strata, it is difficult to find suitable

    controls)

    Conditional logistic regression required

    More than one control can be selected for each case

  • Example: matching in the study of

    armadillos and leprosy Risk of leprosy depends on education and BCG vaccination;

    BCG vaccination less likely to get leprosy

    Worse education more likely to get leprosy

    Matching cases and controls on education level and BCG

    vaccination will eliminate confounding by these factors

    For each case [no education; BCG = Y] that is recruited, one or

    more controls [no education; BCG = Y] is recruited

    For each case [no education; BCG = N] that is recruited, one or

    more controls [no education; BCG = N] is recruited

  • Other ways to control for confounding

    Adjustment in analysis:

    Stratified analysis (e.g. within different education levels no education, minimal education,

  • Example: MMR and autism Lancet 2004;364;9438:963-69, Smeeth L, Cook C et al.

    Objective: To see whether MMR vaccination is associated with risk of autism or other pervasive developmental disorders (PDD)

    Setting: Case-control study of 1294 individuals born >1973 with a

    diagnosis of pervasive developmental disorders (PDD) while registered

    with a GP between 1987-2001. Controls were 4469 individuals with no

    diagnosis of PDD recorded matched on age, sex, and GP to cases.

    Risk factor of interest: Exposure to MMR vaccination

    Outcome: Autism diagnosis

    Conclusions: MMR vaccination was not associated with an increased

    risk of PDD. The odds ratio for association between MMR and pervasive

    developmental disorder was 086 (95% CI: 068109)

  • Selection bias:

    It may be difficult to find appropriate controls

    Measurement bias (recall bias/observer bias):

    The retrospective measurement of exposure may be

    problematic:

    Inaccurate - will underestimate any relation

    Reporting bias - overestimation

    Reverse causation - spurious association

    Case-control study: Bias

  • Reverse causality example:

    Low physical activity is related to higher risk of hip fracture in

    a case control study

    But people with hip fracture (or history of hip fracture) have

    lower physical activity

    This can lead to a spurious association - temporality must be

    clarified

    Case-control study: Bias

  • Cases may be more likely to remember events that

    occurred around the time in which they were diagnosed

    with the disease (recall bias)

    Example: cigarette smoking and lung cancer; people with lung cancer may, consciously or not, tend to exaggerate their

    level of exposure to smoking if they believe that cancer was

    caused by smoking

    Assessment of exposure blind to case/control status is

    not always possible (observer bias)

    Case-control study: Bias

  • Multiple choice questions (1):

    A) Case-control

    B) RCT

    C) Cohort study

    D) None of the above

    What types of study can you perform to identify predictors of

    struggling (i.e. academic difficulties, course termination or

    failure of 3 preclinical exams) during medical training?

  • Multiple choice questions (2):

    Overall, 123 students were identified as strugglers and 492 as controls. There were 56/123 (45.5%) strugglers and 130/492 (26.4%) controls who had negative comments on

    their UCAS form. What is the estimated odds ratio of a

    negative UCAS comment for strugglers vs. controls?

    A) -1.3

    B) 1.7

    C) 2.3

    D) 6.0

    E) None of the above, because odds ratios cannot be

    estimated from a case-control study.

  • Multiple choice questions (3):

    A Chi-squared test gave a p-value of

  • Adjusted odds ratio

    In this study, there was an OR (95% CI) of 2.33 (1.55 to 3.50)

    After adjustment for male gender, the odds ratio marginally decreases to 2.25 (1.44 to 3.50). Why?

  • Case-control study nested in cohort/RCT

    Using an existing cohort study/RCT

    Cases: patients who developed the disease

    Controls: a random sample of patients without the disease from the cohort easier to find relevant matches

    Controls are chosen among those at risk of becoming cases

    Cost saving by collecting data on a sample of controls instead of the entire cohort at risk

    Rationale: to reduce cost with laboratory measurements

    Advantage: no reporting / measurement bias because data have been recorded as part of the cohort study

    Nested case-control study

  • Quick (cases already exist, no need to wait)

    Cheap (not necessary to examine large number of people)

    Can examine many exposures

    Suitable to study rare diseases or diseases where there is a long latency between exposure and manifestation

    Suitable to study stable exposures (e.g. genetic markers)

    Sufficient power achievable with relatively small sample size

    Case-control study: strengths

  • Selection of cases/controls can be troublesome

    Not suitable for rare exposure

    Cannot calculate incidence risk or death rates

    Prone to bias (selection bias, recall bias, observer bias)

    In controls, exposure to risk factors and confounders should be

    representative of that in the population "at risk" of becoming cases

    Exposures of controls should be measurable with similar accuracy

    to those of cases

    Limited to a single outcome variable

    It is not possible to validate responses

    Correct interpretation of results is difficult

    Case-control study: limitations

  • It is not usually possible to establish a sequence of events so you are not likely to be able to estimate incidence rates

    It is not possible to estimate the risk or risk ratios because we have artificially selected the number of cases relative to

    the number of controls.

    You can calculate:

    Conditional Odds (matched analysis)

    Odds of having the exposure and Odds ratios

    Statistical Considerations

  • Conclusions

    Case-control studies are valuable for studying disease aetiology

    Like other designs, a case-control study has unique strengths and limitations that must be considered when selecting this

    particular strategy

    More susceptible to bias than other designs; potential sources of bias should ideally be recognised in the design phase in order

    to minimise their effects

    Could be used as a quick/cheap first step to identify potential risk factors for a disease