2nd year case control studies 2011 moodle

Case-Control Studies

Dr Zo Fox

Institute of Neurology and the Research Department of Primary Care and Population Health, UCL

Epidemiology Lectures in Year 1

Measurement of disease (morbidity & mortality)

Prevalence/incidence/relative risk

Variations in disease

Standardised Mortality Ratios (SMRs)

Causation

Observational/experimental studies

Cohort studies

Objectives of todays lecture

Basic features of case-control studies

Measures of association between exposure and disease

Strengths and limitations

Sources of bias

Different purposes: descriptive (often routine data) versus analytical (often new data collected)

Observational designs:

Cross-sectional

Longitudinal (cohort)

Case-control

Ecological

Experimental/Interventional designs

Randomised trials

Epidemiological studies:

Observational study designs:

Observational studies

Cross-sectional

Cohort

Case-control

Type of analysis developed in the 1950s as an approach to the problem of investigating risk factors

for diseases with long latent periods and rare

diseases

An observational design in which individuals are selected on the basis of whether they do (cases) or do

not (controls) have the disease under investigation

Case-control studies

Case-control studies

Observational

Analytical

Retrospective

Ideal for rare diseases

Quick

Cheap

Can investigate aetiology

Basic steps in a case-control study

Definition of a case (symptoms; duration)

Selection of cases (patients with certain disease condition)

Definition of controls (subjects without the condition)

Selection of controls (hospital, community...)

Measurement of exposure

Comparing frequency of exposure in cases and controls

Cohort Start

Unexposed

Exposed

All healthy Follow-up (wait)

Disease

assessment

Controls

Cases

Start

Look Back

Case-Control

E+ E-

E+ E-

D- D+

D- D+

Disease

(cases)

No disease

(controls)

Exposed to factor

Not

exposed to factor

Exposed to factor

Not

exposed to factor

Compare cases

and controls

Past time Trace individuals

Starting point

A traditional case-control study

Study Population

Ineligible Eligible

Participation

Outcome: Yes (cases)

Exposed Unexposed

Outcome: No (controls)

Exposed Unexposed

No participation

Case-control study

Selection of cases (1)

Set of standardised criteria used to identify cases

Criteria should be precise and unambiguous and might be based on:

Clinical

Histological

Specific category of diagnosis (e.g. CDC AIDS definition, Code on death certificates, etc.)

Selection of cases (2)

Incident or prevalent cases

Incident cases: all new cases identified in the study population over a specific time period

Prevalent cases: cases within the study population who were already diagnosed with the disease at the time of

designing the analysis

Selection of controls (1)

Must not have the outcome in question

Should provide an estimate of the level of exposure in those without the outcome

Controls are selected from the same population group from which the cases are drawn

General population (random digit dialling, registers/voting lists, etc)

People using the same health service (hospital controls)

Friends/relatives/neighbours of cases

Selection of controls (2)

More than one control per case?

Multiple control groups of different types?

Matching?

Cases Controls Representative? Bias

Patients admitted Controls more Spurious

to Hospital with HIV likely to have reduction

Patients low CD4+/CD8+ of OR

with Hodgkin

lymphoma

in Hospital

Patients admitted Controls more Spurious

to Hospital for renal likely to have increase

transplantation high CD4+/CD8+ of OR

Selection bias from a poor choice of

controls: looking at CD4+/CD8+ ratio in NHL

Example: Exposure to armadillos and leprosy Indian J Dermatol Venereol Leprol 2008;74:338-42, Deps PD, Alves BL et al.

Objective: To see if exposure to armadillos is associated with leprosy

Setting: Case-control study of 506 individuals with leprosy and 594

individuals without leprosy in four clinics that participated in the national

leprosy control programme in Brazil. Controls were patients with other

chronic diseases from these clinics

Risk factor of interest: Exposure to armadillos

Outcome: Leprosy

Follow-up: Followed back in time from a diagnosis of leprosy

Conclusions: Direct exposure to armadillos was reported by 68% of

leprosy cases and by 48% of controls. Exposure to armadillos was

significantly higher among those with leprosy compared to those without

leprosy

No Leprosy

N=594

Leprosy

N=506

Start

Look Back

Case-Control

285

(48%)

160

(32%)

309

(52%)

346

(68%)

Armadillo exposure in cases and controls (%)

Pe

rce

nta

ge

(%

)

Odds ratio of exposure

An approximation of relative risk (risk ratio)

A good estimate of the risk ratio when the disease is rare

How much more likely are the cases to have been exposed as compared to the controls?

So, in this example we want to know how much more likely

is exposure to armadillos among those with leprosy

compared to those without leprosy

Odds ratio = Odds of exposure among controls

Odds of exposure among cases

Calculating the odds ratio:

Probability of being

exposed among cases

Probability of NOT being

exposed among cases

Odds of exposure among cases =

Odds of exposure

among cases a

= c

a/(a+c)

= c/(a+c)

Disease +

(Cases)

Disease -

(Controls)

Exposure + a b

Exposure - c d


Odds Ratio = a/c ad

b/d bc =

Odds of exposure among cases = a/c

Odds of exposure among controls = b/d

Disease +

(Cases)

Disease -

(Controls)

Exposure + a b

Exposure - c d


Odds ratio for leprosy according to direct

armadillo exposure:

Leprosy Cases

Controls

Direct exposure to armadillos

346

285

No exposure to armadillos

160

309

Odds ratio = 346 x 309 / 285 x 160 = 2.34

Interpretation of OR

People who have direct contact with armadillos have a 2.34 times higher odds of contracting leprosy than those

with indirect or no exposure to armadillos

Those with indirect or no exposure have less than half the odds (i.e. 1/2.34 = 0.43) of contracting leprosy compared

to those with direct exposure

You can convert the odds back to probabilities to work out the probability of getting the outcome depending on

numerous factors

Confounding (1)

Contact with

armadillos Leprosy

BCG vaccination

Confounding (2)

No real relationship between contact with armadillos and risk of leprosy

The relationship may be confounded by factors such as having a BCG vaccination

Having a BCG vaccination is a confounder

BCG vaccination needs to be adjusted for (controlled for) in the analysis or patients need to be matched on BCG

vaccination

Matching

Matching is done to increase the power of the study and to control for confounding factors

It is important to restrict matching to potential confounding factors and not to match patients on the

exposure under investigation

It is not possible to evaluate the association between matching factors and outcome

Matched case-control studies

Match on factors that are likely to be common causes for the exposure and the disease

It may be impractical to match patients on some factors (e.g. if there are many strata, it is difficult to find suitable

controls)

Conditional logistic regression required

More than one control can be selected for each case

Example: matching in the study of

armadillos and leprosy Risk of leprosy depends on education and BCG vaccination;

BCG vaccination less likely to get leprosy

Worse education more likely to get leprosy

Matching cases and controls on education level and BCG

vaccination will eliminate confounding by these factors

For each case [no education; BCG = Y] that is recruited, one or

more controls [no education; BCG = Y] is recruited

For each case [no education; BCG = N] that is recruited, one or

more controls [no education; BCG = N] is recruited

Other ways to control for confounding

Adjustment in analysis:

Stratified analysis (e.g. within different education levels no education, minimal education,

Example: MMR and autism Lancet 2004;364;9438:963-69, Smeeth L, Cook C et al.

Objective: To see whether MMR vaccination is associated with risk of autism or other pervasive developmental disorders (PDD)

Setting: Case-control study of 1294 individuals born >1973 with a

diagnosis of pervasive developmental disorders (PDD) while registered

with a GP between 1987-2001. Controls were 4469 individuals with no

diagnosis of PDD recorded matched on age, sex, and GP to cases.

Risk factor of interest: Exposure to MMR vaccination

Outcome: Autism diagnosis

Conclusions: MMR vaccination was not associated with an increased

risk of PDD. The odds ratio for association between MMR and pervasive

developmental disorder was 086 (95% CI: 068109)

Selection bias:

It may be difficult to find appropriate controls

Measurement bias (recall bias/observer bias):

The retrospective measurement of exposure may be

problematic:

Inaccurate - will underestimate any relation

Reporting bias - overestimation

Reverse causation - spurious association

Case-control study: Bias

Reverse causality example:

Low physical activity is related to higher risk of hip fracture in

a case control study

But people with hip fracture (or history of hip fracture) have

lower physical activity

This can lead to a spurious association - temporality must be

clarified


Cases may be more likely to remember events that

occurred around the time in which they were diagnosed

with the disease (recall bias)

Example: cigarette smoking and lung cancer; people with lung cancer may, consciously or not, tend to exaggerate their

level of exposure to smoking if they believe that cancer was

caused by smoking

Assessment of exposure blind to case/control status is

not always possible (observer bias)


Multiple choice questions (1):

A) Case-control

B) RCT

C) Cohort study

D) None of the above

What types of study can you perform to identify predictors of

struggling (i.e. academic difficulties, course termination or

failure of 3 preclinical exams) during medical training?


Overall, 123 students were identified as strugglers and 492 as controls. There were 56/123 (45.5%) strugglers and 130/492 (26.4%) controls who had negative comments on

their UCAS form. What is the estimated odds ratio of a

negative UCAS comment for strugglers vs. controls?

A) -1.3

B) 1.7

C) 2.3

D) 6.0

E) None of the above, because odds ratios cannot be

estimated from a case-control study.


A Chi-squared test gave a p-value of

Adjusted odds ratio

In this study, there was an OR (95% CI) of 2.33 (1.55 to 3.50)

After adjustment for male gender, the odds ratio marginally decreases to 2.25 (1.44 to 3.50). Why?

Case-control study nested in cohort/RCT

Using an existing cohort study/RCT

Cases: patients who developed the disease

Controls: a random sample of patients without the disease from the cohort easier to find relevant matches

Controls are chosen among those at risk of becoming cases

Cost saving by collecting data on a sample of controls instead of the entire cohort at risk

Rationale: to reduce cost with laboratory measurements

Advantage: no reporting / measurement bias because data have been recorded as part of the cohort study

Nested case-control study

Quick (cases already exist, no need to wait)

Cheap (not necessary to examine large number of people)

Can examine many exposures

Suitable to study rare diseases or diseases where there is a long latency between exposure and manifestation

Suitable to study stable exposures (e.g. genetic markers)

Sufficient power achievable with relatively small sample size

Case-control study: strengths

Selection of cases/controls can be troublesome

Not suitable for rare exposure

Cannot calculate incidence risk or death rates

Prone to bias (selection bias, recall bias, observer bias)

In controls, exposure to risk factors and confounders should be

representative of that in the population "at risk" of becoming cases

Exposures of controls should be measurable with similar accuracy

to those of cases

Limited to a single outcome variable

It is not possible to validate responses

Correct interpretation of results is difficult

Case-control study: limitations

It is not usually possible to establish a sequence of events so you are not likely to be able to estimate incidence rates

It is not possible to estimate the risk or risk ratios because we have artificially selected the number of cases relative to

the number of controls.

You can calculate:

Conditional Odds (matched analysis)

Odds of having the exposure and Odds ratios

Statistical Considerations

Conclusions

Case-control studies are valuable for studying disease aetiology

Like other designs, a case-control study has unique strengths and limitations that must be considered when selecting this

particular strategy

More susceptible to bias than other designs; potential sources of bias should ideally be recognised in the design phase in order

to minimise their effects

Could be used as a quick/cheap first step to identify potential risk factors for a disease

2nd year case control studies 2011 moodle

Documents

disease controls

cases controls representative

new cases

prevalent cases incident

cases criteria

casecontrol study definition

controls cohort

outcome controls