Title: Analytic Epidemiology
1Analytic Epidemiology
- Determining the Etiology of Disease
2Study Development Process
Descriptive Studies Data Collection and Analysis
Model Building and hypothesis formulation
Analyze results and retest
Analytic Studies for Hypothesis testing
3Causality and Causal Relationships
- Must have statistical significance
- Association may be either positive or negative
(if positive, the association is higher than
expected if negative the association is lower
than expected) - Must try to rule out noise (assuring the
comparison of apples to apples by controlling
confounding factors)
4Artifactual or Spurious Associations
- A false or fictitious association can result from
chance occurrence or bias in the study methods - Type 1error occurs from random fluctuation
- Through retesting, we can determine spurious
relationships. - Non-causal associations take place when a factor
and disease are associated indirectly
5Causal Association
- Strength of association
- Dose-Response Relationship
- Consistency of the Association
- Temporally Correct Association
- Specificity of the Association
- Coherence with Existing Information
6Sources of Data
- Primary data information collected directly by
the researcher - Secondary data data that has already been
collected and stored for analysis
7Types of Surveys
- Administrative surveys, medical records, vital
records and statistical data - Telephone surveys
- Self-administered surveys
- Personal interviews
8Measurement Issues
- Measurement is an attempt to assign numbers to
observations according to a set of rules - Variables can be categorical or continuous
- Intent is to translate observations into a system
that allows assessment of the hypothesis
9Types of Categorical Variables
- Nominal variables assigns name or number purely
on arbitrary basis (e.g., race, sex) - Ordinal variables measures assigned from
(typically) a lesser to greater value - Interval variables scale that assigns a number
to an observation based on a constant unit of
measurement - Ratio assigns numbers to observations to
reflect a true point
10Improving the Survey
- How is measure administered?
- Has it been used on similar situations with
success? - Is measure understandable by those being
surveyed? - Is sample accessible and identifiable?
- Is special training required?
- What is length of time in measurement?
- Are results available in timely manner?
11Reliability and Validity Issues
- Reliability the extent to which a measurement
has stability and homogeneity - Validity represents the precision to which the
measure is truly measuring the phenomena being
measured (measure must be reliable to be valid)
12Reliability and Validity Issues
- Content Validity the extent to which the
measure reflects the full concept being studied - Criterion Validity assessed by comparing the
test measure of the phenomenon
13Sensitivity
- Sensitivity (Se) measures how accurately the
test identifies those with the condition or
trait, i.e., correctly identifies or captures
true positives - High sensitivity is needed
- When early treatment is important
- When identification of every case is important
14Specificity
- Specificity (Sp) measures how accurately the
test identifies those without the condition or
trait, i.e., correctly identifies or excludes the
true negatives. - High specificity is needed when
- Re-screening is impractical
- When reducing false positive is important
15Factors to consider in setting cutoffs
- Cost of false positives v. false negatives
- Importance of capturing all cases
- Likelihood population will be re-screened
- Prevalence of the disease (Pe)
- Low Pe requires high Sp, otherwise too many false
positives - High Pe requires high Se, otherwise too many
false negatives
16(No Transcript)
17Determining SE and SP Rates
- SE TP / (TP FN)
- SP Specificity TN / (TN FP)
- False Neg. Rate 1 SE
- False Pos. Rate 1 - SP
- Positive Predictive Value
- TP/(TP FP)
- Negative Predictive Value
- TN/(TN FN)
18SE and PE Example
- You need to test the validity of cervical smears
pap smears to determine the presence of
cancer of the cervix. Smears were taken from 120
women known to have cancer of the cervix and from
580 women who did not have cancer of the cervix.
In the laboratory, the smears were read blind
as positive or negative. Of the total 700
smears, 200 were read as positive, 110 of which
came from the proven cancer cases.
19The Research Cycle
Theory
Operational hypothesis
Empirical findings
Observations and measurements
Statistical Tests
20Types of Studies
- Cross-sectional prevalence rates that may
suggest association (good for developing theory,
but no causal association) - Retrospective (Case-control) good for rare
diseases and initial etiologic studies - Prospective (cohort, longitudinal, follow-up)
yields incidence rates and estimates for risk.
Better for causal association. - Experimental (intervention studies) strongest
evidence for etiology
21Considerations for Study Design
- Stage of hypothesis development
- Nature of disease
- Nature of Exposure
- Nature of study population
- Context of research
22Cross-Sectional Studies
- Single point in time (snapshot studies)
- Risk factors and disease measured at the same
time - Determines prevalence ratios
23Cross-Sectional Study Design
Cases
Exposed
Non-Cases
Sample Population
Cases
Non-Exposed
Non-Cases
24Advantages and Disadvantages of Cross Sectional
Studies
- Advantages
- Gives general description or scope of problem
- Useful in health service evaluation and planning
- Baseline for prospective study
- Identifies cases and controls for retrospective
study - Low-cost
- Disadvantages
- No calculation of risk
- Temporal sequence is unclear
- Not good for rare diseases
- Selective survival can lead to bias
- Selective recall can lead to bias
- Cohort effect may be misleading
25Prospective Study Desgin
- Disease free persons are classified on exposure
at beginning of follow-up period then tracked to
ascertain the occurrence of disease. - Question of Study Do persons with the factor of
interest develop or avoid the disease more
frequently than those without the factor or
exposure
26Prospective Study Design
Cases
Exposure
Non-Cases
Sample Population
Cases
Exposure -
Non-Cases
27Prospective Study Criteria
- Obtain Incidence data
- Obtain the incidence among the exposed A/AB
- Obtain incidence among the non-exposed to
determine relative risk C/CD - Determine Relative Risk A/(AB)/C/(CD)
28Advantages and Disadvantages of Prospective
Studies
- Advantages
- Provides good assessment of temporal sequence
- Evaluate before onset of disease and watch for
disease
- Disadvantages
- Selection bias
- Loss to follow-up
- Expensive
29Retrospective Study Design
- Subjects are selected on the basis of disease
status either cases or controls then classified
on the basis of past exposure - Question of Study Do persons with the outcome of
interest (cases) have the exposure characteristic
(or history of exposure) more frequently than
those without the outcomes (controls)
30Retrospective Study Design
Exposure Positive A
Cases
Exposure Negative B
Exposure Positive C
Controls
Exposure Negative D
31Retrospective Study Method
- Compare the odds of exposure among the cases with
the odds of exposure among the controls - Odds of Exposure Among Cases A/(AC)/C/AC)
or A/C - Odds of Exposure Among Controls
B/(BD)/D/BD) or B/D - Get Odds Ratio or odds of expose among cases/Odds
of exposure among controls (A/C)/(B/D)
32Advantages and Disadvantages of Retrospective
Studies
- Advantages
- Less expensive than cohort (retrospective)
Studies - Quicker than cohort
- Can identify more than one exposure
- Good for rare diseases
- Well design leads to good etiologic investigation
- Disadvantages
- Selective Survival
- Selective recall
- Temporal sequence not as clear
- Not suited for rare exposures
- Gives an indirect measure of risk
- More susceptible to bias
- Limited to single outcome
33Experimental Studies
- Uses an intervention in which the investigator
manipulates a factor and measures the outcome - Elements of a complete experiment
- Manipulation of data
- Use of a control group
- Ability to randomize subjects to treatment groups
34Advantages and Disadvantages of Experimental
Studies
- Advantages
- Prospective direction
- Ability to randomize subjects
- Temporal sequence of cause and effect
- Can control extraneous variables
- Best evidence of causality
- Disadvantages
- Contrive situation
- Impossible to control human behavior
- Ethical Constraints
- External validity uncertain
- Expensive
35Attributable Risk
- The rate of disease in the exposed group
attributable to exposure. - Relative risk measures the strength of the
association - Attributable risk identifies risk of the disease
attributable to exposure or the proportion of
incidence in exposed group attributable to
exposure
36Attributable Risk Calculation
- Begins with (Incidence in the Exposed Group) -
(Incidence in the non-exposed group). - Search for the proportion of AR
37Attributable Risk Calculation
Incidence in the Exposed Group
Incidence in the Exposed Group
-
Incidence in the Exposed Group
38Population Attributable Risk Requirements
- Incidence rate of disease among those exposed to
a trait or characteristic - Incidence rate of disease among those not exposed
to the trait or characteristic - The proportion of the population that has the
trait or characteristic
39PAR Example
Incidence lung cancer, smokers
of smokers in population
Inc. lung cancer, non-smokers
of non-smokers in pop.
EXAMPLE Using NV Rates
(28.0/1000)
(.32)
(17.0/1000)
(.68) 20.5
40Attributable Risk Example
Incidence, total population
Incidence, non-exposed population
-
EXAMPLE Using NV Rates
(20.5/1000)
-
(17.0/1000) 3.5/1000
41PAR
Incidence, total population
Incidence, non-exposed pop.
-
Incidence in total population
EXAMPLE Using NV Rates
(20.5/1000)
-
(17.0/1000)
3.5/20.5 17
20.5
42Intervention Comparisons
- To demonstrate any therapeutic effect uses a
PLACEBO - To demonstrate improved therapy compare to
CONVENTIONAL TREATMENT - To demonstrate the most effective regimen compare
DIFFERENT REGIMENS
43Blinding in Experimental Studies
- The importance of blinding depends on the needed
outcome. Less important if the outcome is clear. - Non-blinded both subject investigator know
the treatment allocation - Single-blinded investigator knows, subject does
not know - Double-blinded neither investigator and subject
44Sources of Bias
- During selection of participants
- Absence of blinding allocation can lead to
differential classification - Other sources of miscalculation
- Withdrawals, ineligible sources, loss to
follow-up - Premature termination
45Selection Bias
- Cases and controls, or exposed and non-exposed
individuals were selected is such that an
apparent association is observed - even if there
is no association. - Biased selection - taking from a pool in which we
know the risk is higher is selection bias. - Small sample size or small response size
46Information Bias
- Methods of information about the subjects in the
study are inadequate and results show information
gathered regarding exposures and/or disease is
incorrect.
- Reporting bias
- Abstracting records
- Bias in interviewing
- Bias from surrogate interviews
- Surveillance bias
- Recall bias
47Other Issues
- Confounding Variables
- To prove that Factor A is a result of disease B,
we say that a third factor, Factor X is a
Confounder if the following is true - Factor X is a known risk factor for Disease B.
- Factor X is associated with Factor A bit is not a
result of Factor A. - Interactions