Statistics 102 - PowerPoint PPT Presentation

About This Presentation
Title:

Statistics 102

Description:

... the more confident we are that the patient has the condition if the test is +. + LR can approach . ... There is no leg swelling or leg pain, ... – PowerPoint PPT presentation

Number of Views:67
Avg rating:3.0/5.0
Slides: 31
Provided by: Mand248
Category:

less

Transcript and Presenter's Notes

Title: Statistics 102


1
Statistics 102
2
Outline
  • I. Sensitivity and Specificity/Likelihood Ratios
  • II. Statistical significance for group data
  • III. Statistical significance for correlational
    data
  • IV. Non-inferiority trials
  • V. Linear regression
  • VI. Logistic regression
  • VII. Stepwise multivariate regression
  • VIII. Type I and II errors/ Sample size estimates

3
I. Sensitivity and Specificity
  • Sensitivity true positives (proportion of
    individuals with the disease who test ranges
    from 0 to 1, or from 0 to 100)
  • 1-Sensitivity false negatives (proportion of
    individuals with the disease who test - ranges
    from 0 to 1, or 0 to 100)
  • If sensitivity 0.8 (80), 1-sensitivity 0.2
    (20)
  • Specificity true negatives (proportion of
    individuals without the disease who test -
    ranges from 0 to 1, or from 0 to 100)
  • 1-Specificity false positives (proportion of
    individuals without the disease who test
    ranges from 0 to 1, or 0 to 100)
  • If specificity 0.92 (92), 1-specificity 0.08
    (8)

4
Use of Sensitivity and 1-Specificity in Receiver
Operating Curves (ROCs) and the Areas under the
ROCs (the AUC)
  • Plots sensitivity of the test (true rate, TPR)
    on Y axis, from 0 to 1 vs. 1-specificity (false
    rate, FPR) on X axis, from 0 to 1 at different
    test cutoffs
  • Perfect Classification AUC1 (area of a
    square with sides1)
  • Random guess AUC0.5 (area of a
    triangle with base and height1)
  • AUC between 0.5 and 1 Test is Better than
    a random guess
  • AUC between 0 and 0.5 Test is Worse than
    a random guess
  • AUC has a 95 CI
  • e.g., 0.78 (0.69-0.87)

5
ROCs with AUCs better than a random guess
(between 0.5 and 1.0)
? sweet spot cut off, a trade off between
sensitivity and specificity
High cut off ??????????????? Low cut off
6
Additional terms that can be derived from
sensitivity and specificity
  • Likelihood ratios does the test usefully change
    the probability (likelihood) of a disease or
    condition?
  • Positive likelihood ratio true/false
    sensitivity/1-specificity.
  • The higher the likelihood ratio, the more
    confident we are that the patient has the
    condition if the test is . LR can approach ?.
  • Negative likelihood ratio false-/true - 1-
    sensitivity/ specificity.
  • The lower the likelihood ratios, the more
    confident we are that the patient does not have
    the condition if the test is -. LR can approach
    0.

7
Example 1 Use of and - likelihood ratios
  • Your patient with COPD has an acute onset of
    worsening dyspnea. He had arthroscopic knee
    surgery 2 weeks ago. There is no leg swelling or
    leg pain, hemoptysis, personal or family history
    PE or DVT, or malignancy. You clinically assess
    the odds of him having a PE as 5050, or equally
    likely that he had a PE as that he did not have a
    PE.
  • If ordered and performed, how would the results
    of a CT angiogram (CTA) of the pulmonary arteries
    change your estimated likelihood of PE in this
    patient? In other words, how good is CTA in
    helping you diagnose or exclude a PE in this
    patient?

8
Example 1, continued
  • Literature (Annals Internal Medicine 136
    286-287, 2002)
  • CTA and pulmonary angiography (gold standard)
    were performed in 250 patients with possible PE.
  • 50 (20) of the patients had PE on pulmonary
    angiography. 200 had no PE on angiography.
  • Results
  • CTA CTA-
  • PE on pulm angio (n50) 35 15 No PE on
    pulm angio (n200) 2 198

9
Example 1, continued
  • Likelihood ratio (LR) calculations
  • CTA sensitivity (true ) 35/50 (.70) , or 70
  • 1-sensitivity (false - ) 15/50 (.30)
  • CTA specificity (true - ) 198/200 (.99), or
    99
  • 1-specificity (false )2/200 (.01)
  • LR sensitivity/1-specificity true/false
    .70/.0170 (PE 70 x as likely as before test).
  • -LR 1-sensitivity /specificity false-/true-
    .30/.99.303 (PE .3 x as likely as before
    test)
  • Annals Internal Medicine 136 286-287, 2002

10
II. Are measured group differences in variables
or outcomes statistically significant? Which
test(s) to use?
  • If data are normally distributed
  • Use paired t (if each subject is his/her own
    control) 1
  • Use unpaired t (group t) if there are two groups
    2
  • If data are skewed (not normally distributed)
  • Is the variable a continuous one, such as age
    or PaO2?
  • Use Mann Whitney U, 3, or
  • Use Wilcoxons sign rank 4
  • Is the variable a categorical one, such as
    gender or age gt 65?
  • Use Fishers exact, 5, or
  • Use chi square test 6
  • If there gt2 study groups
  • Use analysis of variance (ANOVA) 7

11
III. Are correlations between variables

statistically significant? What test(s) to use?
  • If the variables are normally distributed
  • Use Pearsons test 8
  • Pearsons r ranges from -1 to 1.
  • r ? 0 indicates no correlation.
  • P value depends both on r and N.
  • If the variables are skewed (not normally
    distributed)
  • Use Spearmans test 9
  • Spearmans r ranges from -1 to 1
  • r ? 0 indicates no correlation.
  • P values depend both on r and N. Plt 0.05 usually
    used.

12
Example 2 METABOLIC ALKALOSIS
13
IV. Non-Inferiority Trials
  • A New Treatment Can Truly Be
  • Better (Superior)
  • Essentially equal
  • Worse (Inferior) than the usual treatment
  • A Trial Can Test Whether New is
  • Better (superior)
  • Not better (non-superior)
  • Not worse (non-inferior)
  • Worse (Inferior)
  • rarely done

14
Non-inferiority trials
  • Non-inferiority trials are intended to show that
    the effect of a new treatment is not worse than
    that of an active control by more than a
    specified amount.
  • A little like a point spread in football.
  • The non-inferiority margin (NIM) is chosen by the
    investigators before the study (a priori) and can
    be somewhat arbitrary.
  • Study endpoints in non-inferiority trials can be
    efficacy or safety parameters or a combination of
    the two.
  • Study design may include 3 arms with placebo
    group (preferred) or 2 arms with only new and
    usual treatments (much less ideal, since no
    internal validation that new treatment is better
    than placebo)
  • Delta (d) is the measured difference (best
    estimate of the true difference) between the two
    active treatments. This d will have a 95 CI.
  • Example 3 d -4 (95 CI, -9 to 1)

15
Example 3 d -4 (95 CI, -9 to 1), the
control Rx being slightly better
  • If the NIM had been chosen to be -10 by the
    investigators a priori, using Example 3, the new
    drug would be shown to be non-inferior to the
    control, as -10 , the NIM, was less that the 95
    CI for d, -9 to 1.
  • (If the NIM had been chosen to be -5 by the
    investigators a priori, using Example 3, the new
    drug would not be shown to be non-inferior- to
    the control, as -5, the NIM, fell in the 95 CI
    for d, -9 to 1. In this context,
    non-inferiority not shown is the same as being
    inferior.)

16
V. Linear regression
  • Simple regression ymxb
  • one independent variable, x
  • one dependent variable, y
  • If x0, yb, the intercept
  • b can be , as shown, zero, or




?y /?x m, slope
y, dep variable


b
x, indep variable
17
METABOLIC ALKALOSIS
18
V. Linear regression
  • Simple regression ymxb
  • one independent variable, x
  • one dependent variable, y
  • If x0, yb, the intercept
  • b can be , as shown, zero, or -
  • More complex regression ym1x1m2x2b
  • two independent variables
  • one dependent variable
  • If x1x20, yb, the intercept
  • b can be , zero, or -

19
VI. Logistic regression. A popular method
  • A model predicting the probability of a
    dependent categorical outcome, such as death,
    using 2 or more patient-specific independent
    variables.
  • The logit, z, is the total contribution of ALL
    the patient-specific independent variables used
    in the model to predict the outcome, f(z), the
    dependent variable.
  • zß0ß1x1ß2x2 ßnxn.
  • ß0 intercept
  • ß1, ß2, ßn are regression coefficients for
    x1,x2, ... xn
  • If x1, x2, ,xn all 0 (the pt has no risk
    factors), zß0the risk of the dependent outcome
    (such as death) when no factors affecting risk
    are present..
  • If ßn gt 0, then the variable, n, increases the
    risk of the outcome.
  • If ßnlt0, then the variable, n, reduces risk of
    the outcome.
  • Large ßn means the variable, n, has a large
    influence on the outcome.
  • Small ßn means the variable n has a small
    influence on the outcome
  • f(z) likelihood of outcome, such as death
    ez/(ez1) 1/(1e-z)

20
The logistic function is useful because it can
input any z from ? to -? whereas the output,
f(z), will be confined to values between zero and
1.
Note if z0, f(z)0.5, because
1/1e-01/111/20.5
21
Example 4. Logistic regression.
  • Three independent variables , x1, x2, and x3 are
    studied to try to predict the 10-year death risk
    from heart disease. Using data obtained from a
    large study population, the following logistic
    regression model was derived to best fit the
    data zß0ß1x1ß2x2 ß3x3
  • x1 age in years above 50 (age is a continuous
    variable) ß1 2.0
  • x2 sex, where 0 is male and 1 is female (gender
    is a categorical variable) ß2 -1.0
  • x3 blood cholesterol in mmol/L above 5 mmol/L
    (194 mg/dL) ß31.2
  • ß0 -5.0.
  • Risk of deathf(z)1/(1e-z), where z (the
    logit) -5.02.0 x1 - 1.0 x2 1.2 x3.
  • Thus, in a 50 y.o. female with a cholesterol of 5
    mmol/L, z?0-5.0 (see prev Fig)

22
Logistic regression.
  • Three independent variables , x1, x2, and x3 are
    studied to try to predict the 10-year death risk
    from heart disease. Using data obtained from a
    large study population, the following logisitic
    regression model was derived to best fit the
    data zß0ß1x1ß2x2 ß3x3
  • x1 age in years above 50 (age is a continuous
    variable) ß1 2.0
  • x2 sex, where 0 is male and 1 is female (gender
    is a categorical variable) ß2 -1.0
  • x3 blood cholesterol in mmol/L above 5 mmol/L
    (194 mg/dL) ß31.2
  • ß0 -5.0.
  • Risk of deathf(z)1/(1e-z), where z (the
    logit) -5.02.0 x1 - 1.0 x2 1.2 x3.
  • Thus, in a 50 y.o. female with a cholesterol of 5
    mmol/L, z?0-5.0
  • Example 4 What is the risk of death in the next
    10 years from heart disease in a 50 year man with
    a blood cholesterol of 7 mmol/L (272 mg/dL)?
  • z -5.02(50-50) -1(0)1.2(7-5). Thus, z
    -5.0002.4 -2.6
  • Since z -2.6 in this man, f(z)his risk of
    10-year death from heart disease 1/(1e-z)
    1/(1e2.6 ) 0.07, a 7 10-yr risk.
  • The 95 confidence intervals can also easily be
    calculated for f(z).

23
Ex. 4
24
VII. Stepwise multivariate regression
  • If several variables, INDIVIDUALLY, help to
    predict an outcome by univariate analysis, but
    these variables could be closely related to each
    other, stepwise multivariate analysis helps sort
    out independent contributions of the variables.
  • e.g., blood pressure, BMI and type 2 DM EACH
    increase risk of MI
  • This procedure is used primarily in regression
    modeling. At each step, after a new variable is
    added, a test is made to see if some variables
    can be deleted without appreciably increasing the
    discrepancy between the data and the regression
    model.
  • The procedure terminates when the measure is
    maximized or when the available improvement (by
    adding more variables) falls below some critical,
    predetermined value.

25
Example 5. Stepwise multivariate regression.
  • Cohort of ? 300 outpatients with low serum TSH
    undergoing radioiodine uptake and scan.
  • Many, but not all, had thyroid disease (e.g.,
    Graves). Numerous variables were examined to see
    which correlated with a normal uptake and scan
    result.
  • Three of the numerous variables examined
    predicted a normal uptake and scan
  • If patient was using a statin OR 6.5 (95 CI,
    2.9-14.6)
  • If patient was a man OR 2.5 (95 CI, 1.3-4.5)
  • If patient was gt 45 years of age OR 2.0 (95
    CI, 1.1-3.6)
  • Which of these variables independently predicted
    a normal thyroid uptake and scan despite the low
    serum TSH?
  • Is it statin use, being male, and/ or being older
    than predicts normal thyroid function if a
    patient has low serum TSH?

26
Example 5 Stepwise multivariate regression
Step 1 STATIN USE ?2 21.8 Plt0.001
Step 2 OLDER AGE ?28.5 P0.004
Step 3 MALE GENDER ?23.9 Not significant
from Yandell et al. Thyroid 2008 181039-42.
27
VIII. Type 1(a) and Type 2 (ß) Errors
Null Hypothesis there is no differences between
two treatments
tests
Reject the null hypothesis
Accept null hypothesis
Correct decision (no error)
Error
Correct decision (no error)
Error
Type 1 (?) error (P), which can be large or
small
Type 2 (?) error, which can large or small
28
Choosing the size of ? and ? errors
  • The type 1 error, or ? (also called P) is
    conventionally set at 0.05 (5)
  • i.e, chance of a type 1 error if the null
    hypothesis is rejected is lt 5
  • Can state plt0.05 or give exact p value (e.g.,
    p0.001, or p0.049)
  • The type 2 error, or ?, is often set at 2 to 4
    times ? , or 0.10-0.20 (10-20)
  • i.e., chance of making a type 2 error if the null
    hypothesis is accepted is 10-20
  • Power to detect a real difference (and thus to
    reject the null hypothesis ) 1- ?
  • smaller ? (e.g., 0.1), more power (.9)
  • larger ? (e.g., 0.2), less power (.8)
  • If a study is highly powered and the null
    hypothesis is accepted, the chance of there being
    a true difference is quite small.
  • If the study is under-powered and the null
    hypothesis is accepted, there can be little
    confidence that a true difference has been
    excluded.

29
Example 6 Use of a and ß in sample size
planning
A new antibiotic is developed for C. difficile.
How many patients would be needed to be included
in a phase 3 trial to be able to show that this
new drug is superior to metronidazole? To
answer this question, we need to know 1. What is
the expected success rate for metronidazole?
P1 2. What would be a clinically important and
expected improvement in success rate (based on
phase 1/2 studies) with the new drug? P2 3.
What should be the ? (type 1 error) and the ?
(type 2 error) for the study? (Recall Power
1- ?.)
30
Sample size estimation, contd
  • P1 0.75 (metronidazole, based on literature)
  • P2 0.90 (New Rx, based on small phase 1/2
    trials)
  • ? 0.05 (1 in 20)
  • ? 0.10 (1 in 10). Power 0.90 (9 in 10)
  • Needed N1 and N2 158 per group (from Fleiss
    tables), or 316 patients in total
  • If ?10 drop out rate is expected, then
    15816174 per group, or 348 patients in total
    would need be randomized.
  • (This sample size may necessitate a multi-center
    study to enroll sufficient patients during the
    proposed time frame.)
  • Analyze data by intent-to-treat and by evaluable
    patients.
Write a Comment
User Comments (0)
About PowerShow.com