HOTELLING T2 APPROXIMATION FOR BIVARIATE MIXED DICHOTOMOUS - PowerPoint PPT Presentation

1 / 28
About This Presentation
Title:

HOTELLING T2 APPROXIMATION FOR BIVARIATE MIXED DICHOTOMOUS

Description:

HOTELLING T2 APPROXIMATION FOR BIVARIATE MIXED (DICHOTOMOUS & CONTINUOUS) DATA. Imad Khamis ... Bivariate mixed Data. For group I, the response would be X1 and X2 ... – PowerPoint PPT presentation

Number of Views:200
Avg rating:3.0/5.0
Slides: 29
Provided by: stat99
Category:

less

Transcript and Presenter's Notes

Title: HOTELLING T2 APPROXIMATION FOR BIVARIATE MIXED DICHOTOMOUS


1
HOTELLING T2 APPROXIMATION FOR BIVARIATE MIXED
(DICHOTOMOUS CONTINUOUS) DATA
  • Imad Khamis
  • Southeast Missouri State University

2
Outline
  • Objective
  • Bivariate mixed Data
  • Permutation Test
  • Hotelling T2
  • Simulation
  • Results
  • Conclusions

3
Objective
  • The comparison of the means of two treatments
    or populations when more than one variable is
    measured may be done using Hotellings T2
    statistic.
  • In many real world situations the data
    obtained can have a variable which is
    dichotomous, and the assumption of multivariate
    normality upon which Hotellings T2 is based is
    no longer valid.
  • An approximate Hotelling T2 test is proposed
    for bivariate mixed data and empirically
    evaluated in terms of Type I error rate.

4
Bivariate mixed Data
  • Example Suppose we want to test two varieties of
    tomato
  • with respect to their
    resistance to a pest and the
  • average fruit size.
  • On every sampled plant, two
    measurements are
  • made. One, whether the pest is
    absent or
  • present on the plant and
    secondly, the average
  • fruit size of the plant.

5
Bivariate mixed Data
  • For group I, the response would be X1 and X2
  • X1 1 when pest is present on the plant
  • 0 when pest is absent
  • X2 fruit size
  • For group II, the response would be Y1 and Y2
  • Y1 1 when pest is present on the plant
  • 0 when pest is absent
  • Y2 fruit size

6
Bivariate mixed Data
7
Bivariate Mixed Data
  • Aim To test
  • H0 µX µY,
  • Ha µX ? µY
  • where ?X pX1 ?X2 and ?Y pY1 ?Y2
  • are vectors of expected proportion of pests and
    expected fruit
  • size of the plants.

8
Permutation Test
  • The permutation test is based on the computation
    of t-
  • statistic for each of the response variables.
  • The statistics which can be considered are
  • Tmax abs max(t1,t2,..,tk)
  • Tmax max(t1,t2,..,tk)
  • Let t(alpha)max abs denote the critical value of
    Tmax abs for a test at level of
  • significance alpha obtained from the permutation
    distribution.
  • If Tmax abs gt t(alpha)max abs , then treatment
    differences are significant.

9
SAS Code

Proc multtest permutation class trt test
mean(x1-x2) contrast 'x1-x2' 1 -1 run
10
HOTELLING T2
Suppose we have n observations from population 1
and m observations from population 2. There are k
response variables for each population. The
response matrices are represented by

11
HOTELLING T2


Hotellings T2 statistic, which assumes that ?X
?Y ? , Estimated by
12
HOTELLING T2

13
HOTELLING T2 for Bivariate Mixed Data
E(X) and
E(Y)   where px1 P(X1 1),
E(X2), py1 P(Y1 1), and
E(Y2). The covariance between X1 and X2
can be computed by Cov(X1, X2) E(X1X2)
E(X1)E(X2).   First, let us find E(X1X2).   E(X1
X2) E (X1 X2 X1 1)P(X11) E (X1 X2 X1
0)P(X10) (µ21) px1, where µ21
E(X2 X11)
14
The covariance is Cov(X1, X2) (µ21) px1 - px1
µx2
px1(µ21-. µx2) We can
write µx2 (px1)µ21 (1- px1)µ20 where µ20
E(X2 X10). Substituting the above equation,
Cov(X1, X2) px1(1-
px1) (µ21 - µ20). Similarly the covariance
between Y1 and Y2 is given by  Cov(Y1, Y2)
E(Y1Y2) E(Y1)E(Y2). Cov(Y1, Y2) px1(1- px1)
(µ21 - µ20).
15
The variance-covariance matrix for X can be
written as V(X)
16
(No Transcript)
17
The test statistic for testing the hypothesis H0
µx µy is given by
approximately follows an F-distribution with
numerator degrees of freedom 2 and denominator
degrees of freedom n m 3 .
18
Simulation
  • We generated Bernoulli distribution for one
    variable and a normal distribution for the other.
  • The first variable X1 was generated from
    Bernoulli(px1).
  • The second variable X2 was generated conditional
    on X1. Let X21 be the second variable X2
    corresponding to X1 1 and X20 be the second
    variable X2 corresponding to X1 0.

19
Simulation
The values of and were chosen in such
a way so that the difference -
is 5, 10, 20, and 30. Each of these mean
differences were taken in combination with
. The variable 1 (X1) was generated
using different values of px1. The values of px1
considered are 0 .3, 0.4, 0.6, and 0.7. The
same procedure was repeated for the other
population. The variance-covariance matrix was
the same for both cases under H0.
20
Simulation
For this study n and m were equal. The
different values of n and m were 10, 20, 30, and
40. Each combination of px1, , ,
, , n, and m was repeated 5000 times and
the test statistics were computed at each
repetition. The type 1 error rate is taken to
be the relative frequency with which the test
statistics exceeded the critical value in 5000
replications. The critical value is computed at
5 significance level.
21
X alpha from unbiased estimateY alpha from
biased estimate
22
Results N20, p .30
23
Results N20, p.60
24
Results N20, p.70
25
Results N20, p.40
26
Results N40, p.6
27
Results N40, p.3
28
Conclusions
  • Hotelling T2 can also be used for Bivariate
    Mixed Data
  • 2. Mean difference effect
  • Alpha decreases with increasing mean
    difference
  • 3. Variance proportion effect
  • Alpha decreases with increasing variance
    proportion
Write a Comment
User Comments (0)
About PowerShow.com