Title: Linearization Variance Estimators
1Linearization Variance Estimatorsfor Survey
Data Some Recent Work
A. Demnati and J. N. K. RaoStatistics Canada /
Carleton University
A Presentation at the Third International
Conference onEstablishment SurveysJune 18-21,
2007
Montréal, Québec, CanadaJune 20, 2007
2Situation
- looking for a method of variance estimation that
- of nonlinear finite population parameters
- defined explicitly or implicitly
- using calibration weights
3Demnati Rao Approach
- Finite population parameters
- Estimator for both parameters
- Variance estimators associated with and
are different
4Demnati Rao Approach( Survey Methodology, 2004 )
- Write the estimator of a finite population
parameter as
with
if element k is not in sample s
if element k is in sample s
5Demnati Rao Approach( Survey Methodology, 2004 )
- A linearization sampling variance estimator is
given by
with
variance estimator of the H-T
estimator
of the total
is a (N1) vector of arbitrary number
6Demnati Rao Approach( Survey Methodology, 2004 )
- Example Ratio estimator of
For SRS and
7Demnati Rao Approach( Survey Methodology, 2004 )
- Example Ratio estimator of
- is a better choice over customary
- Royall and Cumberland (1981)
8Demnati Rao Approach
- Also in Survey Methodology, 2004
- the Optimal Regression Estimator
- the Generalized Raking Estimator
- Cox Proportional Hazards Model
9Model parameters(Symposium, 2005)
- Finite-population assumed to be generated from a
superpopulation model
- Inference on model parameter
model expectation and variance
design expectation and variance
i) if f 0 then
ii) if f 1 then
where f is the sampling fraction. For
multistage sampling, the psu sampling fraction
plays the role of f.
In case i),
10- Example Ratio estimator when y is assumed to be
random
where Ad is a 2N matrix of random variables
with kth column
where Ab is a 2N matrix of arbitrary real
numbers with kth column
where is an estimator of the total
variance of
11- Estimator of the total variance of
and
when
- A variance estimator of is
given by
with
where
Note that is an
estimator of model covariance
when and
when
12 model variance
sampling variance
where
and
where
13Note remains valid under
misspecification of
Note g-weight appears
automatically in
and the finite population correction 1-n/N is
absent in
14Simulation 1 Unconditional performance
- We generated R2,000 finite populations
, each of size N393 from the ratio model
where
are independent observations generated from a
N(0,1)
are the number of beds for the Hospitals
population
studied in Valliant, Dorfman, and Royall (2000,
p.424-427)
- One simple random sample of specified size n is
drawn from each generated population
15Simulation 1 Unconditional performance
16Simulation 1 Unconditional performance
Figure 1 Averages of variance estimates for
selected sample sizes compared to simulated MSE
of the ratio estimator.
17Simulation 2 Conditional performance
- We generate R20,000 finite populations
, each of size N393 from the ratio model
using the number of beds as
- One simple random sample of size n100 is drawn
from each generated population
- We arranged the 20,000 samples in ascending order
of -values and then grouped them into 20
groups each of size 1,000
18Simulation 2 Conditional performance
Figure 2 Conditional relative bias of the
expansion and ratio estimators of
19Simulation 2 Conditional performance
Figure 3 Conditional relative bias of variance
estimators
20Simulation 2 Conditional performance
Figure 4 Conditional coverage rates of normal
theory confidence intervals based on
, and for nominal level of 95
21g-weighted estimating functions model parameter
- is the solution of weighted estimating equation
- Logistic Regression Model
22Simulation 3 Estimating equations
- We generated R10,000 finite populations
, each of size N393 from the model
- Using the number of beds as
- leads to an average of about 60 for z
- One simple random sample of size n30 is drawn
from each generated population
- Population units are grouped into two classes
with 271 units k having xlt350 in class 1 and 122
units k with xgt350 in class 2
- Post-stratification X(271,122)T
23Simulation 3 Estimating equations
Table 1 Monte Carlo Variances Table 1 Monte Carlo Variances Table 1 Monte Carlo Variances
Parameter No Calibration Post-stratification
0.0133 0.0139
0.0161 0.0167
Table 2 DR variance estimator Table 2 DR variance estimator Table 2 DR variance estimator
Parameter No Calibration Post-stratification
0.0122 0.0123
0.0148 0.0150
Table 3 DR naïve variance estimator Table 3 DR naïve variance estimator Table 3 DR naïve variance estimator
Parameter No Calibration Post-stratification
0.0120
0.0145
24Multiple Weight Adjustments
- Units (or complete) nonresponse
- Due to lack of time, not presented in the talk,
but it is included in the proceeding paper
25Concluding Remarks
- We provided a method of variance estimation for
estimators
- of nonlinear model parameters
- defined explicitly or implicitly
- using multiple weight adjustments
Thank you Very Much