Dipak K. Dey - PowerPoint PPT Presentation

1 / 50

About This Presentation

Title:

Dipak K. Dey

Description:

Berger (1985) Relative frequency, and quantile based elicitation. ... Quantile and graphical approaches for GEV model, Coles and Powel(1996) ... – PowerPoint PPT presentation

Number of Views:68

Avg rating:3.0/5.0

Slides: 51

Provided by: lia49

Category:

more less

Transcript and Presenter's Notes

Title: Dipak K. Dey

1
Prior Elicitation from Expert Opinion
Dipak K. Dey University of Connecticut Some
parts joint with Junfeng Liu Case Western
Reserve University
2
Elicitation

Elicitation is the process of extracting
expert knowledge about some unknown quantity
of interest, or the probability of some future
event, which can then be used to supplement any
numerical data that we may have.
If the expert in question does not have a
statistical background, as is often the case,
translating their beliefs into a statistical
form suitable for use in our analyses can be
a challenging task.

3
Introduction

Prior elicitation is an important and yet under
researched component of Bayesian statistics.
In any statistical analysis there will typically
be some form of background knowledge available in
addition to the data at hand.
For example, suppose we are investigating the
average lifetime of a component. We can do tests
on a sample of components to learn about their
average lifetime, but the designer/ engineer of
the component may have their own expectations
about its performance.

4
Introduction

If we can represent the expert's uncertainty
about the lifetime through a probability
distribution, then this additional (prior)
knowledge can be utilized within the Bayesian
framework.
With a large quantity of data, prior knowledge
tends to have less of an effect on final
inferences. Given this fact, and the various
techniques available for representing prior
ignorance, practitioners of Bayesian statistics
are frequently spared the effort of thinking
about the available prior knowledge.

5
Introduction

It will not always be the case that we will have
sufficient data to be able to ignore prior
knowledge, and one example of this would be in
the uncertainty in computer models application or
modeling extreme events.
Uncertain model input parameters are often
assigned probability distributions entirely on
the basis of expert judgments. In addition,
certain parameters in statistical models can be
hard to make inferences about, even with a
reasonable amount of data.

6
Introduction

The amount of research in eliciting prior
knowledge is relatively low, and various proposed
techniques are often targeted at specific
applications. At the same time, recent advances
in Bayesian computation have allowed far greater
flexibility in modeling prior knowledge. In
general, elicitation can be made difficult by the
fact that we cannot expect the expert to provide
probability distributions for quantities of
interest directly.

7
Introduction

The challenge is then to find appropriate
questions to ask the expert in order to extract
their knowledge, and then to determine a suitable
probabilistic description of the variables we are
interested in based on the information we have
learned from them.

8
Motivation

Three approaches
1 Direct Prior Elicitation
Berger (1985) Relative frequency,
and quantile based elicitation.
2 Predictive prior probability space, which
requires simple
priors and may be burdened with additional
uncertainties
arising from the response model.
(Kadane, et al, 1980 Garthwaite and Dickey,
1988, Al-Awadhi and Garthwaite, 1998, etc.).
3 Nonparametric Elicitation
(Oakley and OHagan, 2002)

9
Symmetric Prior Elicitation

Double bisection method Expert provides q(.25),
q(.5) and q(.75), the three quantiles
IQR q(.75)-q(.25)
Normal prior
Z(q) IQR of std. normal, then, prior mean and
std. dev. are,
q(.5) and IQR/ Z(q) respectively.

10
Students t Prior

Three non redundant quantiles are required to
estimate the df ?. Kadane et.al. (1980) suggested
obtaining q(.5), q(.75) and q(.9375)
a(x) (q(.9375)-q(.5))/(q(.75)-q(.5)) depends on
df ? only
Df is determined from look up table of a(x) vs df
?.

11
Students t Prior

After elicitation of df obtain t?,0.75
Calculate S(q) (q(.75)-q(.5)) 2/ t2?,0.75
for elicitation of scale parameter s.
This idea can be applied to any general
location-scale family.

12
Lognormal Prior

Garthwaite (1989) used split-normal distribution,
OHagan (1998) used 1/6, 3/6 and 5/6 quantiles.

Proposition If X has a log-normal distribution,
i.e.,
, then the variance
and the mean
,where
is the
is the IQR
median of
for standard normal distribution.
13
Direct Prior Elicitation

Simple and limited prior family with only
location and scale parameters (normal,
exponential, etc.)
(2) Location-scale-shape (µ-?-?) parameter joint
elicitation (gamma, skew-normal, Students t,
etc.)

14
Symmetric and Asymmetric Priors

Normal
Students t
Log-normal
Skew-normal
Normal-exponential
Skew-Students t

Location-scale, symmetric
No location scale but shape, symmetric
Location-scale, asymmetric
Location-scale-shape, asymmetric
Location-scale-shape, asymmetric
Location-scale-shape, asymmetric
15
Shape Parameter Elicitation
This is most challenging. Presumably, the
Interquantile-Range-ratio (IQRR
q(.75)-q(.5)/q(.5)-q(.25) is a monotone
function of shape parameter. We have two
cases (1) Shape-parameter is in the
non-sensitive region, absolute value larger
than 1. (2) Shape-parameter is in the
sensitive region, absolute value smaller than 1.
16
Nonsensitive and sensitive regions (Skew-normal)
Non-sensitive
Sensitive
IQRR (interquantile range ratio) vs.
shape parameter
17
Shape Parameter Sensitive Region Gamma Case
18
Parameter Elicitation Guideline
The elicitation input is IQRR and the
hyperparameter is the shape parameter.
We prefer a moderate sensitivity index (SI)
Hyperparameter change / elicitation input change
SI? (IQRR)/? (l)
We look for SI close to 1.
Sensitive region shape parameter is small in
magnitude.
19
Parameter Elicitation on Shape Parameter
Non-Sensitive Region
(1) Elicit shape parameter from plot of

IQRR(?) vs. ? (2) Scale parameter
?
IQR/IQR(?) where, IQR is the interquantile
range from expert, IQR(?) is the standardized
IQR with elicited ? from (1), ? 1 and µ0.
(3) The location parameter is
Q(0.75)- ?
Q(0.75,?) where, Q(0.75) is .75 quantile
from expert, ? comes from (2), and Q(0.75,?) is
the standardized .75 quantile with elicited ?
from (1), ? 1 and µ0.
20
Note
The sensitivity index in IQR(?) vs. ? and
Q(0.75,?) vs. ? is usually moderate.
21
Approximate Scale Parameter Elicitation from
Taylors Expansion (1 Basics)
General approach for any location, scale and
shape Family
1 g() is the characteristic point of density
f(xµ,?,?), say mean, median, mode, etc.
2 g() µ?g(?), where g(?) is the
standardized characteristic point. 3
f(g()µ,?,?) (1/?)f(g(?)0,1,?).
22
Approximate Scale Parameter Elicitation from
Taylors Expansion (2 Method)
Letting (1)-(2) and only keeping first 2 terms on
the right hand side, we get
We get the approximate scale parameter without
considering any consequences as
23
Relative Error in Students t Prior Elicitation
(1 Values)
From Taylors expansion, we have approximate
The exact
Where, 1 v is degrees of freedom 2 IQR is
interquantile range from expert 3 p 0.5 4
is .75 quantile of Students t
distribution with v degrees of freedom
24
Approximate Scale Parameter Elicitation from
Taylors Expansion (3 Relative Error)
Now
(1)-(2)
Denote
(Only related to ?)
The relative error is
25
Relative Error in Students t Prior Elicitation
(2 Plot)
(1) approximate represents Taylor expansion
value
(2) exact represents Taylor expansion value

(3) normal represents , with as
interquantile range for standardized normal
distribution.
(1) (2) approaches 1.0763 as v goes to
infinity.
26
An Important Observation
When shape parameter is highly sensitive to IQRR,
the approximate scale parameter elicitation by
Taylors expansion will be very stable in terms
of relative error.
27
Elicitation of Shape Parameter on Sensitive
Region (Skew-normal, Iteration on characteristic
points)
Iteration based on Taylors expansion at median
, mode or mean
.
(1) Start with current l, from high-proportional-
fidelity by Taylor expansion, we have
(2) The skew(shape) parameter can be obtained by
plotting
(3) Go to (1) until convergence (complete
and )
(4) Location parameter
28
Elicitation on Shape Parameter Sensitive
Region (Skew-normal, Iteration on IQRs)
Iteration based on IQRs
(1) Start with current , we look up
,
then
(2) The skew (shape) parameter can be obtained by
plot
Since
(3) Go to (1) until convergence (complete
and )
(4) Location parameter
29
Graphical Comparison 1 (reference IQR based
iteration)
30
Graphical Comparison 2 (reference median based
iteration)
31
Graphical Comparison 3 (reference mean based
iteration)
32
Graphical Comparison 4 (reference mode based
iteration)
33
Another Important Observation
The IQR based iteration is close to mean based
iteration for skew-normal case, since mean is
explicit , other than numerically
solved.
34
(No Transcript)
35
(No Transcript)
36
Non-Parametric Prior Elicitation

To estimate prior density directly
such that
,

Suppose,
parametric family of distributions,
where
vector of hyper parameters
underlying parameters in
37
Non-Parametric Prior Elicitation
(correlation function) 1 if
decreasing function of
otherwise.
ensures that prior variance covariance matrix
of any set of observation
or functional of
is positive semi-definite.
38
Choice of Covariance function
specifies the true density function.
controls smoothness of the density.
b large implies
is large.
39
Hierarchical prior (Gaussian Process Prior)
Special Case
then
Then
Prior
40
Let D elicited summaries relating to
data

H is a function of

A and

is a function of
41
This implies,
with
42
Posterior
n of elements in D
use MCMC to obtain samples from
43
Other Choices of Centering
a)
b)
c)
Gamma or Log-normal etc.
d)
44
Side Conditions

Given Derivatives or quantiles D will be
appropriately changed. In fact D can incorporate
all the constraints specified in the prior, e.g.,
moments.

45
Psychological Perspective of Imprecise Subjective
Probabilities

Numerical probabilty estimates (N)
Ranges of numerical values (R)
Verbal phrases (V)
Objective
Translate the triplate (N,R,V) to a decision
makers model

46
Imprecisely Assessed Distributions
Contamination
Class of Bi-modal distribution
47
Future problems

Prior elicitation in Extreme Value Modeling
Quantile and graphical approaches for GEV model,
Coles and Powel(1996)
Prior elicitation for short and long tailed
distribution
Spatial modeling
High dimensional modeling

48
References

1. Daneshkhah, A. (2004). Psychological Aspects
Influencing Elicitation of Subjective
Probability. BEEP working paper.
2. Dey, D.K. and Liu, J. (2007). A quantitative
study of quantile based direct prior elicitation
from expert opinion. Bayesian Analysis, 2,
137-166.
3. Garthwaite, P. H., Kadane, J. B., and O'Hagan,
A. (2005). Statistical methods for eliciting
probability distributions. Journal of the
American Statistical Association, 100, 680-701.
4. Jenkinson, D. (2005). The Elicitation of
Probabilities-A Review of the Statistical
Literature. BEEP working paper.
5. Kadane, J.B.,Dickey,J.M., Winkler, R.L.,
Smith, W.S. and Peters, S.C.(1980). Interactive
elicitation of opinion for a normal linear model.
JASA, 75, 845-854.

6. Oakley, J., and O'Hagan, A. (2005).
Uncertainty in prior elicitations a
non-parametric approach. Revised version of
research report No. 521/02 Department of
Probability and Statistics, University of
Sheffield.
7. O'Hagan, A. (2005). Research in elicitation.
Research Report No.557/05, Department of
Probability and Statistics, University of
Sheffield. Invited article for a volume entitled
Bayesian Statistics and its Applications.
8. O' Hagan, A., Buck, C. E., Daneshkhah, A.,
Eiser, J. E., Garthwaite, P. H., Jenkinson, D.
J., Oakley, J. E. and Rakow, T. (2006). Uncertain
Judgements Eliciting Expert Probabilities. This
book Will be published by John Wiley and Sons in
July 2006.