Title: Experts
1Experts consensus building on technology
risks(Expert judgments on phytoremediation The
role of self-confidence in averaging procedures
and Formative Consensus Building (FCP) for
predicting technology risks submitted)
2What will be told
Overview
- Theoretical motivation Statistical versus
consensus building procedures the role of
expert confidence - The case of phytoremediation in Dornach
- The situation
- Technology application, uncertainties and
technology performance - 3. The procedure
- 4. Results
- Quantitative results Is averaging about all
experts the best strategy? - Qualitative results What are the potential and
limits of consensus building procedures - 5. Conclusions/Discussion
3What to do, if
1. Motivation
- you have a new technology/medicine/educational
program at hand and want to reduce the number of
accidents/diseases/failure rate - the situation of technology application is
overly complex (cause impact relationships are
multi-layered) and not completely known - empirical evidence is limited (no von
Mises-Reichenbach situation) - Expert opinions diverge
4and you are interested in
1. Motivation
- the range of outcomes after applying Tech A (i.e.
statements such as The failure/ mortality rate
will be between x and y.) - the probability distribution on the reduction
rate r p(r lt C) z
5Two approaches
1. Motivation
- A) Statistical models (e.g. Johnson, Budescu,
Wallsten, 2001 Averaging when maximizing
independence among experts in formative
measurement procedures) - B) Consensus building procedures (e.g. Susskind,
1999 Organizing open but mediated processes on
what will be judged and which questions to be
answered)
6The situation in DornachLarge scale
contaminantion with Cadmium, Copper, and Zinc
2. The case
72. The case
8Is the flowerful technology of phytoremediation
also a powerful one?
2. The case
92. The case
How Phytoremediation Works
Cropping conditions
Acquisition
Soil parameters pH, clay, conductivity,
Zn
Cu
10Key questions Range/Bounds
Dependent variables
- If a model lot will be treated for ten years,
the cadmium (copper, zinc) concentration will
have a value between ___ mg Cd/Cu/Zn per kg dry
matter of soil and ___ mg Cd/Cu/Zn per kg dry
matter of soil? - Expert ks estimation of lower bound
concentration of the pollutants Cd/Cu/Zn - Expert ks estimation of upper bound
concentration of pollutants Cd/Cu/Zn
11Key questions Probabilities on Remaining
Concentrations
Dependent variables
- Experts probability judgments on attaining a
remaining degree of contamination r - Question on p(remaining concentrationltr)
- Cd 80, 20, 50, 91, 90, 99, 30, 70, 1,
40, 60, 10 - Cu 90, 99, 30, 70, 1, 40, 60, 10
- Zn 80, 30, 10, 99, 50, 95, 20, 1
12Fishing in a pool of experts
Expert sample
- Large scale eight-year national environmental
research program on soil remediation - Project cluster of six projects on
phytoremediation in Dornach (about 25
researchers) - 10 Experts from this cluster with backgrounds
biology, chemistry, environmental engineering,
mathematics, decision sciences and specialized
knowledge on soil chemistry, biological
mechanisms of heavy metal accumulation in plants,
sampling and data analysis, or designing
large-scale remediation engineering applications
13Procedure
- Experts got detailed (anonymized) information
about all experts judgments - Consensus building workshop
- Signing a public statement
- What could/should be answered (sample lot, soil
parameters, technology, key questions) - Gathering and disseminating documented expertise
(Multi-disciplinary state of the art knowledge
79 pages) - Questionnaire with key questions on
- Ranges
- Probabilities of attaining certain reductions
(ca. 10 reduction rates asked per heavy metal)
14H1 Experts confidence provides validity
4. Hypotheses
- Experts that feel more confident are more valid
in the sense that they deviate less from the
real/superexperts judgments - Further The judgments of the high confidence
group is more homogeneous than a low confidence
group
15H2 Statistical models to be compared
4. Hypotheses
- High confidence average among high confidence
experts (N 4) - Low confidence Average among low confidence
experts (N 5) - Average all (N 9)
- Median (N 9)
- Maxcorr Average among high correlated experts (N
4) - Mincorr Average among low correlated expert
16H2 Trucating provides higher validity
4. Hypotheses
- Averaging only the medium responses (only the
judgments of the inner 50 truncated
distribution) improves validity The median
expert does fine
17H3 Showing low correlations in an expert pool is
not an indicator of expertise
4. Hypotheses
- Higher correlated experts provide more valid mean
estimates (compared to a superexpert) than low
correlated experts - (In contradiction to Johnson et al. 2001)
18H4 Consensus Building does/ does not provide
new results
4. Hypotheses
- Not a straight hypothesis more an exploratory
one - Consensus building provides more reliable/valid
vs. fuzzier statements than statistical models - The high confidence group is the base line
19H1 Mean bounds of high and low confidence group
differ
4. Results
- Estimates of upper and lower bounds
- Means differ (Factor 2 in general not
significant) - Variances differ significantly low confidence
experts are less homogeneous (show more variance) - (see Table 1)
20H1 Mean bounds of remaining concentr. of high and
low confidence group differ
4. Results
21H1 Probability judgments of high and low
confidence group differ
4. Results
- Probability judgment on remaining concentrations
- High and low confidence group differ (rep. meas.
ANOVA) - Cd p lt .21 however interaction Probability x
Confidence p lt .04 - Cu p lt .04
- Zn p lt .02
22H1 Probability judgments of high and low
confidence group differ
4. Results
23H2 High confidence experts are more valid
4. Results
- Estimates of upper and lower bounds
- High confidence experts show lower difference to
a superexpert/real measurements in all 6
estimates (Factor 2 however not significant)
24H3 Self confidence provides validity
4. Results
- Mean sum of differences (absolute values) of
experts and superexperts/real meas. probability
judgments for different heavy metals
25H3 The greenhorns are the greensLess confident
are more optimistic
4. Results
264. Results
Y-axis Deviations of probability judgments (sum
score) to a superexpert/meas. The Median is the
best high confidence does fine
Low conf (8)
High conf (2)
Truncin (3)
Truncout (6)
Model (rank)
Mincorr (8)
Maxcorr (3)
Median (1)
Average all (3)
27H4 Qualitative statements consented
- We all agree that the remaining concentration
will be in the range between x and y (grey
area) with a certain probability - For Cadmium The reduction will exceed 15 with
low probability - For Zinc The Majority believes that the
remaining concentration will be between 93 and
98
28Conclusions
- The Formative Consensus Building method (i.e., a
structured, formative, anonymous method
organized by an independent facilitator) should
include - Cooperative definition of the judgmental task
- A common knowledge base
- Statistical procedures of integrating judgments
(better than fuzzy workshop statements) - The validation by a data based super-expert
judgment is a good/ideal research strategy - Measuring distributional knowledge is possible
Statistical procedures do better than discursive
ones take the median expert! - High confidence experts and high correlated
experts provide better judgments (if ....)