Bootstrapping - PowerPoint PPT Presentation

About This Presentation

Title:

Bootstrapping

Description:

Title: Folie 1 Author: Administrator Last modified by: Ich Created Date: 12/1/2006 9:57:45 AM Document presentation format: Bildschirmpr sentation – PowerPoint PPT presentation

Number of Views:99

Avg rating:3.0/5.0

Slides: 20

Provided by: cosm69

Learn more at: http://www.cosmo-model.org

Category:

more less

Transcript and Presenter's Notes

Title: Bootstrapping

1
Bootstrapping using different methods to
estimate statistical differences between model
errors

Ulrich Damrath
COSMO GM Rome 2011

2
Some typical situations occuring during
operational verification
ahhdfkfflflflflflfkfkfkjdjdddnbdnnnd
3
Questions

1.Question Are the differences of scores due to
noise or are they statistical significant?
2. Question Are there significant differences
between the quality of different models?
(Interests user of forecasts)
3. Question Are there significant differences
between the quality of models for different
situations? (Interests developers of models)
Problem BIASes may be normal distributed, but
RMSEs?
A possible solution Application of bootstrap
techniques to get confidence intervals or
quantiles of the distribution
1. Question concerning the bootstrap method How
many replications are necessary to get stable
statistical results?
2. Question concerning the bootstrap method How
should the sample data be grouped in order to
avoid autocorrelation effect?

4
The principle of bootstrapping for a sample with
10 elements
Realisation 1 mean value using elements 5 3 8
7 8 4 7 0 4 3 Realisation 2 mean value using
elements 3 2 0 5 1 2 0 2 2 8 Realisation 3
mean value using elements 5 2 3 6 8 3 8 0 8 6
Realisation 4 mean value using elements 7 5
1 6 4 0 1 2 1 6 Realisation 5 mean value
using elements 6 5 8 6 1 0 0 2 3 2 Realisation
6 mean value using elements 1 0 5 5 6 5 8 5 5
8 Realisation 7 mean value using elements 3
4 4 4 2 8 5 3 2 6 Realisation 8 mean value
using elements 0 8 2 0 6 4 1 6 6 5 Realisation
9 mean value using elements 0 7 5 6 3 2 2 3 8
8 Realisation 10 mean value using elements 2 2
3 6 6 6 6 2 0 0 The mean value of all
realisations (replications) gives the bootstrap
mean. The standard deviation of all mean values
gives the bootstrap standard deviation as
5
Bootstrap properties for three analytical
cases Number of sample values 31
6
Bootstrap properties for three analytical
cases Number of sample values 310
7
Bootstrap properties for three analytical
cases Number of sample values 3100
8
Bootstrap properties for three analytical
cases Number of sample values 31000
9
Bootstrap properties for three analytical
cases Number of sample values 310000
10
Conclusion concerning the convergence of the
method A number of 500
replications seems to be appropriate
to get a stable value for the bootstrap
variance. Setting the sample characteristics
Treating each pair of observations
and forecasts as a single
sample member leeds to large sample sizes with
relatively high autocorrelation.
Therefore values are grouped by
blocks of one, two and four days. Additionally,
a block size was constructed using the optimal
block length LOPT which can be estimated by
with a as a
function of autocorrelation and N as sample size.
11
The real world Dependence of bootstrap standard
deviation and bootstrap confidence intervals on
the number of replications 2m-temperature
forecasts during Summer 2010 and 10m-wind speed
during Winter 2010/2011. BIASes for different
periods, models and weather elements
12
The real world Dependence of bootstrap standard
deviation and bootstrap confidence intervals on
the number of replications 2m-temperature
forecasts during Summer 2010 and 10m-wind speed
during Winter 2010/2011. RMSEs for different
periods, weather elements and types of mean wind
direction over Germany (700 hPa)
13
Quantiles 10 and 90 for different bootstrap
types, Period 01.06.2010 31.08.2010 COSMO-EU
(solid), COSMO-DE (dotted), Element Temperature
2m Top Median and quantiles (green overlapping
quantiles, red no overlapping quantiles) Bottom
another visualisation of the overlapping
intervals (bluish overlapping intervals, deep
red no overlapping intervals)
14
Quantiles 10 and 90 for different bootstrap
types, Period 01.06.2010 31.08.2010 COSMO-EU
(solid), COSMO-DE (dotted), Element Wind speed
10m Top Median and quantiles (green overlapping
quantiles, red no overlapping quantiles) Bottom
another visualisation of the overlapping
intervals (bluish overlapping intervals, deep
red no overlapping intervals)
15
Comparison of overlapping quantile intervals for
different wind directions NW north westerly
flow, SW south westerly flow, NO north
easterly flow, SO south easterly flow
16
Comparison of overlapping quantile intervals for
different wind directions NW north westerly
flow, SW south westerly flow, NO north
easterly flow, SO south easterly flow
17
Some typical situations occuring during
operational verification in 2009, 2010 and 2011
18
Conclusions

Different types of grouping the samples lead to
different result concerning the statistical
significance of the model errors.
Block methods give more or less equivalent
results.
The results for the comparison of different
models may users lead to a decision which model
should be used.
The results for different weather types (flow
directions) may developers give some hints
concerning the development of model physics.

19
References

Efron, B., Tibshirani, R.J.(1993) An
Introduction to the Bootstrap (Chapman
Hall/CRC Monographs on Statistics Applied
Probability) Mudelsee, M. (2010) Climate Time
Series Analysis Classical Statistical and
Bootstrap Methods, Springer Dordrecht,
Heidelberg, London, New York

Write a Comment

User Comments (0)