Bootstrapping - PowerPoint PPT Presentation

Title: Slide 1. Author: Administrator. Last modified by: Ich. Created: 12/1/2006 9:57:45 AM. Presentation format: on-screen show.

Slides: 20
Title: Bootstrapping


1
Bootstrapping: using different methods to
estimate statistical differences between model
errors
  • Ulrich Damrath
  • COSMO GM Rome 2011

2
Some typical situations occurring during
operational verification
3
Questions
  • Question 1: Are the differences between scores due to
    noise, or are they statistically significant?
  • Question 2: Are there significant differences
    between the quality of different models?
    (Of interest to users of forecasts)
  • Question 3: Are there significant differences
    between the quality of models for different
    situations? (Of interest to model developers)
  • Problem: BIASes may be normally distributed, but
    are RMSEs?
  • A possible solution: apply bootstrap
    techniques to obtain confidence intervals or
    quantiles of the distribution.
  • Question 1 concerning the bootstrap method: How
    many replications are necessary to get stable
    statistical results?
  • Question 2 concerning the bootstrap method: How
    should the sample data be grouped in order to
    avoid autocorrelation effects?

4
The principle of bootstrapping for a sample with
10 elements
  • Realisation 1: mean value using elements 5 3 8 7 8 4 7 0 4 3
  • Realisation 2: mean value using elements 3 2 0 5 1 2 0 2 2 8
  • Realisation 3: mean value using elements 5 2 3 6 8 3 8 0 8 6
  • Realisation 4: mean value using elements 7 5 1 6 4 0 1 2 1 6
  • Realisation 5: mean value using elements 6 5 8 6 1 0 0 2 3 2
  • Realisation 6: mean value using elements 1 0 5 5 6 5 8 5 5 8
  • Realisation 7: mean value using elements 3 4 4 4 2 8 5 3 2 6
  • Realisation 8: mean value using elements 0 8 2 0 6 4 1 6 6 5
  • Realisation 9: mean value using elements 0 7 5 6 3 2 2 3 8 8
  • Realisation 10: mean value using elements 2 2 3 6 6 6 6 2 0 0
The mean value of all realisations (replications)
gives the bootstrap mean. The standard deviation of
all mean values gives the bootstrap standard deviation.
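The resampling procedure described on this slide can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `bootstrap_mean`, the sample values and the use of the `n - 1` normalisation (`ddof=1`) for the standard deviation are not taken from the presentation.

```python
import numpy as np

def bootstrap_mean(sample, n_replications=10, seed=0):
    """Return (bootstrap mean, bootstrap standard deviation) of the sample mean."""
    rng = np.random.default_rng(seed)
    sample = np.asarray(sample, dtype=float)
    n = sample.size
    # Each row holds n element indices drawn with replacement: one realisation.
    indices = rng.integers(0, n, size=(n_replications, n))
    replicate_means = sample[indices].mean(axis=1)
    # ddof=1 is an assumption; the normalisation used on the slide is unknown.
    return replicate_means.mean(), replicate_means.std(ddof=1)

# Hypothetical 10-element sample, for illustration only.
sample = [2.1, -0.4, 1.3, 0.8, -1.1, 0.5, 1.9, -0.2, 0.7, 1.0]
boot_mean, boot_std = bootstrap_mean(sample, n_replications=10)
print(boot_mean, boot_std)
```

Because elements are drawn with replacement, some appear several times in a realisation and others not at all, just as in the index lists on the slide.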
5
Bootstrap properties for three analytical
cases. Number of sample values: 31
6
Bootstrap properties for three analytical
cases. Number of sample values: 310
7
Bootstrap properties for three analytical
cases. Number of sample values: 3100
8
Bootstrap properties for three analytical
cases. Number of sample values: 31000
9
Bootstrap properties for three analytical
cases. Number of sample values: 310000
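A convergence experiment of this kind can be imitated with synthetic data. The sketch below is not the author's setup: it assumes a standard normal sample of 310 values as a stand-in for one analytical case, and it repeats the bootstrap 20 times for each number of replications to show how much the estimated standard deviation scatters. All names are hypothetical.

```python
import numpy as np

def bootstrap_std_of_mean(sample, n_replications, rng):
    """Bootstrap standard deviation of the sample mean."""
    n = sample.size
    indices = rng.integers(0, n, size=(n_replications, n))
    return sample[indices].mean(axis=1).std(ddof=1)

rng = np.random.default_rng(42)
# Assumption: a standard normal sample stands in for one analytical case.
sample = rng.standard_normal(310)
# For the mean, the standard error is known analytically.
analytical = sample.std(ddof=1) / np.sqrt(sample.size)

average = {}
scatter = {}
for b in (10, 50, 100, 500, 1000):
    # Repeat the whole bootstrap to see how stable the estimate is.
    estimates = [bootstrap_std_of_mean(sample, b, rng) for _ in range(20)]
    average[b] = float(np.mean(estimates))
    scatter[b] = float(np.std(estimates))
    print(f"B={b:5d}  mean={average[b]:.4f}  scatter={scatter[b]:.4f}  "
          f"analytical={analytical:.4f}")
```

The scatter column is the quantity of interest here: it should shrink as the number of replications grows, which is one way to judge how many replications are enough.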
10
Conclusion concerning the convergence of the
method: about 500 replications seem to be
appropriate to get a stable value for the
bootstrap variance.
Setting the sample characteristics: Treating each
pair of observations and forecasts as a single
sample member leads to large sample sizes with
relatively high autocorrelation. Therefore, values
are grouped into blocks of one, two and four days.
Additionally, a block size was constructed using the
optimal block length LOPT, which can be estimated
from a, a function of the autocorrelation, and N,
the sample size.
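The grouping into blocks can be sketched as a block bootstrap. Several things below are assumptions rather than the presentation's method: the blocks are non-overlapping, the errors are a synthetic AR(1) series, and the block length rule is an AR(1)-based selector of the kind given in Mudelsee (2010), LOPT = [sqrt(6) a / (1 - a^2)]^(2/3) N^(1/3), with a taken as the lag-1 autocorrelation. Whether this is the estimator meant on the slide cannot be confirmed from the transcript. Function names are hypothetical.

```python
import numpy as np

def optimal_block_length(errors):
    """Assumed AR(1)-based block length selector (see the text above)."""
    x = np.asarray(errors, dtype=float)
    x = x - x.mean()
    a = np.dot(x[:-1], x[1:]) / np.dot(x, x)  # lag-1 autocorrelation estimate
    a = min(abs(a), 0.99)  # guard added here to avoid division by zero
    l_opt = (np.sqrt(6.0) * a / (1.0 - a**2)) ** (2.0 / 3.0) * x.size ** (1.0 / 3.0)
    return max(1, int(round(l_opt)))

def block_bootstrap_rmse(errors, block_length, n_replications=500, seed=0):
    """Resample non-overlapping blocks with replacement; return RMSE replicates."""
    rng = np.random.default_rng(seed)
    x = np.asarray(errors, dtype=float)
    n_blocks = x.size // block_length
    # Drop the incomplete tail so the series splits evenly into blocks.
    blocks = x[: n_blocks * block_length].reshape(n_blocks, block_length)
    picks = rng.integers(0, n_blocks, size=(n_replications, n_blocks))
    resampled = blocks[picks]  # shape: (n_replications, n_blocks, block_length)
    return np.sqrt((resampled**2).mean(axis=(1, 2)))

# Synthetic autocorrelated forecast errors, purely illustrative.
rng = np.random.default_rng(1)
n = 920
errors = np.empty(n)
errors[0] = rng.standard_normal()
for t in range(1, n):
    errors[t] = 0.7 * errors[t - 1] + rng.standard_normal()

l_opt = optimal_block_length(errors)
for length in (1, 10, 20, 40, l_opt):
    rmse = block_bootstrap_rmse(errors, length)
    print(f"block length {length:3d}: bootstrap std of RMSE = {rmse.std(ddof=1):.4f}")
```

With a block length of 1 this reduces to the ordinary bootstrap, which ignores the autocorrelation. Longer blocks keep neighbouring errors together, so for positively autocorrelated errors one would generally expect a wider spread of the RMSE replicates.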
11
The real world: dependence of bootstrap standard
deviation and bootstrap confidence intervals on
the number of replications. 2m-temperature
forecasts during summer 2010 and 10m-wind speed
during winter 2010/2011. BIASes for different
periods, models and weather elements.
12
The real world: dependence of bootstrap standard
deviation and bootstrap confidence intervals on
the number of replications. 2m-temperature
forecasts during summer 2010 and 10m-wind speed
during winter 2010/2011. RMSEs for different
periods, weather elements and types of mean wind
direction over Germany (700 hPa).
13
10% and 90% quantiles for different bootstrap
types. Period: 01.06.2010 to 31.08.2010. COSMO-EU
(solid), COSMO-DE (dotted). Element: 2m temperature.
Top: median and quantiles (green: overlapping
quantiles; red: non-overlapping quantiles).
Bottom: another visualisation of the overlapping
intervals (bluish: overlapping intervals; deep
red: non-overlapping intervals).
14
10% and 90% quantiles for different bootstrap
types. Period: 01.06.2010 to 31.08.2010. COSMO-EU
(solid), COSMO-DE (dotted). Element: 10m wind speed.
Top: median and quantiles (green: overlapping
quantiles; red: non-overlapping quantiles).
Bottom: another visualisation of the overlapping
intervals (bluish: overlapping intervals; deep
red: non-overlapping intervals).
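The overlap check behind these figures can be sketched as follows. This is a simplified illustration, not the verification code used for the plots: it resamples individual errors rather than blocks, uses synthetic errors for two hypothetical models A and B, and treats non-overlapping 10% to 90% quantile intervals as the indication of a significant difference.

```python
import numpy as np

def bootstrap_rmse_quantiles(errors, n_replications=500, q=(10, 50, 90), seed=0):
    """Return the requested percentiles of the bootstrap RMSE distribution."""
    rng = np.random.default_rng(seed)
    x = np.asarray(errors, dtype=float)
    idx = rng.integers(0, x.size, size=(n_replications, x.size))
    rmse = np.sqrt((x[idx] ** 2).mean(axis=1))
    return np.percentile(rmse, q)

def intervals_overlap(a_low, a_high, b_low, b_high):
    """True if the closed intervals [a_low, a_high] and [b_low, b_high] intersect."""
    return a_low <= b_high and b_low <= a_high

# Synthetic forecast errors for two hypothetical models.
rng = np.random.default_rng(7)
errors_a = rng.normal(0.0, 1.0, size=200)
errors_b = rng.normal(0.0, 2.0, size=200)

a10, a50, a90 = bootstrap_rmse_quantiles(errors_a, seed=1)
b10, b50, b90 = bootstrap_rmse_quantiles(errors_b, seed=2)
overlap = intervals_overlap(a10, a90, b10, b90)
print(f"model A: {a10:.3f} {a50:.3f} {a90:.3f}")
print(f"model B: {b10:.3f} {b50:.3f} {b90:.3f}")
print("intervals overlap" if overlap else "intervals do not overlap")
```

Applying the same check per forecast time and per grouping scheme would give the kind of overlapping and non-overlapping markings described in the captions.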
15
Comparison of overlapping quantile intervals for
different wind directions. NW: north-westerly
flow, SW: south-westerly flow, NO: north-easterly
flow, SO: south-easterly flow.
16
Comparison of overlapping quantile intervals for
different wind directions. NW: north-westerly
flow, SW: south-westerly flow, NO: north-easterly
flow, SO: south-easterly flow.
17
Some typical situations occurring during
operational verification in 2009, 2010 and 2011
18
Conclusions
  • Different ways of grouping the samples lead to
    different results concerning the statistical
    significance of the model errors.
  • Block methods give more or less equivalent
    results.
  • The results for the comparison of different
    models may lead users to a decision about which
    model should be used.
  • The results for different weather types (flow
    directions) may give developers some hints
    concerning the development of model physics.

19
References
  • Efron, B., Tibshirani, R.J. (1993): An
    Introduction to the Bootstrap. Chapman &
    Hall/CRC Monographs on Statistics & Applied
    Probability.
  • Mudelsee, M. (2010): Climate Time Series
    Analysis: Classical Statistical and Bootstrap
    Methods. Springer, Dordrecht, Heidelberg,
    London, New York.