V' Simulation Management and Output Analysis - PowerPoint PPT Presentation

1 / 21
About This Presentation
Title:

V' Simulation Management and Output Analysis

Description:

... arrivals) is generated by a stream of random numbers from a LCG beginning ... Positive serial correlation can lead to false assessment of steady state (Banks ... – PowerPoint PPT presentation

Number of Views:19
Avg rating:3.0/5.0
Slides: 22
Provided by: mpeter2
Category:

less

Transcript and Presenter's Notes

Title: V' Simulation Management and Output Analysis


1
V. Simulation Management and Output Analysis
  • M. Peter Jurkat
  • CS452/Mgt532 Simulation for Managerial Decisions
  • The Robert O. Anderson Schools of Management
  • University of New Mexico

2
Simulation Management
  • Simulation output depends on
  • Model being simulation
  • Parameter values (see DOE next slide set)
  • Initial conditions, including random number
    generator(s) and seed(s)
  • Terminating conditions
  • Number of replications

3
Stochastic Process Simulation
  • Each stochastic variable (e.g., time between
    arrivals) is generated by a stream of random
    numbers from a LCG beginning with a particular
    seed value gt need multiple runs and statistical
    analysis for stochastic models (i.e., sampling)
  • The same LCG with the same seed, x0, will always
    generate the same sequence of pseudo-random
    numbers gt repeated simulation with the same
    inputs and the same seed will result in identical
    outputs
  • For independent replications need a different
    seed for each replication (F9 in Excel will do
    that)

4
Simulation Sampling
  • Each simulation run may yield only one value (a
    single sample value) of each DV (e.g., average
    waiting time, proportion of time server is idle)
  • Need replications to gain statistical stability
    and significance in output distribution
  • Repeated runs are akin to a sample from a
    population of all such runs with a given set of
    exogenous variables stochastic inputs cause the
    variation need good data for distribution (at
    least mean and variance) of stochastic variables

5
Simulation Replications
  • Two approaches
  • Independent replications/samples different
    seeds/ random number generators for random
    variable experimental variation (EV/DOE) for
    others
  • Good news standard statistics based on
    independent random samples apply
  • Bad news differences may be due to different
    random number
  • Generators may mask lack of differences in system
    avoid with sufficient sample size and
    statistical tests
  • Correlated replications same seeds/random number
    generators for each random variable EV/DOE
  • Good news differences must be due to model
    EV/DOE
  • Bad news need to use special statistical
    procedures for analysis (e.g., stat procedures
    which require independent samples not applicable)
    may not actually need statistics

6
Collecting Output Measures(usual 7 and others)
  • During simulation
  • many values for a given DV per run (e.g., Yi
    in Q)
  • Compare at same time between various runs
  • At end of simulation
  • one value per run (e.g., LQ time average of Yi
    in Q)
  • Time of termination, TE
  • Maximum, minimum of Yi and/or Y(t)
  • Simple average of Yi over one run for discrete
    measures possibly not independent samples due
    to serial correlation can test
  • Time-Averages of Y(t) for continuous measures
  • Quantiles value for which given proportion of
    Yi and/or Y(t) values are lower or higher

7
Terminating Simulations
  • Seek description of one system instance
    termination at TE due to
  • Schedule then analyze other measures
  • System failure time of termination (i.e., time
    of system failure) is often desired output
    measure
  • Multiple replications yield distribution of
    expected time to failure and time to failure
    distribution
  • Other output measures gathered during simulation
    may indicated reasons for failure

8
Non-Terminating
  • Seek steady-state behavior
  • Often difficult to achieve how recognized under
    stochastic input?
  • Stability of measure or its moving
    average/derivative
  • Initialization bias (chaos?) avoid by
  • Use near steady state conditions for initial
    conditions often unknown!!!
  • Analytic solution of simpler system
  • Analyze by batching/lag analysis/deleting batches
    at beginning of run see Banks et al, Table 11.6
  • Usually needs many, long, independent
    replications with different starting parameters
    can lead to chaos for non-linear systems

9
Serial Correlation
  • Positive correlation (serious!) between values
    gathered during simulation arises from taking
    data at intervals short compared to systems
    natural frequency negative correlation (often
    benign) from intervals long compared to system
    natural frequency
  • Positive serial correlation can lead to false
    assessment of steady state (Banks et al, Figure
    11.7)
  • Can determine serial correlation by calculating
    the lag-1 correlation if small, may conclude
    low serial correlation since higher lag
    correlations usually smaller see TIMXCORR.XLS

10
Single System
  • Single system often implies steady-state analysis
  • For independent replications output estimates
    based on averages, variances, and confidence
    intervals
  • Needs sample size determination for given desired
    precision
  • If variance of results is unknown, may need pilot
    study to determine variance and needed sample
    size
  • For correlated replications can use Bonferroni
    methods (weak)

11
Confidence Intervals
  • Single system analysis based on confidence
    intervals for (1-a) confidence, interval for
    true mean is given by
  • sample mean /- increment
  • Increment ta,n-1sample s.d./sqrt(n)
  • (see Section 11.5.3 and equation 11.38)
  • Excel procedure Descriptive Statistics calculates
    increment
  • see MonteCarlo-NewFab.xls

12
Competing Systems Analysis
  • Most simulation studies can be considered
    analyzing competing systems - system with changed
    parameter value vs. base system
  • One basic analytic measure is the confidence
    interval for the difference of the means of the
    two run outcomes (equations 12.1 12.6) if it
    contains zero it is concluded there is no
    difference
  • For C such runs there will be need to be
    C!/2!(C-2)! confidence intervals for the
    difference of the two means see Bonferroni
    approximation to significance (Banks Section
    12.2.1)

13
Other Competing Systems Tests
  • Intervals not only, nor best, analysis
  • For independent replication comparison of
  • Two systems t-test for independent samples see
    TwoPopTests.xls
  • More than two systems ANOVA see ANOVA.XLS
  • Example 12.1 Banks12t2.xls
  • For correlated replications comparison of
  • Two systems t-test for matched samples see
    TwoPopTests.xls
  • More than two confidence intervals using
    Bonferroni methods Weak result since it does
    not specify probability of Type I error exactly
    and lower a increases chance of Type II error

14
Analysis of Results of Simulating More than Two
Models
  • May want to consider improvement to current
    situation due to variation of several factors at
    several levels, e.g.,
  • Number of servers 1, 2, or 3, and
  • Amount of training none, one week, or one month
  • All combinations yields 9 different models
  • T-test of all differences inefficient since
    probability of Type I error for all levels
    increases as the sum of the Type I error for each
    level need large sample sizes to get error
    below acceptable level Bonferroni approach
  • Use Analysis of Variance (ANOVA) or DOE

15
ANOVA Terminology and Notation
  • SS sum of squared deviations from the mean -
    usually just called sum of squares
  • k number of levels indexed by j 1k
  • different values of a parameter
  • yields k simulations whose results to be compared
  • nj number of replications for level j indexed
    by i 1nj equivalent to k samples
  • N total number of trials across all levels and
    replications N n1 n2 nk

16
Example One Factor at Three Levels with Three
Replications
17
Basic Measure of DifferencesSums of Square
Deviations
  • Sum of Squares Total (SST) measures overall
    variation
  • Sum of Squares Regression (SSR) for the level
    means measures the between levels effect - the
    signal - also often called the between (SSB) or
    treatment (SST) SS
  • Sum of Squares Error (SSE) for all levels
    measures the within levels variation - the
    noise - also often called the within (SSW) or
    residual (SSR) SS
  • See ANOVA.XLS for example SSR, SSE, and SST

18
Mean Squares (MS) Variances
  • Sum of Squares divided by degrees of freedom (df
    data used in estimate minus number of
    population parameters estimated)
  • dfR k - 1 dfT N - 1 dfE dfT - dfR
  • MSR SSR/dfR MSE SSE/dfE
  • Since mean squares are variances, ratios of mean
    squares (which are estimates) divided by true
    variances have an F distribution

19
ANOVA Hypothesis Test
  • H0 sR sE vs. HA sR ¹ sE
  • Test statistics under null hypothesis is just
    FdfR,dfE MSR/MSE
  • Calculate P-value
  • If greater than, say, 5 then reject H0 and
    conclude levels made a significant difference
  • Else, conclude there is no significant difference
    between levels and the factor (e.g., number of
    servers) did not have a significant effect
  • ANOVA procedures in Excel make all the above
    calculation see ANOVA.XLS

20
Three or More Factor Designs
  • Use Excel to analyze one (e.g., number of
    servers) and two factor (e.g., number of servers
    and amount of server training) designs
  • To analyze more than two factor multilevel
    designs
  • Use other software such as Minitab, SPSS, SAS,
    etc. using ANOVA
  • Use DOE techniques (allows t-tests for
    independent factors)

21
Design of Analysis Approach
  • None of the prior analysis is considered complete
    complete analysis needs consistency between the
    design of the experiment and its analysis
  • Recommended approach is Design of Analysis (DOE)
    not only consistent but also efficient
  • Focuses on changes in a single DV resulting from
    simultaneous variation of multiple IVs
  • Much studied efficient designs available
  • Appropriate for determination of
  • Important (few?) parameters
  • Model formula relationship between IVs and DV
  • Optimum seeking by response surface method
Write a Comment
User Comments (0)
About PowerShow.com