Introductory Forecasting Concepts

1 / 135
About This Presentation
Title:

Introductory Forecasting Concepts

Description:

Don't confuse information with intuition/guessing/'human pattern recognition' 5 ... Patterns existing in customer specific data that we could exploit for improved ... – PowerPoint PPT presentation

Number of Views:53
Avg rating:3.0/5.0
Slides: 136
Provided by: dana94

less

Transcript and Presenter's Notes

Title: Introductory Forecasting Concepts


1
Introductory Forecasting Concepts
2
Outline
  • Information vs. intuition and the value of data
  • Past vs. future performance
  • Accuracy and its limits
  • Prediction intervals
  • Forecast accuracy and decisions
  • Aggregate forecasting
  • Averaging and smoothing
  • Monitoring forecasting systems
  • Dealing with alerts and outliers

3
Forecasting
  • The art of using information to predict future
    behavior of some time series
  • There is no magic math that allows clairvoyance

4
Forecasting
  • This course focuses on using historical data for
    forecasting
  • This should not diminish the importance of other
    sources of information and common sense
  • Information consists of
  • Historical data on our time series
  • Insight/knowledge and common sense
  • Dont confuse information with intuition/guessing/
    human pattern recognition

5
Information and Intuition
  • Example from a forecasting paper in the electric
    power industry
  • Total kilowatt-hour usage in Georgia,
    Mississippi, and the gulf region appears to
    follow a 12 month seasonal pattern while Alabama
    follows a 13 month season.
  • What do you think?

6
On the other hand..
  • People see Elvis visage in a cloud
  • Humans see patterns in purely random systems
  • Dont confuse information with intuition
  • Lets try a case study. Forecast a real time
    series from scratch using intuition (this is real
    data!)

7
(No Transcript)
8
Well guess same as last month
9
(No Transcript)
10
Well guess same as last month plus a little more
for a possible trend
11
This is easy, who needs forecasting
12
Continue with our successful method guess the
same as last month plus a little more for a
possible trend
13
(No Transcript)
14
Definitely looks like a trend
15
(No Transcript)
16
Trend might be a tad steeper than I thought
17
Opps
18
Momentary deviation, trend will continue
19
See, I told you this was easy!
20
Trend will continue
21
Opps, another momentary fluctuation
22
Trend should continue
23
Oh oh!
24
Sales has leveled off Lets average last few
points
25
Oh oh, maybe things are going down hill
26
Lets be conservative and Assume a negative trend
27
Thank goodness, we are still basically level
28
Well guess same as last month
29
This stuff is easy
30
We have for sure leveled off
31
Big trouble!!! Chief forecaster Smith and CEO
Smothers fired!
32
New chief forecaster points out the obvious trend
33
Remarkable turnaround in sales. New CEO Smithers
given credit
34
Still looks like a trend to me
35
Maybe not!
36
Level except for anomaly
37
Have things turned around?
38
Ill hedge my bets
39
Things have turned around. Perhaps Smithers truly
is a genius
40
Trend up!
41
Not bad!
42
Revise trend a tad
43
Smithers makes cover of Fortune
Smithers
Smothers
44
This is easy!!
45
No big deal, trend continues
(in an unrelated matter Smithers cashes out
stock options)
46
(No Transcript)
47
Heads will surely roll soon
48
Lets be cautiously optimistic
49
Smithers called before board
50
(No Transcript)
51
Perhaps we over reacted
52
We will guess level
53
Back to normal!
54
(No Transcript)
55
Smithers fired!
56
What have we learned?
  • Our Actual sales appears to be a great leading
    indicator of our forecast
  • It is supposed to work the other way around!!!!
  • If we add up the (absolute value of) our forecast
    errors, we get 226.2
  • If we had simply guessed same as last month we
    get 175.1

57
Information vs. Intuition
  • Our intuition (ability to recognize a pattern)
    was poor given almost no information or data.
    Never-the-less we saw patterns.
  • For monthly data we can be tempted to over
    think forecasting.
  • Now some additional information
  • Source of data is monthly sales of Australian Red
    Wine
  • We also have a few years of data

58
(No Transcript)
59
We observe
  • Series behavior includes
  • Clear seasonal behavior
  • A clear upward trend
  • An increase in amplitude as the level increases

60
Value of Data
  • Given data, we can forecast this series quite
    accurately.
  • This assumes stable behavior
  • Recommend at least 4 - 5 seasons of data.
  • Monthly demand thus prefers 4 to 5 years of data
  • With 2 years of data, we are essentially
    forecasting on the basis of two points if there
    is seasonal behavior

61
1st Law of Forecasting Past vs. Future
Performance
  • In forecasting, we assume the future will behave
    like the past
  • If behavior changes, our forecasts can be really
    really terrible

62
Here is an easy series to forecast!
63
No method based on historical behavior could see
this coming
64
Corollary to the 1st Law of Forecasting
  • In the real world, the future often does not
    behave like the past.
  • The behavior of many time series changes fairly
    frequently.

65
2nd Law of ForecastingForecast Accuracy
Limitations
  • Even given that the future behaves like the past,
    there is a limit to how accurate forecasts can be
    (or nothing can be predicted with complete
    accuracy)
  • Typical times series have a signal component
    and a noise component
  • The achievable accuracy depends on the magnitude
    of the noise component

66
Corollary to the 2nd Law of Forecasting
  • All forecasts will be wrong
  • The key issue is How close will the forecast be
    to the actual value?
  • It is crucial to attempt to quantify the expected
    accuracy of a forecast

67
Prediction Intervals
  • We can create prediction intervals of the basic
    form
  • There is a xx probability that the actual
    future value will fall in the range
  • Forecast ? K
  • Example

68
Now
Future forecast
Past Data
69
(No Transcript)
70
(No Transcript)
71
Prediction Intervals
  • Details become complex (and will come later)
  • Prediction intervals based on assumptions
  • The future behaves like the past
  • The model used to calculate the prediction
    interval is an accurate model of the data
  • Prediction intervals tend to be optimistic
    (because of the assumptions)
  • Never-the-less, look how wide they are!

72
3rd Law of Forecasting
  • The further into the future you attempt to
    forecast, the greater will be the forecast error.
  • This is typically true even if the future behaves
    like the past.
  • (as we can see from the prediction intervals)

73
Corollary to 3rd Law of Forecasting
  • Major decisions are often based on long term
    forecasts.
  • e.g. building a new plant
  • Considering risk is even more important in these
    cases
  • Example Fiber optic production capacity

74
Decisions and the 2nd Law of Forecasting
  • Decisions will be based on the forecast
  • (Otherwise there is no need to forecast!)
  • One should consider the ramifications of a
    decision over the range of possible outcomes
  • That is forecasts have inherent error, thus the
    decisions based on forecasts have inherent risk
  • Rational decision making includes analysis of
    risk
  • You would never buy stock solely on the basis of
    expected return

75
(No Transcript)
76
Sales forecast
77
Forecast prediction interval
78
Forecast prediction interval
79
Forecasting and Decisions
  • We forecast as part of a decision making process
  • It is important to keep in mind the decisions
    that are based on the forecast
  • Keeping this link in mind can help guide us
  • When and how often to forecast
  • At what level of aggregation to forecast
  • Etc.

80
Aggregate Forecasting
  • Suppose we forecast demand for product X at the
    individual customer level.
  • Suppose we also need a forecast of total demand
    for product X for production planning
  • Should we add the individual forecasts together
    to get the forecast for overall demand for
    product X?
  • Or should we forecast total demand for product X
    directly using total demand data?

81
Research is Inconclusive
  • Define Disaggregated forecasting
  • Develop individual forecasting methods for each
    customer
  • Add the forecasts together to get forecast of
    total demand
  • Define Direct forecasting
  • Use the total monthly demand time series data to
    develop a forecasting method

82
Argument for disaggregated forecasting
  • By aggregating into total demand, we have, by
    definition, lost information
  • Patterns existing in customer specific data that
    we could exploit for improved forecasting may be
    lost/obscured through aggregation
  • Under idealized conditions, it can be shown that
    disaggregated forecasting will be at least as
    good as direct forecasting
  • Assumptions for this are arguable however

83
Argument for Direct Forecasting
  • Idealized circumstances never true in practice
  • Each individual forecast requires specification
    of models and model parameter estimates both of
    which are subject to error
  • These errors are compounded in disaggregated
    forecasting
  • Direct forecasting does not suffer from this. We
    develop one simple model.

84
Empirical Results Support Both Viewpoints
  • Direct method tends to be better when
  • Individual series are positively correlated (tend
    to move together)
  • e.g. all customers effected by the economy
  • Lower level forecasts are ad hoc and do not
    exploit all the information available
  • e.g. simple automated methods are applied

85
Empirical Results Support Both Viewpoints
  • Disaggregated method tends to be better if
  • Individual series are negatively correlated (tend
    to move oppositely)
  • e.g. sales of Product X Product Y ? constant
  • Information at the lower levels is carefully
    exploited
  • e.g. expertise is applied to forecast individual
    customer demands
  • Warning be wary of demand forecasts from Sales
    Dept

86
Disaggregated vs. Direct
  • One can use composite forecasts (combine
    disaggregated and direct forecasts
  • One can try both and test which works better in
    practice
  • What if the sum of disaggregated forecasts is not
    equal to forecast of the sum?
  • Performance difference tends to be greatest for
    near term forecasts (1 or 2 periods ahead)

87
Forecasting Fundamentals
  • Averaging
  • Smoothing
  • Modeling
  • Most forecasting techniques combine modeling and
    smoothing

88
Averaging
  • Is a basic concept in statistics
  • Is important in understanding simple smoothing
    based forecasts
  • Understand the trade off between averaging and
    data currency

89
Standard deviation of an average
  • The standard deviation of an average is inversely
    proportional to the square root of the number of
    data points in the average
  • The larger the sample, the better the average
    approximates the true mean
  • Ball players may hit 450 in the first two weeks,
    but not by the end of the season

90
(No Transcript)
91
Standard deviation of an average
  • As sample size increases, the accuracy of an
    average increases
  • The increase in accuracy decreases in N
  • i.e. the marginal value of more samples decreases
  • Aside How big should N be? Answer 5
  • (an obvious gross over generalization)

92
Knee of the curve
93
Forecasting with Moving Averages
  • Consider the time series generated by
  • 10 random white noise
  • (the best forecast would be 10 if we knew how
    the series behaved beforehand)
  • We will average the last 3 points (a 3 period
    moving average)
  • Then we average the last 10 points (a 10 point
    moving average)

94
(No Transcript)
95
(No Transcript)
96
(No Transcript)
97
Averaging
  • Averaging can be seen to smooth or filter out
    the noise.
  • Why not use the average of all the data?
  • Because things rarely remain the same

98
(No Transcript)
99
(No Transcript)
100
Averaging
  • Long averages are good since they reduce the
    variance and thus the forecast error
  • Short averages are good because the respond more
    quickly to changes
  • How much data should we include in our average?
  • There is a trade off

101
Moving Averages
  • Moving averages are common forecasting techniques
  • Their best feature is their simplicity everyone
    can understand them
  • Consider the weights attached to past data points
    in a moving average

102
(No Transcript)
103
Moving Averages
  • We want less weight on past data points so the
    forecast can adapt to changes
  • Why then do we put equal weight (0.2) on points 4
    and 5 periods ago?
  • Why do we give a weight of 0.2 to 5 periods ago
    but zero weight to six periods ago?
  • Why not discount weight based on age?

104
(No Transcript)
105
Moving Average
106
Moving Average
Exponential Smoothing
107
Average age of data is the same in this example
108
Average age of data is the same in this example
ES gives decreasingly less weight to past data
109
Exponential Smoothing
  • With equivalent average age, the forecast error
    variance will be the same (under a constant
    model)
  • Since more weight is on recent points, ES should
    adapt to changes more quickly
  • The weighting scheme is more intuitively
    appealing
  • In general, ES is preferred to MA
  • The only disadvantage is its complexity

110
Time Series Modeling
  • The science of extracting a signal from noise
  • Requires a fair amount of data
  • Can be very effective as long as the behavior of
    the time series does not change much.

111
Simple Models
  • Constant Mean Random Noise
  • Trend Random Noise
  • Trend Seasonal Factor Random Noise
  • These are the basis for the models in SAP and
    other common planning systems

112
Monitoring and Alerts
  • Especially useful when automatically forecasting
    many time series
  • Basic idea
  • Compare forecast to actual value
  • If error is too big, issue an alert
  • Alert dealt with via human intervention
  • Automatic adaptation methods also exist
  • These are not without dangers

113
Monitoring
  • We can monitor for
  • Magnitude of forecast errors
  • Forecast bias

114
(No Transcript)
115
Looks like something happened around here
116
Could devise Control Limits
117
Story clearer using AD
118
Clearer still using Smoothed AD
119
Monitoring for Bias
  • Example
  • Can you spot anything in the following plot of
    forecast error Actual Forecast?

120
(No Transcript)
121
  • From time 0 to 25, White noise with a mean of
    zero
  • From time 26 to 50, White noise with a mean of
    0.2
  • After time 26, the forecast is too low by 0.2 on
    average
  • Now consider Error Total
  • Sum of all errors up to now
  • Should stay close to zero

122
(No Transcript)
123
Tracking Signal
  • Standardize Error Total
  • Tracking Signal Error total / Smoothed MAD

124
(No Transcript)
125
Alerts and Outliers
  • It is crucial to investigate alerts and outliers
    and find their cause
  • It may be a fluke or an error that should be
    ignored
  • It may be the most important point in the data
    set indicating entirely new behavior
  • i.e. it contains much information

126
Example
  • Lets look at my monthly power bill and that of
    three colleagues
  • This example is real!

127
Actual
Alert!
Forecast
128
Outliers and Alerts
  • Forecast is way off.
  • What happened?
  • What should be our next forecast?
  • Close to the prior pattern?
  • Close to the most recent observation?

129
Example
  • Lets investigate my power bill outlier
  • Facts of the case
  • From Jan to July 31, we were on sabbatical and
    rented out our house
  • We returned Aug. 1
  • My family still wonders what those switchy
    thingys on the wall are
  • They like fresh air and AC together

130
Outliers
  • After investigation, we know what to do
  • We expect power usage to stay at its new, higher
    level
  • We can adjust our forecast accordingly
  • We might also adjust our business practice in
    important new ways on the basis of what we have
    learned
  • Throwing out outliers simply because they are
    outliers is very bad practice

131
Data with many zeros
  • 100 APCI Demand data sets

132
How do we deal with this?
133
Assume a Demand Distribution
134
Poisson fit to data
  • We could fit a Poisson to the data
  • We could use the Poisson mean as the forecast
  • This assumes each demand outcome iid
  • There exist methods that try to track the
    parameter(s) of the demand distribution over time
  • see parameter driven state space models

135
Thank you
Write a Comment
User Comments (0)