Title: Continuous Time Markov Chains
1Chapter 8
- Continuous Time Markov Chains
2Markov Availability Model
32-State Markov Availability Model
- 1) Steady-state balance equations for each state
- Rate of flow IN rate of flow OUT
- State1
- State0
- 2 unknowns, 2 equations, but there is only
one independent equation.
42-State Markov Availability Model(Continued)
- 1) Need an additional equation
Downtime in minutes per year
876060
52-State Markov Availability Model(Continued)
- 2) Transient Availability
- for each state
- Rate of buildup rate of flow IN - rate of flow
OUT -
- This equation can be solved to obtain assuming
P1(0)1
62-State Markov Availability Model(Continued)
- 3)
- 4) Steady State Availability
7(No Transcript)
8(No Transcript)
9(No Transcript)
10(No Transcript)
11(No Transcript)
12(No Transcript)
13(No Transcript)
14(No Transcript)
15(No Transcript)
16(No Transcript)
17(No Transcript)
18(No Transcript)
19(No Transcript)
20(No Transcript)
21(No Transcript)
22(No Transcript)
23(No Transcript)
24Markov availability model
- Assume we have a two-component parallel redundant
system with repair rate ?. - Assume that the failure rate of both the
components is ?. - When both the components have failed, the system
is considered to have failed.
25Markov availability model (Continued)
- Let the number of properly functioning components
be the state of the system. The state space is
0,1,2 where 0 is the system down state. - We wish to examine effects of shared vs.
non-shared repair.
26Markov availability model (Continued)
2
1
0
Non-shared (independent) repair
2
1
0
Shared repair
27Markov availability model (Continued)
- Note Non-shared case can be modeled solved
using a RBD or a FTREE but shared case needs the
use of Markov chains.
28Steady-state balance equations
- For any state
- Rate of flow in Rate of flow out
- Consider the shared case
- ?i steady state probability that system is in
state i
29Steady-state balance equations (Continued)
30Steady-state balance equations (Continued)
- Steady-state unavailability ?0 1 - Ashared
- Similarly for non-shared case,
- steady-state unavailability 1 - Anon-shared
- Downtime in minutes per year (1 - A) 876060
31Steady-state balance equations
32Homework 5
- Return to the 2 control and 3 voice channels
example and assume that the control channel
failure rate is ?c, voice channel failure rate is
?v. - Repair rates are ?c and ?v, respectively.
Assuming a single shared repair facility and
control channel having preemptive repair priority
over voice channels, draw the state diagram of a
Markov availability model. Using SHARPE GUI,
solve the Markov chain for steady-state and
instantaneous availability.
33(No Transcript)
34(No Transcript)
35(No Transcript)
36(No Transcript)
37(No Transcript)
38(No Transcript)
39(No Transcript)
40Markov Reliability Model
41Markov reliability model with repair
- Consider the 2-component parallel system but
disallow repair from system down state - Note that state 0 is now an absorbing state. The
state diagram is given in the following figure. - This reliability model with repair cannot be
modeled using a reliability block diagram or a
fault tree. We need to resort to Markov chains. - (This is a form of dependency since in order
to repair a component you need to know the status
of the other component).
42Markov reliability model with repair (Continued)
Absorbing state
- Markov chain has an absorbing state. In the
steady-state, system will be in state 0 with
probability 1. Hence transient analysis is of
interest. States 1 and 2 are transient states.
43Markov reliability model with repair (Continued)
- Assume that the initial state of the Markov chain
- is 2, that is, P2(0) 1, Pk (0) 0 for k 0,
1. - Then the system of differential Equations is
written - based on
- rate of buildup rate of flow in - rate of flow
out - for each state
44Markov reliability model with repair
(Continued)
45Markov reliability model with repair
(Continued)
- After solving these equations, we get
- R(t) P2(t) P1(t)
- Recalling that
, we get
46Markov reliability model with repair
(Continued)
- Note that the MTTF of the two component
parallel redundant system, in the absence - of a repair facility (i.e., ? 0), would
have - been equal to the first term,
- 3 / ( 2? ), in the above expression.
- Therefore, the effect of a repair facility is
to - increase the mean life by ? / (2?2), or by a
- factor
47Markov Reliability Model with Imperfect Coverage
48Markov model with imperfect coverage
- Next consider a modification of the above
- example proposed by Arnold as a model of
- duplex processors of an electronic
- switching system. We assume that not all
- faults are recoverable and that c is the
- coverage factor which denotes the
- conditional probability that the system
- recovers given that a fault has occurred.
- The state diagram is now given by the
- following picture
49Now allow for Imperfect coverage
c
50Markov modelwith imperfect coverage (Continued)
- Assume that the initial state is 2 so that
- Then the system of differential equations are
51Markov model with imperfect coverage (Continued)
- After solving the differential equations we
obtain - R(t)P2(t) P1(t)
- From R(t), we can system MTTF
- It should be clear that the system MTTF and
system reliability are - critically dependent on the coverage factor.
52(No Transcript)
53(No Transcript)
54(No Transcript)
55(No Transcript)
56(No Transcript)
57(No Transcript)
58SOURCES OF COVERAGE DATA
- Measurement Data from an Operational system
Large amount of data needed - Improved Instrumentation Needed
- Fault/Error Injection Experiments
- Costly yet badly needed tools from
- CMU, Illinois, Toulouse
59SOURCES OF COVERAGE DATA (Continued)
- A Fault/Error Handling Submodel
- Phases of FEHM
- Detection, Location, Retry, Reconfig, Reboot
- Estimate Duration Prob. of success of each
phase - IBM(EDFI), HARP(FEHM), Draper(FDIR)
60Homework 6
- Modify the Markov model with imperfect coverage
to allow for finite time to detect as well as
imperfect detection. You will need to add an
extra state, say D. The rate at which detection
occurs is ? . Draw the state diagram and using
SHARPE GUI investigate the effects of detection
delay on system reliability and mean time to
failure.
61(No Transcript)
62(No Transcript)
63(No Transcript)
64(No Transcript)