Fault Tolerance in Distributed Systems - PowerPoint PPT Presentation

1 / 26
About This Presentation
Title:

Fault Tolerance in Distributed Systems

Description:

Impossibility of Asynchronous Consensus ... similar structure as the impossibility proof, assume P is such protocol ... – PowerPoint PPT presentation

Number of Views:20
Avg rating:3.0/5.0
Slides: 27
Provided by: Pao3
Category:

less

Transcript and Presenter's Notes

Title: Fault Tolerance in Distributed Systems


1
Fault Tolerance in Distributed Systems
  • Why
  • increasing number of components raises
    probability that some may fail
  • spread of applications with high reliability
    requirements
  • Main Approaches
  • robust algorithms guarantee correct behaviour
    in the presence of failures
  • self-stabilization guarantee recovery from
    effects of failures
  • Failure Models
  • initially dead processes
  • crash failures
  • Byzantine(malign) failures
  • omission failures
  • link failures
  • timing failures

2
Decision Problems
  • Each correct process irrevocably writes a
    decision value to the output variable.
  • Requirements
  • termination all correct processes decide
  • consistency the decision values satisfy some
    consistency conditions
  • non-triviality exclude algorithms not depending
    on the input
  • Examples
  • commit-abort
  • consensus on input
  • election
  • approximate agreement

3
Asynchronous Setting - Preliminaries
  • t-crash fair execution at least n-t processes
    execute infinitely many events, and each message
    sent to a correct process is eventually received.
  • An algorithm A is 1-crash-robust consensus
    algorithm if it satisfies the following three
    properties
  • termination all correct processes decide
  • consistency all correct processes decide on the
    same value
  • non-triviality both 0 and 1 are possible
    decision values
  • Lemma events occurring at disjoint sets of
    processes commute

4
Asynchronous Setting Preliminaries II
  • v-decided configuration
  • decided configuration
  • v-valent configuration
  • bivalent configuration
  • fork configuration (actions of at most t
    processes can force both outcomes)
  • Lemma A crash of at most t processes must be
    survived by the remaining processes For each
    reachable configuration g of t-robust algorithm
    and for each subset S of at least n-t processes,
    there exists a decided configuration d such that
    g d
  • Lemma There exist no reachable fork.

5
Impossibility of Asynchronous Consensus
Theorem There exists no asynchronous,
deterministic, 1-crash-robust consensus
algorithm. Let A be 1-crash-robust consensus
algorithm. Init-Lemma There exists a bivalent
initial configuration for A. - from
non-triviality Cont-Lemma Let g be a reachable
bivalent configuration and s an applicable step
for process p in g. There exists a sequence r of
events such that s is applicable in r(g) and
s(r(g)) is bivalent.
6
Impossibility of Asynchronous Consensus II
  • Proof of the theorem
  • start with the initial bivalent configuration
  • choose s to be the longest pending event and
    apply Cont-Lemma.
  • the resulting computation never decides
  • Let A be 1-crash-robust consensus algorithm.
  • Discussion possible remedies
  • weaker fault model (e.g. initially dead
    processes)
  • weaker coordination requirements (e.g. renaming)
  • randomization
  • weaker termination requirements (Byzantine
    broadcast, termination required only when the
    source is correct)
  • synchrony

7
Initially Dead Processes
  • Observation
  • solvable only if tltn/2
  • if a message is received from a process, that
    process will remain correct
  • Algorithm for process p
  • shout p
  • wait for at least L(n1)/2 shout messages
    and add the received ids to the list Succp
  • shout ltp, Succpgt
  • Alivep Succp , Recvp ?
  • while Alivep ? Recvp
  • receive ltq, Succqgt
  • add q to Recvp and add q and Succq to Succp
  • copute a knot in G

8
Initially Dead Processes II
  • Correctness
  • receiving a name defines an oriented edge
  • strongly connected components of this graph,
    (without outgoing edges) are called knots
  • each knot has at least L nodes
  • since 2Lgtn, there could be at most one knot
  • after termination, each process has received the
    successors for each of its descendants and can
    compute the knot
  • Analysis/Discussion
  • every correct process knows the same knot, the
    leader can easily be selected and decide on its
    input value
  • O(n2) messages of length L
  • more efficient solutions exist (O(n(tlog n))

9
Relaxing Coordination Requirements
  • a problem T to be solved can be described by a
    mapping from possible inputs Xn to sets of
    outputs P(Dn)
  • examples
  • consensus (0,0,0), (1,1,)
  • election (1,0,,0), (0,1,0,),
  • approximate agreemet (d0, d1, , dn-1),
    di-djlt?
  • renaming (d0, d1, , dn-1), i?j ? di ? dj
  • t-crash robust solution for task T
  • termination in every t-crash fair execution,
    all correct processes decide messages of length L
  • consistency if all processes are correct, the
    decision vector is among the allowed ones for the
    given input

10
Probabilistic Consensus
  • Relaxed termination condition
  • lim Prcorrect process has not decided after k
    steps 0
  • Fair schedulingThe algorithms works in rounds.
    In a round k a process shouts a message and waits
    for n-t messages. Let R(q,p,k) is the event that
    in round k process p receives (round k) message
    from q among the first n-t messages. Fair
    scheduling means
  • ??gt0 ? p,q,k PrR(p,q,k) ??
  • ? k, p,q,r R(q,p,k) and R(q,r,k) are
    independent
  • Upper bound on t There is no t-crash-robust
    consensus protocol for t?n/2.
  • similar structure as the impossibility proof,
    assume P is such protocol
  • P has bivalent initial configuration.
  • let S and T be two disjoint sets of processes of
    size t
  • for a reachable configuration g, g is either
    S-0-valent and T-0-valent or S-1-valent and
    T-1-valent (from commutativity of disjoint
    events)
  • there is no reachable bivalent configuration

k??
11
Probabilistic Consensus Crash Faults
  • The algorithm
  • shout (vote, p, round, weight)
  • count how many times you have received each vote
    and note whether you have received any votes with
    weight gt n/2 (witness votes)
  • if a witness vote has been received, that is
    what you will vote in the next round, otherwise
    choose majority (the weight is how many times
    this vote has been received)
  • if more then t witnesses has been received,
    decide on their value, shout your decided value
    for two more rounds and terminate
  • Correctness
  • in any round, no two processes witness for two
    different values
  • if a process decides, then all correct processes
    decide for the same value and at most two rounds
    later
  • lim PrNo decision is taken in round ? k0
  • from fair scheduling there is a positive
    probability that the n-t correct processes
    communicate with each other in three
    consecutive rounds

k??
12
Probabilistic Consensus Byzantine Faults
  • Note a process must be able to distinguish the
    source of the received message, otherwise a
    single malicious node can fool everybody
  • Upper bound on t There is no t-Byzantine robust
    protocol for t ? n/3
  • there exists initial bivalent configuration
    (non-triviality)
  • there exist sets of processes such that S and T
    such that S ? n-t, T ? n-1, S ?T ? t (from
    t ? n/3 )
  • for a reachable configuration g, g is either
    S-0-valent and T-0-valent or S-1-valent and
    T-1-valent
  • by contradiction, if the correct processes of S
    and T alone can reach different decisions, then
    the malicious nodes in the intersection can
    simulate that to both correct parts of S and T
    and force them to decide differently
  • there is no reachable bivalent configuration
  • by contradiction, assume g is bivalent and
    (wlog) S-1-valent and T-1-valent. There is
    0-valent d reachable from g. On the path from g
    to d there are g1 and g0 such that gv is
    v-valent. The node in which event transforming g1
    into g0 occurred can be neither in S, nor in T.

13
Probabilistic Consensus Byzantine Faults
  • Idea of the algorithm
  • rounds with voting
  • a process decides when it receives at least
    (nt)/2 votes for the same value
  • receiving a vote does not mean directly
    receiving a message with that vote (the adversary
    nodes could fool the correct ones by sending
    different votes to different nodes), but is
    replaced by two step protocol
  • shout your vote
  • forward received votes to all neighbours
  • accept a vote if its echo has been received from
    at least (nt)/2 nodes

14
Probabilistic Consensus Byzantine Faults
The algorithm for processor p while (true) do
initialize roundp and arrays msgsp and echosp
to 0 shout(init-vote, p, valp, roundp) /
accept the first n-t votes in this round / while
(msgsp0 msgsp1 lt n-t) do
receive(msgtype, r, v, rn) from q if
((msgtype, r, , rn) has already been received
from q or (msgtype init-vote) and
(qltgtr)) then ignore it, q repeats/lies,
it is byzantine else if (rn gt roundp)
process the message when roundp reaches rn
else if (msgtype init-vote) then
shout(echo-vote, r,v,rn) else if (msg-type
echo-vote) and (roundp rn) then
echospr,v if (echospr,v (nt)/2
1) then msgspv endwhile valuep
majority of msgsp if (msgspvaluep gt
(nt)/2 then decide valuep roundp endwhile

15
Probabilistic Consensus Byzantine Faults
  • Correctness
  • if a correct process p accepts a vote v for
    correct process r in round k then r indeed voted
    for v
  • at least one of the received messages must have
    been forwarded by a correct process
  • if correct processes p and q accept a vote for
    process r in round k, they both accept the same
    vote
  • p and q have received the echo of rs vote
    through at least one common correct process
  • if all processes start round k then all correct
    processes complete round k
  • since n-tgt(nt)/2, a vote from a correct process
    will be accepted, unless n-t votes have already
    been accepted

16
Probabilistic Consensus Byzantine Faults
  • Correctness II
  • if a correct process decides on v in round k,
    then all correct processes choose v in round k
    and the remaining rounds
  • lim PrCorrect p has not decided before round
    k 0
  • with positive probability the correct processes
    accept in round k from the same collection of n-t
    processors and in round k1 only votes for
    processes in S
  • if all correct processes start the algorithm
    with input v, a decision for v is eventually taken

k??
17
FT in Synchronous Systems
  • crashes can be identified and easily dealt with
  • we focus of Byzantine failures
  • bounded-delay discussion
  • Byzantine broadcast problem
  • The n/3 upper bound on t
  • Byzantine broadcast algorithm
  • exponential
  • polynomial
  • Byzantine broadcast with signatures
  • exponential
  • polynomial

18
Byzantine Broadcast
  • one process is the general, containing the
    initial value xg from some set V
  • at most t processes may fail in arbitrary
    (Byzantine) way, including the general
  • the goals of Byzantine broadcast are
  • Termination. Every correct process p will decide
    on a value yp from V
  • Agreement. All correct processes decide on the
    same value.
  • Dependence. If the general is correct, all
    correct processes decide on xg
  • Theorem There is not-Byzantine robust broadcast
    protocol for t ? n/3

19
Byzantine Broadcast with Authentication
  • each process can sign messages ltmsggtp,
    ltmsggtpq
  • a byzantine process cannot forge others
    signatures
  • 2 algorithms
  • KeepSigning Lamport, Shostak, Pease
  • (exponential communication, t1 steps)
  • FirstTwo Dolev, Strong (quadratic
    communication, t1 steps)
  • KeepSigning
  • Step 1 general shouts ltvalue xggtg
  • Step i receive messages ltvaluexgtgpl2p
    l3pli
  • If a message is correct, add your
    signature and shout it.
  • Remember values seen in correct
    messages (in set Wp)
  • Step t1
  • If Wp contains single value v, decide v,
    otherwise decide default (0)

20
Byzantine Broadcast with Authentication
  • KeepSigning Correctness
  • Termination in step t1
  • Dependence If the general is correct with value
    v, only value v will appear in the sets Wp and
    all correct processes will decide on v.
  • Agreement If the general is not correct, all
    correct processes will have the same sets Wp and
    decide the same value.
  • Let p inserted v into Wp after seeing message
    ltvaluevgtgpl2p l3pli in round i.
  • if qs signature is listed there, q already
    inserted v into Wq
  • if iltt1, p will send the message to q and q
    will insert v in round i1
  • if it1, at least one of the signatures is of a
    correct process, which has sent v to q

21
Byzantine Broadcast with Authentication
  • Algorithm FirstTwo
  • it is not necessary to forward all messages,
    just to detect whether there are two different
    values signed by the general
  • shout only the first two different values you
    learn about
  • if, after t1 steps, two values have been seen,
    decide general lies, otherwise decide on the
    single value
  • Message complexity O(n2)
  • Note
  • up to n-1 faults can be tolerated

22
Synchronous Byzantine Broadcast
  • Preliminaries
  • the algorithm works in rounds, shouting either
    value 1 or the processors identities
  • Rpq,v means p has received value v from q,
    initialized to false
  • idea gather enough evidence that the general
    said 1
  • low threshold Lt1, high threshold H2t1
  • Three activities at process p
  • supporting set Sp p decides to supports q
    iff
  • p has received value 1 directly from q (direct
    support)
  • r Rpr,q ? L (indirect support)
  • p supports q by shouting q
  • confirming set Cp q r Rpr,q ? H
  • initiating
  • general with value 1 initiates
  • a process receiving 1 from the general in step 1
    initiates
  • a process in step i initiates if it has
    confirmed at least Th(i)Lmax(0,i/2-1)
    lieutenants
  • initiation means shouting value 1

23
Synchronous Byzantine Broadcast II
The algorithm for process p Initialization
boolean Rp, false boolean inip (p
g) and (xp 1) Step i / sending / if
inip then shout lt1gt shout ltqgt for all q in your
supporting set Sp receive all messages of this
step / processing, update Sp and Cp / if i1
and Rpg,v then inip true if CpL ? Th(i)
then inip true if i2t3 then / deciding
/ if Cp ? H then decide 1 else decide 0
24
Synchronous Byzantine Broadcast III
  • Correctness
  • if correct p supports q then q has sent lt1gt
  • if p supports q indirectly, then some correct
    process supported q
  • therefore there must be first direct support,
    hence q sent lt1gt
  • if a correct process p confirms q then all
    correct processes confirm q at most one step
    later
  • at least L correct processes supported q,
    therefore each correct process received at least
    L supports for q so the next step all correct
    processes support q, after that each correct
    process confirms q
  • simultaneous termination follows directly from
    the algorithm (step 2t3)
  • dependence
  • if the general shouts lt1gt, every correct
    processor will initiate, then all of them will
    support each other, then confirm and at step 2t3
    decide on 1
  • if the general does not have 1, only faulty
    processes can start shouting lt1gt but there are
    not enough of them to break the L threshold
    needed for support

25
Synchronous Byzantine Broadcast IV
  • Correctness (agreement)
  • sufficient to consider the case of Byzantine
    general
  • Lemma If at least L processes (of set A)
    initiate at the end of pulse i for ilt2t then all
    correct processes decide on the value 1
  • at step i2 all correct processes have confirmed
    processes in A
  • we show that all of them initiate as well
  • if i1, A ? LTh(3)
  • if igt1
  • there is a process r that initiated for the
    first time in round i, therefore it has confirmed
    at least Th(i) lieutenants
  • these lieutenants will be confirmed by all
    correct processes by the end of round i1
  • r is not among them, as it shouts lt1gt for the
    first time in round i1
  • but all correct processes confirm r by the end
    of pulse i2
  • therefore at the end of pulse i2 each correct
    process confirms Th(i)1Th(i2) lieutenants and
    initiates
  • by step i4 all correct processes are confirmed
  • since ilt2t, i4lt2t3 and at time 2t3 every
    correct process decides

26
Synchronous Byzantine Broadcast V
  • Lemma If at least L correct processes initiate
    and let i be the first step in which L correct
    processes initiate. Then ilt2t.
  • In order a correct process to initiate, it must
    confirm at least Th(2t)L(t-1) lieutenants, from
    which at most t-1 are faulty (the general is
    faulty). Hence, at least L correct lieutenants
    must have been confirmed, i.e. they have been
    initiated in an earlier step.
  • Note
  • each process sends each of the n1 possible
    messages at most over each link, resulting in
    O(n3) message complexity (O(log n) bit messages)
  • if ngt3t1, a subset of 3t fixed lieutenants is
    chosen and they announce the result to the
    remaining passive processes
  • a passive process accepts value with
    multiplicity at least t1
  • complexity O(t3tn)
Write a Comment
User Comments (0)
About PowerShow.com