ALGORITHM TYPES - PowerPoint PPT Presentation

About This Presentation
Title:

ALGORITHM TYPES

Description:

ALGORITHM TYPES Greedy, Divide and Conquer, Dynamic Programming, Random Algorithms, and Backtracking. Note the general strategy from the examples. – PowerPoint PPT presentation

Number of Views:124
Avg rating:3.0/5.0
Slides: 66
Provided by: Debasi8
Learn more at: https://cs.fit.edu
Category:
Tags: algorithm | types | none | then | there | were

less

Transcript and Presenter's Notes

Title: ALGORITHM TYPES


1
ALGORITHM TYPES
  • Greedy, Divide and Conquer, Dynamic Programming,
    Random Algorithms, and Backtracking.
  • Note the general strategy from the examples.
  • The classification is neither exhaustive (there
    may be more) nor mutually exclusive (one may
    combine).
  • We are now emphasizing design of algorithms, not
    data structures.

2
GREEDY Some Scheduling problems
  • Scheduling Algorithm Optimizing function is
    aggregate finish-time with one processor
  • (job-id, duration pairs)(j1, 15), (j2, 8), (j3,
    3), (j4, 10) aggregate FT in this order is
    15232636100
  • Note durations of tasks are getting added
    multiple times 15 (158) ((158) 3) . .
    .
  • Optimal schedule is the Greedy schedule j3, j2,
    j4, j1. Aggregate FT311213671 Let the
    lower values get added more times shortest job
    first
  • Sort O(n log n), Placing ?(n),
  • Total O(n log n),

3
MULTI-PROCESSOR SCHEDULING (Aggregate FT)
  • Optimizing Fn. Aggregate FT - Strategy Assign
    pre-ordered jobs over processors one by one
  • Input (job-id, duration)(j2, 5), (j1, 3), (j5,
    11), (j3, 6), (j4, 10), (j8, 18), (j6, 14), (j7,
    15), (j9, 20) 3 proc
  • Sort first (j1, 3), (j2, 5), (j3, 6), (j4, 10),
    (j5, 11), (j6, 14), (j7, 15), (j8, 18), (j9,
    20) // O (n log n)
  • Schedule next job on earliest available processor
  • Proc 1 j1, j4, j7
  • Proc 2 j2, j5, j8
  • Proc 3 j3, j6, j9
  • // Sort Theta(n log n), Place Theta(n),
    Total Theta(n log n)
  • Note the first task is to order jobs for a
    greedy pick up

4
MULTI-PROCESSOR SCHEDULING (Last FT)
  • Optimizing Fn. Last FT - Strategy Sort jobs in
    reverse order, assign next job on the earliest
    available processor
  • (j3, 6), (j1, 3), (j2, 5), (j4, 10), (j6, 14),
    (j5, 11), (j8, 18), (j7, 15), (j9, 20) 3 proc
  • Reverse sort-
  • (j9, 20), (j8, 18), (j7, 15), (j6, 14), (j5, 11),
    (j4, 10), (j3, 6), (j2, 5), (j1, 3)
  • Proc 1 j9 - 20, j4 - 30, j1 - 33.
  • Proc 2 j8 - 18, j5 - 29, j3 - 35,
  • Proc 3 j7 - 15, j6 - 29, j2 - 34, Last FT
    35.
  • // sort O(n log n)
  • // place naïve ?(nM), with heap over
    processors O(n log m)
  • Optimal Proc1 j2, j5, j8 Proc 2 j6, j9 Proc
    3 j1, j3, j4, j7. Last FT 34.
  • Greedy alg is not optimal algorithm here, but
    the relative err lt 1/3 - 1/3m, for m
    processors.
  • An NP-complete problem, greedy alg is polynomial
    O(n logn) for n jobs (from sorting, assignment is
    additional O(n), choice of next proc. In each
    cycle O(log m) using heap, total O(n logn n
    logm), for ngtgtm the first term dominates).

5
HUFFMAN CODES
  • Problem device a (binary) coding of alphabets
    for a text, given their frequency in the text,
    such that the total number of bits in the
    translation is minimum.
  • Encoding alphabets on leaves of a binary tree
    (each edge indicating 0 or 1).
  • Result of Weiss book Fig 10.11, page 392, for 7
    alphabets (a, 001, freq10, total3x1030
    bits), (e, 01, 15, 2x1530 bits), (i, 10, 12, 24
    bits), (s, 00000, 3, 15 bits), (t, 0001, 4, 16
    bits), (space, 11, 13, 26 bits), (newline, 00001,
    1, 5 bits), Total bits146 is the minimum
    possible.

6
HUFFMAN CODES Algorithm
  • At every iteration, form a binary tree using the
    two smallest (lowest aggregate frequency)
    available trees in a forest, the trees frequency
    is the aggregate of its leaves frequency. Start
    from a forest of all nodes (alphabets) with their
    frequencies being their weights. When the final
    single binary-tree is formed use that tree for
    encoding.
  • Example Weiss page 390-395.

7
HUFFMAN CODES Complexity
First, initialize a min-heap for n nodes
O(n) Pick 2 best (minimum) trees 2(log n) Do
that for an order of n times O(n log n) Insert
1 or 2 trees in the heap O(log n) Do that for an
order of n times O(n log n) Total O(n log n)
8
RATIONAL KNAPSACK(not in the text)
  • Given a set of objects with (Weight, Profit)
    pair, and a Knapsack of limited weight capacity
    (M), find a subset of objects for the knapsack to
    maximize profit, partial objects (broken) are
    allowed.
  • Greedy Algorithm Put objects in the KS in a
    non-increasing (high to low) order of profit
    density (profit/weight). Break the last object
    which does not fit in the KS otherwise.
  • Example (4, 12), (5, 20), (10, 10), (12, 6)
    M14.
  • Solution KS(5, 20), (4, 12), (5, 5),
    Weight14, Profit37.
  • Optimal, polynomial algorithm O(N log N) for N
    objects - from sorting.
  • 0-1 KS problem cannot break any object
    NP-complete, Greedy Algorithm is no longer
    optimal.

9
APPROXIMATE BIN PACKING
  • Problem fill in objects each of sizelt 1, in
    minimum number of bins (optimal) each of size1
    (NP-complete).
  • Example 0.2, 0.5, 0.4, 0.7, 0.1, 0.3, 0.8.
  • Solution B1 0.20.8, B2 0.30.7, B3
    0.10.40.5. All bins are full, so must be
    optimal solution (note optimal solution need not
    have all bins full).
  • Online problem do not have access to the full
    set incremental
  • Offline problem can order the set before
    starting.

10
ONLINE BIN PACKING
  • Theorem 1 No online algorithm can do better than
    4/3 of the optimal bins, for any given input
    set.
  • Proof. (by contradiction we will use a
    particular input set, on which our online
    algorithm A presumably violates the Theorem)
  • Consider input of M items of size 1/2 - k,
    followed by M items of size 1/2 k, for 0ltklt0.01
  • Optimum bin should be M for them.
  • Suppose alg A can do better than 4/3, and it
    packs first M items in b bins, which optimally
    needs M/2 bins. So, by assumption of violation of
    Thm, b/(M/2)lt4/3, or b/Mlt2/3 fact 0

11
ONLINE BIN PACKING
  • Each bin has either 1 or 2 items
  • Say, the first b bins containing x items,
  • So, x is at most or ?2b items
  • So, left out items are at least or ?(2M-x) in
    number fact 1
  • When A finishes with all 2M items, all 2-item
    bins are within the first b bins,
  • So, all of the bins after first b bins are
    1-item bins fact 2
  • fact 1 plus fact 2 after first b bins A uses
    at least or ?(2M-x) number of bins
  • or BA ? (b (2M - 2b)) 2M - b.

12
ONLINE BIN PACKING
- So, the total number of bins used by A (say,
BA) is at least or, BA ? (b (2M - 2b)) 2M
- b. - Optimal needed are M bins. - So,
(2M-b)/M lt (BA /M) ? 4/3 (by assumption), or,
b/Mgt2/3 fact 4 - CONTRADICTION between fact 0
and fact 4 gt A can never do better than 4/3 for
this input.
13
NEXT-FIT ONLINE BIN-PACKING
  • If the current item fits in the current bin put
    it there, otherwise move on to the next bin.
    Linear time with respect to items - O(n), for n
    items.
  • Example Weiss Fig 10.21, page 364.
  • Thm 2 Suppose, M optimum number of bins are
    needed for an input. Next-fit never needs more
    than 2M bins.
  • Proof Content(Bj) Content(Bj1) gt1, So,
    Wastage(Bj) Wastage(Bj1)lt2-1, Average
    wastagelt0.5, less than half space is wasted, so,
    should not need more than 2M bins.

14
FIRST-FIT ONLINE BIN-PACKING
  • Scan the existing bins, starting from the first
    bin, to find the place for the next item, if none
    exists create a new bin. O(N2) naïve, O(NlogN)
    possible, for N items.
  • Obviously cannot need more than 2M bins! Wastes
    less than Next-fit.
  • Thm 3 Never needs more than Ceiling(1.7M).
  • Proof too complicated.
  • For random (Gaussian) input sequence, it takes 2
    more than optimal, observed empirically. Great!

15
BEST-FIT ONLINE BIN-PACKING
  • Scan to find the tightest spot for each item
    (reduce wastage even further than the previous
    algorithms), if none exists create a new bin.
  • Does not improve over First-Fit in worst case in
    optimality, but does not take more worst-case
    time either! Easy to code.

16
OFFLINE BIN-PACKING
  • Create a non-increasing order (larger to smaller)
    of items first and then apply some of the same
    algorithms as before.
  • GOODNESS of FIRST-FIT NON-INCREASING ALGORITHM
  • Lemma 1 If M is optimal of bins, then all items
    put by the First-fit in the extra (M1-th bin
    onwards) bins would be of size ? 1/3 (in other
    words, all items of sizegt1/3, and possibly some
    items of size ? 1/3 go into the first M bins).
  • Proof of Lemma 1. (by contradiction)
  • Suppose the lemma is not true and the first
    object that is being put in the M1-th bin as the
    Algorithm is running, is say, si, is of sizegt1/3.
  • Note, none of the first M bins can have more than
    2 objects (size of eachgt1/3). So, they have only
    one or two objects per bin.

17
OFFLINE BIN-PACKING
  • Proof of Lemma 1 continued.
  • We will prove that the first j bins (0 ? j?M)
    should have exactly 1 item each, and next M-j
    bins have 2 items each (i.e., 1 and 2 item-bins
    do not mix in the sequence of bins) at the time
    si is being introduced.
  • Suppose contrary to this there is a mix up of
    sizes and bin B_x has two items and B_y has 1
    item, for 1?xlty?M.
  • The two items from bottom in B_x, say, x1 and x2
    it must be x1 ? y1, where y1 is the only item in
    B_y
  • At the time of entering si, we must have x1, x2,
    y1 ? si, because si is picked up after all the
    three.
  • So, x1 x2 ? y1 si. Hence, if x1 and x2 can go
    in one bin, then y1 and si also can go in one
    bin. Thus, first-fit would put si in By, and not
    in the M1-th bin. This negates our assumption
    that single occupied bins could mix with doubly
    occupied bins in the sequence of bins (over the
    first M bins) at the moment M1-th bin is created.

18
OFFLINE BIN-PACKING
  • Proof of Lemma 1 continued
  • Now, in an optimal fit that needs exactly M bins
    si cannot go into first j-bins (1 item-bins),
    because if it were feasible there is no reason
    why First-fit would not do that (such a bin would
    be a 2-item bin within the 1-item bin set).
  • Similarly, if si could go into one of the next
    (M-j) bins (irrespective of any algorithm), that
    would mean redistributing 2(M-j)1 items in (M-j)
    bins. Then one of those bins would have 3 items
    in it, where each itemgt1/3 (because sigt1/3).
  • So, si cannot fit in any of those M bins by any
    algorithm, if it is gt1/3. Also note that if si
    does not go into those first j bins none of
    objects in the subsequent (M-j) bins would go
    either, i.e., you cannot redistribute all the
    objects up to si in the first M bins, or you need
    more than M bins optimally. This contradicts the
    assumption that the optimal bin is M.
  • Restating either si ? 1/3, or if si goes into
    (M1)-th bin then optimal number of bins could
    not be M.
  • In other words, all items of size gt1/3 goes into
    M or lesser number of bins, when M is the optimal
    of bins for the given set.
  • End of Proof of Lemma 1.

19
OFFLINE BIN-PACKING
  • Lemma 2 The of objects left out after M bins
    are filled (i.e., the ones that go into the extra
    bins, M1-th bin onwards) are at most M. This is
    a static picture after First Fit finished
    working
  • Proof of Lemma 2.
  • On the contrary, suppose there are M or more
    objects left.
  • Note that each of them are lt1/3 because they
    were picked up after si from the Lemma 1 proof.
  • Note, ?j1N sj ? M, since M is optimum bins,
    where N is items.
  • Say, each bin Bj of the first M bins has items of
    total weight Wj in each bin, and xk represent the
    items in the extra bins ((M1)-th bin onwards)
    x1, , xM,

20
OFFLINE BIN-PACKING
  • ? i1N si ? ? j1M Wj ? k1M xk (the first term
    sums over bins, the second term over items)
  • ? j1M (Wj xj)
  • But ? i1N si ? M,
  • or, ? j1M (Wj xj) ? ? i1N si ? M.
  • So, Wjxj ? 1
  • But, Wjxj gt 1, otherwise xj (or one of the xis)
    would go into the bin containing Wj, by First-fit
    algorithm.
  • Therefore, we have ? i1N si gt M. A
    contradiction.
  • End of Proof of Lemma 2.

21
OFFLINE BIN-PACKING
  • Theorem If M is optimum bins, then
    First-fit-offline will not take more than M
    (1/3)M bins.
  • Proof of Theorem10.4.
  • items in extra bins is ? M. They are of size ?
    1/3. So, 3 or more items per those extra bins.
  • Hence extra bins itself ? (1/3)M.
  • of non-extra (initial) bins M.
  • Total bins ? M (1/3)M
  • End of proof.

22
DIVIDE and CONQUER STRATEGY
  • Example MergeSort, QuickSort, Binary-search.
  • Recursive calls on divided parts of the input,
    combine the results from those calls, and then
    return.
  • A special case of recurrence equation becomes
    important in analysis (Master Theorem)
  • T(N) aT(N/b) ?(Nk), where a?1 and bgt1.
  • Solution T(N)
  • case 1 for agtbk, ?(N logba)
  • case 2 for a bk, ?(Nk logN)
  • case 3 for altbk, ?(Nk)
  • Proof Easy, READ IT, Weiss p 371-373

23
Example Binary Search Algorithm
  • BinSearch (array a, int start, int end, key) //
    T(N)
  • if startend // ?(1)
  • if astart key then return start else
    return failure
  • else // start ? end
  • center (startend)/2
  • if acenter lt key
  • BinSearch (a, start, center, key)
  • else BinSearch (a, center1, end, key) //
    T(n/2)
  • end if
  • End BinSearch. // T(N) 1 T(N/2)

24
DIVIDE and CONQUER STRATEGY
  • INTEGER MULTIPLICATION
  • Divide the bits/digits into two sets,
  • X XL10(size/2) XR, and
  • Y YL10(size/2) YR
  • XY XLYL10(size) (XLYR XRYL)10
    (size/2) XRYR,
  • four recursive calls to problems of sizesize/2,
    and
  • three additions of sizesize/2.
  • T(N) 4T(N/2) O(N),
  • or a4, b2, and k1,
  • or case of agtbk.
  • Solution, T(N) O(N logba) O(N2).

25
DIVIDE and CONQUER STRATEGY INTEGER MULT
  • Change the algorithm for three recursive calls!
  • XLYR XRYL (XL - XR)(YR - YL) XLYL
    XRYR
  • Now, T(N) 3T(N/2) O(N).
  • More additions, but the order (last term) does
    not change
  • Same case, but solution is,
  • T(N) O(Nlog23) O(N1.59).

26
DIVIDE and CONQUER STRATEGY
  • MATRIX MULTIPLICATION
  • Naïve multiplication O(N3)
  • Divide two square matrices into 4 parts each of
    size N/2, and
  • recursively multiply (N/2 x N/2) matrices,
  • and add resulting (N/2 x N/2) matrices,
  • then put them back to their respective places.
  • Eight recursive calls, O(N2) overhead for
    additions.
  • T(N) 8T(N/2) O(N2).
  • Solution case a gtbk T(N) O(N logba) O(N3)
  • Strassens algorithm
  • rewrite multiplication formula reducing recursive
    calls to 7 from 8.
  • T(N) 7T(n/2) O(N2)
  • Solution T(N) O(Nlog27) O(N2.81)

27
DYNAMIC PROGRAMMING STRATEGY
  • In case the divide and conquer strategy can
    divide the problem at a very small level, and
    there are repetition of some calculation over
    some components, then one can apply a bottom up
    approach calculate the smaller components first
    and then keep combining them until the highest
    level of the problem is solved.
  • Draw the recursion tree of Fibonacci-series
    calculation, you will see example of such
    repetitive calculations
  • f(n) f(n-1) f(n-2), for ngt1 and f(n)1
    otherwise
  • fib(n) calculation
  • n 1, 2, 3, 4, 5
  • fib 1, 2, 3, 5, 8 p 385

28
DYNAMIC PROGRAMMING STRATEGY (Continued)
  • Recursive fib(n)
  • if (nlt1) return 1
  • else return (fib(n-1) fib(n-2)).
  • Time complexity exponential, O(kn) for some
    kgt1.0
  • Iterative fibonacci(n)
  • fib(0) fib(1) 1
  • for i2 through n do
  • fib(i) fib(i-1) fib(i-2)
  • end for
  • return fib(n).
  • Time complexity O(n), Space complexity O(n)

29
DYNAMIC PROGRAMMING STRATEGY (Continued)
  • SpaceSaving-fibonacci(n)
  • if (nlt1) return 1
  • int last1, last2last1, result1
  • for i2 through n do
  • result last last2last
  • last2last last
  • lastresult
  • end for
  • return result.
  • Time complexity O(n), Space complexity O(1)

30
DYNAMIC PROGRAMMING STRATEGY (Continued)
  • The recursive call recalculates fib(1) 5 times,
    fib(2) 3 times, fib(3) 2 times - in fib(5)
    calculation. The complexity is exponential.
  • In iterative calculation we avoid repetition by
    storing the needed values in variables -
    complexity of order n.
  • Dynamic Programming approach consumes more memory
    to store the result of calculations of lower
    levels for the purpose of calculating the next
    higher level. They are typically stored in a
    table.

31
0-1 Knapsack Problem (not in the book)
  • Same as the Rational Knapsack Problem, except
    that you MAY NOT break any object. The Greedy
    algorithm no longer gets correct answer.
  • DP uses a table for optimal profit P(j, k)
  • for the first j objects (in any arbitrary
    ordering of objects) with a variable knapsack
    limit k.
  • Develop the table with rows j1 through N
    (objects), and
  • for each row, go from k0 through M (the KS
    limit).
  • Finally P(N, M) gets the result.

32
0-1 Knapsack Problem (not in the book)
  • The formula for
  • P(j, k) (case 1) P(j-1, k),
  • if wjgtk, wj weight of the j-th object
  • else
  • P(j, k) (case2) max-betweenP(j-1, k-wj)pj,
  • P(j-1, k)
  • Explanation for the formula is quite intuitive.

33
0-1 knapsack recursive algorithm
  • Algorithm P(j, k)
  • If j 0 or k 0 then return 0 //recursion
    termination
  • else
  • if Wjgtk then
  • return P(j-1, k)
  • else
  • return maxP(j-1, k-Wj) pj, P(j-1, k)
  • End algorithm.
  • Driver call P(n, M), for given n objects KS
    limit M.
  • Complexity ?

34
0-1 Knapsack DP-algorithm
  • For all j, k, P(0, k) P(j, 0) 0 //initialize
  • For j1 to n do
  • For k 1 to m do
  • if Wjgtk then
  • P(j, k) P(j-1, k)
  • else
  • P(j, k) maxP(j-1, k-Wj) pj, P(j-1, k)
  • Complexity O(NM), pseudo-polynomial because M is
    an input value. If M30.5 the table would be of
    size O(10NM), or if M30.54 it would be O(100NM).

35
0-1 Knapsack Problem (Example)
  • Objects (wt, p) (2, 1), (3, 2), (5, 3). M8
  • J 0 1 2 3 4 5 6 7 8
  • O -- -- -- -- -- -- -- -- --
  • 1 0 0 1 1 1 1 1 1 1
  • 2 0 0 1 2 2 3 3 3 3
  • 3 0 0 1 2 2 3 3 4 5
  • HOW TO FIND KS CONTENT FROM TABLE? SPACE
    COMPLEXITY?

36
Ordering of Matrix-chain Multiplications
  • ABCD a chain of matrices to be multiplied, where
    A (5x1), B(1x4), C(4x3), and D(3x6). Resulting
    matrix would be of size (5x6).
  • scalar or integer multiplications for (BC), is
    1.4.3, and the resulting matrixs dimension is
    (1x3), 1 column and 3 rows.

37
Ordering of Matrix-chain Multiplications
  • There are multiple ways to order the
    multiplication (A(BC))D, A(B(CD)), (AB)(CD),
    ((AB)C)D, and A((BC)D). Resulting matrix would
    be the same but the efficiency of calculation
    would vary drastically.
  • Efficiency depends on scalar multiplications. In
    the case of (A(BC))D, it is 1.4.3 5.1.3
    5.3.6 117. In the case (AB)CD), it is 4.3.6
    1.4.6 5.1.6 126.
  • Our problem here is to find the best such
    ordering. An exhaustive search is too expensive -
    Catalan number involving n!

38
Ordering of Matrix-chain Multiplications continued
  • For a sequence A1...(Aleft....Aright)...An, we
    want to find optimal break point for the
    parenthesized sequence.
  • Calculate for ( right-left1) number of cases and
    find minimum min(Aleft...Ai) (Ai1... Aright),
    with left ? i lt right,
  • Or,
  • M(left, right)
  • minM(left, i) M(i1, right) rowleft
    .coli .colright,
  • with left ? i lt right.
  • Start the calculation at the lowest level with
    two matrices, AB, BC, CD etc. Then calculate for
    triplets, ABC, BCD etc. And so on..

39
Matrix-chain Recursive algorithm
  • Recursive Algorithm M(left, right)
  • if left gt right return 0
  • else
  • return minM(left, i) M(i1, right)
    rowleft .coli .colright,
  • for left?iltright
  • end algorithm.
  • Driver call M(1, n).

40
Matrix-chain DP-algorithm
  • for all 1? jlti ? n do Mij 0 //lower
    triangle 0
  • for size 1 to n do // size of subsequence
  • for l 1 to n-size1 do
  • r leftsize-1 //move along diagonal
  • Mlr infinity //minimizer
  • for i l to r do
  • x M(l, i) M(i1, r)
  • rowleft .coli .colright
  • if x lt min then M(l, r) x
  • // Complexity?

41
Ordering of Matrix-chain Multiplications (Example)
  • A1 (5x3), A2 (3x1), A3 (1x4), A4 (4x6).
  • c(1,1) c(2, 2) c(3, 3) c(4, 4) 0
  • c(1, 2) c(1,1) c(2,2) 5.3.1 0 0 15.
  • c(1,3) minI1 c(1,1)c(2,3)5.3.4, I2
    c(1,2)c(3,3)5.1.4
  • min72, 35 35(2)
  • c(1,4) min I1 c(1,1)c(2,4)5.3.6, I2
    c(1,2)c(3,4)5.1.6,
  • I3 c(1,3)c(4,4)5.4.6 min132, 69, 155
    69(2)
  • 69 comes from the break-point i2
    (A1.A2)(A3.A4)
  • You may need to recursively break the sub-parts
    too, by looking at which value of i gave the min
    value at that stage, e.g., for (1,3) it was i2
    (A1.A2)A3

42
Ordering of Matrix-chain Multiplications (Example)
  • j 1 2 3 4
  • i
  • 1 0 15 35(2) 69(2)
  • 2 0 12 42
  • 3 0 24
  • 4 0
  • Calculation goes diagonally.

Triplets
Pairs
43
Computing the actual break points
j 1 2 3 4 5 6 i 1 - - (2) (2) (3) (3) 2
- - (3) (3) (4) 3 - - (4) (4) 4 - (3) (4
) 5 - - - 6 - - - ABCDEF -gt (ABC)(DEF) -gt
((AB)C) (D(EF))
44
Ordering of Matrix-chain Multiplications (DP
Algorithm)
  • Algorithm optMult returns cost matrix m and
    tracking matrix lastChange.
  • (1) for (left1 through n) do costleftleft
    0
  • (2) for k2 through n do // k is the size of the
    sub-chain
  • for left1 through (n-k1) do
  • rightleftk-1
  • (5) costleftright infinity
  • //break is between I and I1
  • for Ileft through (right-1) do
  • (7) cstcostleftIcostI1right
  • rowleftcolIcolright
  • (8) if (cstltcostleftright) then
  • (9) costleftright cst
  • (10) lastChangeleftrightI

45
Optimal Binary Search Tree
  • REMIND BINARY-SEARCH TREE for efficiently
    accessing an element in an ordered list
  • DRAW MULTIPLE BIN-SEARCH TREES FOR SAME SET,
    EXPLAIN ACCESS STEP FOR NODES.
  • Note the access step for each node is its
    distance from the root plus one. So, it is
    relative to the sub-tree in which the node is in.
  • Assume some measure of frequency of access is
    associated with data at each node. Then different
    tree organization will have different aggregate
    access steps over all nodes, because the depths
    of the nodes are different on different tree.
  • EXPLAIN DIFFERENT AGGREGATE COST FOR DIFFERENT
    TREE.
  • Problem We want to find the optimal aggregate
    access steps, and a bin-search tree organization
    that produces it.

46
Optimal Binary Search Tree (Continued)
  • Say, the optimal cost for a sub-tree is C(left,
    right).
  • Note, when this sub-tree is at depth one
  • (under a parent,
  • within another higher level sub-tree)
  • then EACH nodes access step will increase by 1
    in the higher level subtree.

47
a1 a2 aleft ak aright an
fk
C(k1, right)
C(left, k-1)
48
Optimal Binary Search Tree (Continued)
  • If the i-th node is at the root for this
    sub-tree, then
  • C(left, right)
  • minleft ? i ? right f(i) C(left, i-1)
    C(i1, right)
  • åjlti-1 f(j) åji1rt f(j) additional
    costs
  • minleft ? i ? right C(left, i-1) C(i1,
    right)
  • åjltrt f(j)
  • We will use the above formula to compute C(1, n).
  • We will start from one element sub-tree and
    finish with n element full tree.

49
Optimal Binary Search Tree (Continued)
  • Like matrix-chain multiplication-ordering we will
    develop a triangular part of the cost matrix
    (rtgt lt), and
  • we will develop it diagonally (rt lt size)
  • Note that our boundary condition is different
    now,
  • c(left, right)0 if leftgtright (meaningless
    cost)
  • We start from, leftright single node trees
  • (not with leftright-1 pairs of matrices, as
    in matrix-chain-mult case).
  • Also, i now goes from left through right, and
  • i is excluded from both the subtrees Cs.

50
Optimal Binary Search Tree (Example)
  • Keys A1(7), A2(10), A3(5), A4(8), and A5(4).
  • C(1, 1)7, C(2, 2)10, .
  • C(1,2)mink1 C(1,0)C(2,2)f1f201017,
  • k2 C(1,1) C(3,2) f1f2 7 0 17
    min27, 24 24 (i2)
  • j 1 2 3 4 5
  • i
  • 1 7 24(2) 34 55 67
  • 2 0 10 20 41 51
  • 3 0 5 18 26
  • 4 0 8 16
  • 5 0 4
  • This algorithm O(n3). However, O(n2) is
    feasible.

51
All Pairs Shortest Path
  • A variation of Djikstras algorithm. Called
    Floyd-Warshals algorithm. Good for dense graph.
  • Algorithm Floyd
  • Copy the distance matrix in d1..n1..n
  • for k1 through n do //considr each vertex as
    updating candidate
  • for i1 through n do
  • for j1 through n do
  • if (dik dkj lt dij) then
  • dij dik dkj
  • pathij k // last updated via k
  • End algorithm.
  • O(n3), for 3 loops.

52
Randomized Algorithms
  • QuickSort with randomly picked up pivot is an
    example.
  • The internal working of the algorithm is never
    same even for the same input. Randomized
    algorithms may have different output for same
    input.
  • Sensitive to the embedded random number
    generating algorithms. Computers work with
    pseudo-random number generators, dependent on the
    key, fed at the initiation of a generator.
    Typically clock digits (least significant ones)
    are used as a key.
  • There are powerful randomized algorithms that can
    non-deterministically solve some problems more
    efficiently. Examples Genetic Algorithms, or
    GSAT. Often these algorithms do not guarantee any
    solution, or are incomplete algorithms!

53
Random Number Generators
  • Random numbers are generated in a sequence, by a
    called routine.
  • The sequence is supposed to be aperiodic - never
    repeats.
  • A typical method x(i1) Ax(i) mod M, where M
    is a prime number and A is an integer.
  • x(i) should never become 0 (WHY?), hence mod is
    taken with a prime number M. With the proper
    choice of A, the sequence will repeat itself
    after M-1 elements in the sequence (bad choice of
    A causes a shorter period) hence, they are
    called pseudo-random numbers.
  • Example, M11, A7, and x(0)1 (the seed), the
    sequence is 7,5,2,3,10,4,6,9,8,1, 7,5,2,....
  • A large prime number should be used to avoid
    repetition.
  • The seed should not be zero. It could be the
    least significant digits of the clock.

54
Genetic Algorithms
Function Genetic-Algorithm (population,
fitness-fn) returns an individual
Russell-Norvig text of AI, 2003, p
116-119 Input (1) population a set of
individuals, (2) fitness-fn a fn that measures
the goodness of individuals toward the
solution Repeat parents lt- SELECTION(population,
fitness-fn) population lt- REPRODUCTION(parents) U
ntil some individual is fit enough Return the
best individual in population, according to
fitness-fn.
55
Genetic Algorithms
56
GSAT Algorithm
  • Boolean satisfiability (SAT) problem Given a set
    of Boolean variables Vv1, v2, v3,, and a CNF
    propositional formula C (C1 C2 ), each
    clause Ci being constituted as a disjunction of
    literals (each of which is a variable or the
    negation of a variable), check if there exists
    an assignment of the variables such that the
    formula C is True.
  • GSAT Start with a random assignment for
    variables
  • repeat until C is True or a fixed number of steps
    are done
  • flip the value of a variable in a clause
    that is False
  • May restart a few number of times. End
    algorithm.
  • Many variations of GSAT exist. Does not
    guarantee solution.

57
Backtracking Algorithms
  • Input an empty array of size n (or 4) and a
    starting level index
  • Output Prints A
  • Algorithm Unknown(level, array A)
  • if (level 4) then print A
  • else
  • Alevel 0
  • Unknown(level1, A)
  • Alevel 1
  • Unknown(level1, A)
  • End algorithm.
  • Driver Unknown(1, A1..3)
  • What does the algorithm do?

58
Backtracking Algorithms
  • Input an empty array of size n (or 4) and a
    starting level index (1)
  • Output Prints A
  • Algorithm Unknown(level, array A)
  • if (level 4) then print A
  • else
  • Alevel 0
  • Unknown(level1, A)
  • Alevel 1
  • Unknown(level1, A)
  • End algorithm.
  • The algorithm prints (000),(001),(010),(011),...,
    when called as Unknown(1, A).
  • Draw recursion tree this is an exhaustive
    Backtracking (BT) algorithm.

59
BT 0-1 Knapsack Problem
  • Input Set of n objects (wi, pi), and
    knapsack limitM
  • Output Total profits for all possible subsets
  • Algorithm BTKS(level, array A)
  • if (level n1) then
  • total_profit ?i(AiPi)
  • else
  • Alevel 0 // level-th object excluded from
    the bag
  • BTKS(level1, A)
  • Alevel 1 // the object is included
  • BTKS(level1, A)
  • End algorithm.
  • Driver Global optP0 Global optAnull BTKS(1,
    A)

60
BT 0-1 Knapsack Problem
  • Input Set of n objects (wi, pi), and
    knapsack limitM
  • Output Optimum profit over all possible subsets
    weighing ltM
  • Algorithm BTKS(level, array A)
  • if (level n1) then
  • total_profit ?i(AiPi)
  • if total_profit is gt optP then // optimizing
  • optP total_profit
  • optA A
  • else
  • Alevel 0 // level-th object excluded from
    the bag
  • BTKS(level1, A)
  • Alevel 1 // the object is included
  • if (total_wt ? M) then // constraint checking
  • BTKS(level1, A)
  • End algorithm.
  • Driver Global var optP0 Global var optAnull
    BTKS(1, A)

61
BT with further pruning 0-1 Knapsack Problem
Algorithm BTKS_Prun(level, array A) // global
optP and global optX if (level n1)
then ltltsame as beforegtgt else B?i1level-1
(AiPi) RKS(A, level-1, M- ? i1level-1
(AiWi)) if (BgtoptP) then // we have a
chance to make a better profit Alevel 1 if
(total_wt ? M) then BTKS_Prun(level1, A) if
(BgtoptP) then //optP may have increased
now Alevel 0 BTKS_Prun(level1, A) End
algorithm. RKS is the Rational knapsack greedy
algorithm, used for a bounding function here.
62
Bounding function for pruning
  • BF is a guessing function so that we can prune
    branches from unnecessarily expanding
  • Ideal value is the real, at every level
    (impossible to have such a BF)
  • Should evaluate to ? real value for
    maximization-problems, or ? real value for
    minimization-problems (correctness criterion)
  • Should be as close to the real value as possible
    (goodness, for pruning branches of the recursion
    tree)
  • Should be easy to calculate (calculation
    overhead on each step)

63
Turnpike Reconstruction Problem
  • Given N exits (on a turnpike), and MN(N-1)/2
    number of distances between each pair of them,
    put the exits on a line with the first one being
    at zero.
  • Input D1,2,2,2,3,3,3,4,5,5,5,6,7,8,10, the
    sorted gap values
  • Here, MD15, so, calculated N6, obviously.
    DRAW TREE, UPDATE D, page 407.
  • x10 given, so, x610, the largest in D.
    D1,2,2,2,3,3,3,4,5,5,5,6,7,8 now.
  • Largest now is 8. So, either x22, or x58. WHY?
  • In case x22, you can inverse direction and make
    x2 as x5, and then x5 would be 8 (prune branch
    for symmetry).
  • So, x58 is a unique choice. x6-x52, and x5-x18
    are taken off D.
  • 7 is now largest. So, either x47, or x23. We
    have two branches now, for these two options.

64
Turnpike Reconstruction Problem
  • For x47, we should have in D x5-x41, and
    x6-x43. Take them off.
  • Next largest is 6. So, either x36 or x24 (so
    that x6-x26).
  • For x36, x4-x31. There is no more 1 in D, so,
    this is impossible.
  • Backtrack to x24 (instead of x36), x2-x04 and
    x5-x24. We don't have those many 4's left. So,
    this is impossible also.
  • So, we backtrack past the choice x47, because it
    has no solution. Rather, we reassign x23 (Note.
    x4 is no longer assigned backtracked).

65
Turnpike Reconstruction Problem
  • For x23, we take off 3,5,7 from D. Thus,
    D1,2,2,3,3,4,5,5,6
  • Now, either x46 or x34.
  • For x34, we need two 4s for x3-x0, and x5-x3.
  • BT to x46, D1,2,3,5,5.
  • There is only one choice left now, i.e., x35,
    which fits all the values in D.
  • The algorithm terminates here.
  • Note that if backtracking leads up to the first
    choice and fails even there, then there is no
    solution (inconsistent) for the problem!
  • READ ALGORITHM on p443-4.
Write a Comment
User Comments (0)
About PowerShow.com