Checking Specific Algorithms - PowerPoint PPT Presentation

1 / 129
About This Presentation
Title:

Checking Specific Algorithms

Description:

Checking Specific Algorithms Greg Bronevetsky Objective Primary goal: find checkers that can be applied to any program Along the way must look at checkers for ... – PowerPoint PPT presentation

Number of Views:149
Avg rating:3.0/5.0
Slides: 130
Provided by: gregBrone
Category:

less

Transcript and Presenter's Notes

Title: Checking Specific Algorithms


1
Checking Specific Algorithms
  • Greg Bronevetsky

2
Objective
  • Primary goal find checkers that can be applied
    to any program
  • Along the way must look at checkers for specific
    algorithms
  • Each checker exploits special properties of
    target algorithm
  • General patterns may become apparent
  • Useful for toolkit-style checker generator
  • Library of algorithm-specific checkers used where
    appropriate

3
Outline
  • Low-hanging fruit
  • Himanshu Gupta. "Result Verification Algorithms
    for Optimization Problems", 1995
  • Checkers for optimizations programs
  • Existence of checkers for certain complexity
    classes
  • Certification Trails
  • Gregory F. Sullivan, Dwight S. Wilson and Gerald
    M. Masson, Certification of Computational
    Results, 1995.
  • Fundamentals of program testing correcting
  • Manuel Blum, Michael Luby and Ronitt Rubinfeld.
    "Self-Testing Correcting with Applications to
    Numerical Problems"

4
Low-Hanging Fruit Outline
  • Gregs Theorem
  • Himanshu Gupta
  • Checkers for optimization problems (primal-dual)
  • Max Flow
  • Min-cost Flow
  • Unweighted Weighted bipartite matching
  • All Shortest Paths
  • Existence proofs for verifiers
  • Verifiers for NP-complete and -hard languages
  • Approximate verifiers
  • Verifiers with Certification Trails

5
Intro to Complexity
  • Problems defined as sets of strings
  • Particular set language
  • Example language set of palindromes
  • 001100, 01011010, etc.
  • Membership in given language decidable by
    computing device of given power
  • ex Turing machine running in polynomial time
  • Machine A on input x
  • Runs for time poly(x)
  • Returns ACCEPT x?L(A) REJECT
    x?L(A)

6
Intro to Complexity
  • Complexity class set of languages accepted by
    machines of given power
  • Examples
  • P Turing machines with running time ?
    O(poly(x))
  • EXPSPACE Turing machines that use ? O(2x)
    memory
  • BC Boolean circuits of bounded depth
  • Big deal can show some problems inherently
    harder than others

7
Basics of Reductions
  • Problem A reducible to problem B if can easily
    solve A given algorithm for B
  • Reduction
  • Given x input to A
  • Efficiently transform x to y input to B
  • Efficiently usually means poly-time
  • Ask algorithm for B if y valid B-string
  • B is an oracle
  • x valid A-string ? y valid B-string
  • Thus, B at least as hard as A

8
Basics of Reductions
  • Can compare difficulties of problems via
    reductions
  • If all problems in class C1 reducible to problems
    in class C2 then C1 ? C2
  • Example
  • P ? NP
  • Unknown if NP ? P
  • If yes, then PNP
  • If no, then some problems in NP cannot be solved
    in polynomial time

9
Theorem 1
  • Let A be an implementation of an algorithm
  • Runs in time O(p()), p some polynomial
  • Does there exist checker C for A that runs in
    time lt p()?
  • C accepts As input output, returns PASS/FAIL
  • C knows As source code
  • Runs in time O(q()) lt O(p()), asymptotically
    faster
  • A is decision algorithm, so output is 1 bit

Input
PASS, FAIL
A
C
0,1
10
Theorem 1
  • As input x n bits
  • As output y 1 bit
  • Suppose C can verify whether x?L(A) using y
  • In O(q(x)) time
  • Then can decide L(A) in O(q(x)) time
  • Given input x
  • Run C on ltx,1gt
  • If C(x,1) returns PASS, ACCEPT x
  • If C(x,1) returns FAIL, REJECT x

11
Theorem 1
  • A reduced to C
  • C runs in O(q()) time
  • Thus, p()-time problems solvable in q()-time
  • This is not true
  • Therefore, for p()-time decision algorithm A ??
    checker C running in ltO(p()) time

12
Theorem 1.1
  • Lets make job simpler for checker
  • Helper proof
  • Suppose A outputs helper proof
  • C can use helper proof to simplify checking
  • Size of helper proof ? q(x)
  • Since C can only read ? q(x) bits of proof

0,1
Input
PASS, FAIL
C
A
HelperProof
13
Theorem 1.1
  • As input x n bits
  • As output y 1 bit q(x)-bit proof
  • Suppose C can verify whether x?L(A) using y
  • In O(q(x)) time
  • Then y is witness for x?L(A)

14
Nondeterministic Time Classes
  • Non-deterministic automaton for language L
  • Given input x
  • Runs for O(f(x)) amount of time
  • Can make binary guesses 0,1
  • If ? set of guesses s.t. automaton would accept,
    then x?L
  • At each guess, automaton splits
  • 0 branch, 1 branch
  • If accepts along some path then whole automaton
    accepts

0
1
0
1
0
1
15
Nondeterministic Time Classes
  • String of guesses leading to ACCEPT called a
    witness
  • Proves that x?L
  • Can deterministically verify x?L
  • No need to guess witness gives correct guesses
  • NP runs for polynomial time witnesses ?
    poly-length
  • Nondet(f()) runs in time ? O(f(x))
    witness length ? O(f(x))

16
Theorem 1.1
  • As input x n bits
  • As output y 1 bit q(x)-bit proof
  • Suppose C can verify whether x?L(A) using y
  • In O(q(x)) time
  • Then y is witness for x?L(A)
  • Thus, x?L(A) decidable by nondet automaton
    running in O(q(x)) time

17
Theorem 1.1
  • Guess all bits of proof
  • Takes ?q(x) guesses
  • For each proof, run C on ylt1, proofgt
  • Accept if C says PASS on some ylt1, some
    proofgt

Guess y
0
1
0
1
1
0
C
C
C
C
18
Theorem 1.1
  • x?L(A) decidable by nondet automaton running in
    O(q(x)) time
  • Thus, L(A) ? Nondet(q())
  • Recall A runs in time O(p())gtO(q())
  • Det(p()) ? Nondet(q()) ? Nondet(p()) ? ?
    Det(p()) ? Nondet(p())
  • Shown If L(A) ? Det(p()) and ? proof-checker
    that runs in time O(q()) where q()ltp() then
    Det(p()) ? Nondet(p())

19
Theorem 1.1
  • Shown If L(A) ? Det(p()) and ? checker that runs
    in time O(q()) where q()ltp() then Det(p()) ?
    Nondet(p())
  • Known Det(p()) ? Nondet(p()) for few functions
  • In general, as hard as P ? NP
  • Worst case scenario
  • If discover efficient checkers, then Det(p()) ?
    Nondet(p())
  • Very difficult
  • If Det(p()) ? Nondet(p()) proven, no help to us

20
Low-Hanging Fruit Outline
  • Gregs Theorem
  • Himanshu Gupta
  • Checkers for optimization problems (primal-dual)
  • Max Flow
  • Min-cost Flow
  • Unweighted Weighted bipartite matching
  • All Shortest Paths
  • Existence proofs for verifiers
  • Verifiers for NP-complete and NP-hard languages
  • Approximate verifiers

21
Definitions
  • Problem ??I?S
  • I set of inputs
  • S set of solutions
  • Algorithm on input x?I, returns y?S s.t. ltx,ygt??
  • Verification problemV(?)?(I?S)?PASS,FAIL
  • lt(x,y),PASSgt?V(?) iff ltx,ygt??
  • lt(x,y),FAILgt?V(?) iff ltx,ygt??

22
Maximum Flow
  • Given
  • Directed graph G(V,E)
  • Edges labeled with capacities
  • c(u,v) capacity of edge
  • Two special nodes S and T
  • Must find maximum flow from s to t

T
c8
c5
c4
c1
c9
c2
c6
S
c7
c3
23
Maximum Flow
  • Find edge labeling f
  • f(u,v) flow on edge u?v ? c(u,v)
  • Sum of flows coming into node Sum of flows
    coming out of node
  • Outflow of node s gt inflow
  • Inflow of node t gt outflow
  • Net outflow of s Net inflow of t Network
    flow
  • Problem Find labeling f generating largest
    network flow

24
Residual Capacities
  • Residual Capacity for flow fres(u,v) c(u,v)
    f(u,v)
  • (Capacity of edge unused by flow)
  • Residual Graph Rf graph on V,E where
  • Augmenting Path Path through Rf

25
Augmenting Paths ? Verifier
  • Theorem Rf contains augmenting path ? f is not
    maximum flow
  • Example one common max flow algorithm
  • Pick a flowWhile can find augmenting paths
  • Adjust flow to remove augmenting path
  • Thus, to check if given flow is max flow
  • Ensure given flow valid
  • Construct Rf
  • Ensure that no augmenting path exists
  • Path search much faster than max-flow algorithm

26
Min-Cost Flow
  • Now edges have cost
  • Goal find flow of given size with minimal cost
  • For any potential flow construct Residual Graph
    Rf
  • Theorem given flow is min-cost if Rf has no
    negative-cost cycles
  • Thus, checker
  • Ensures alleged min-cost flow is valid flow
  • Constructs Rf
  • Ensures that no negative-cost cycles exist
  • Cycle search much faster than min-cost flow
    algorithm

27
Matching
  • Given bipartite weighted/unweighted graph find
    subset of edges s.t.
  • No left node has gt1 right node as neighbor
  • No right node has gt1 left node as neighbor

X
Y
28
Verifier for Matching
  • Matching also has augmenting paths
  • Can verify matching
  • Ensure that matching valid
  • Search of augmenting path
  • Above problems examples of matroids
  • Intuitively problems solvable via basic greedy
    algorithm
  • All matroids have this structure
  • Thus, can easily check matroid problems

29
Other Matroid Problems
  • Minimal Spanning Tree
  • Minimal Cut in Graph
  • Maximal Basis in Linear Space

30
Low-Hanging Fruit Outline
  • Gregs Theorem
  • Himanshu Gupta
  • Checkers for optimization problems (primal-dual)
  • Max Flow
  • Min-cost Flow
  • Unweighted Weighted bipartite matching
  • All Shortest Paths
  • Existence proofs for verifiers
  • Verifiers for NP-complete and NP-hard languages
  • Approximate verifiers

31
All Shortest Paths
  • Given undirected graph G(V,E)
  • Each edge (i,j) has weight w(i,j)
  • Problem return the shortest path between every
    pair of nodes
  • Assume output format
  • P predecessor matrix
  • Pi,j k s.t. edge (k,j) lies on shortest path
    i?j
  • (i.e. node k precedes node j on path i?j)
  • D distance matrix
  • Di,j shortest distance from i to j
  • (Di,i 0)

Pi,j
j
i
32
Checker Algorithm
  • Step 1 ensure that P and D self-consistent
  • foreach (i,j)
  • Ensure that Di,j Di, Pi,j
    w(Pi,j, j)
  • If error seen, return FAIL
  • Step 2 ensure output paths truly shortest
  • foreach node i
  • foreach edge (u,v)
  • if(Di,u w(u,v) lt Di,v) return FAIL
  • Return PASS

33
Checking Self-Consistency
  • foreach (i,j)
  • Ensure that Di,j Di, Pi,j w(Pi,j,
    j)
  • Clearly, if output correct, test returns PASS

Pi,j
j
i
Di, Pi,j
w(Pi,j, j)
Di,j
34
Checking Self-Consistency
  • foreach (i,j)
  • Ensure that Di,j Di, Pi,j w(Pi,j,
    j)
  • If output incorrect, two possible problems
  • P may not represent valid paths
  • D may not be true lengths of P s paths

Pi,j
j
i
Di, Pi,j
w(Pi,j, j)
Di,j
35
Checking D
  • Suppose Di,j ? i?j distance according to P
  • Pick pair i,j with shortest true distance
  • Let k Pi,j
  • Check Di,j ? Di, k w(k, j)
  • No errors in D for pairs closer than i and j
  • Thus, Di, k correct distance i?js predecessor
  • So, Di, k w(k, j) is correct i?j distance
  • Thus, will detect error

Pi,j
j
k
i
36
Checking P
  • Pi, must induce spanning tree on nodes
  • Thm Graph on n nodes is spanning tree ?
    n-1 edges and acyclic
  • Clearly, n-1 edges since each node j?i has Pi,j
  • Above check will fail if cycle exists

37
Checking P Acyclicity
  • Claim if cycle exists then D has error

i
Di,v
Di,u
u
v
path
  • Suppose cycle exists
  • Suppose all edges of u-v cycle have been checked
    (and passed), u?v is last

38
Checking P Acyclicity
i
Di,v
Di,u
foreach (i,j) Di,j Di, Pi,j
w(Pi,j, j)
u
v
path
  • v?u path passed, so Di,u Di,v w(path)
  • In order for u?v to pass Di,v Di,u
    w(u,v)
  • But then, Di,v w(u,v) (Di,v w(path))
  • Contradiction!
  • Thus, cycle exists ? error in D detected

39
Checker Algorithm
  • Step 1 ensure that P and D self-consistent
  • foreach (i,j)
  • Ensure that Di,j Di, Pi,j
    w(Pi,j, j)
  • If error seen, return FAIL
  • Step 2 ensure output paths truly shortest
  • foreach node i
  • foreach edge (u,v)
  • if(Di,u w(u,v) lt Di,v return FAIL
  • Return PASS

?
40
Checking Shortest Paths
  • Test ? nodes i, edges (u,v) Di,u w(u,v) ??
    Di,v (FAIL if lt)
  • Test checks if path through u shorter than
    official path
  • If output correct
  • If u on shortest path, Di,u w(u,v) Di,v
  • If u not on shortest path, Di,u w(u,v) ?
    Di,v
  • Thus, on correct output PASS returned

v
i
u
41
Checking Shortest Paths
  • Suppose output incorrect Di,j not length of
    shortest i?j path
  • Step 1 ensures Di,j true distance of path i?j
    according to P
  • Let
  • D? distance function for truly shortest
    solution
  • P? predecessor function for shortest
    solution

42
Checking Shortest Paths
  • Pick i,j s.t.
  • Di,j ? D?i,j
  • No other erroneous pair closer to each other (by
    P? )
  • Thus, ?u?Pi,j s.t. D?i,u w(u,j) lt Di,j
  • Path thru u shorter than path thru Pi,j
  • If shortest path is thru Pi,j, then Di,
    Pi,j is wrong
  • Big Question Will test detect this?

u
j
Shorter Path
i
Wrong Path
Pi,j
43
Checking Shortest Paths
  • Thus, ?u?Pi,j s.t. D?i,u w(u,j) lt Di,j
  • For each node i, test looks at all edges
  • When looks at u?j
  • Evaluates Di,u w(u,j) ?? Di,j
  • Di,u D?i,u
  • Since i,j closest erroneous pair
  • Thus, Di,u w(u,j) D?i,u w(u,j) lt Di,j

u
j
Shorter Path
i
Wrong Path
Pi,j
44
Checking Shortest Paths
  • For each node i, test looks at all edges
  • When looks at u
  • Evaluates Di,u w(u,v) ?? Di,v
  • Di,u D?i,u
  • Since i,j closest erroneous pair
  • Thus, Di,u w(u,v) D?i,u w(u,v) lt Di,v
  • Test returns FAIL
  • Shown if output not shortest paths ?
    test returns FAIL

45
Summary of Checkers
  • Shown checkers for
  • Matroids
  • All-shortest paths
  • Checkers very specific to target algorithm
  • In general
  • Find invariant of output
  • Ensure it is true

46
Low-Hanging Fruit Outline
  • Gregs Theorem
  • Himanshu Gupta
  • Checkers for optimization problems (primal-dual)
  • Max Flow
  • Min-cost Flow
  • Unweighted Weighted bipartite matching
  • All Shortest Paths
  • Existence proofs for verifiers
  • Verifiers for NP-complete and NP-hard languages
  • Approximate verifiers

47
Verifiers for NP-Complete Problems
  • Given NP-complete decision problem ??I?0,1
  • ?s verification problem V(?)?I?0,1 ?P,F
  • ? reducible to V(?)

48
Verifiers for NP-Complete Problems
  • Given NP-complete decision problem ??I?0,1
  • ?s verification problem V(?)?I?0,1 ?P,F
  • ? reducible to V(?)
  • On input x, check if ltx,Pgt?V(?)
  • Mapping computable in poly-time
  • Thus, V(?) NP-complete
  • Proven if ? NP-complete then so is V(?)

49
Verifiers for Some NP-Hard Problems
  • Given NP-Hard problem H?I??
  • On input x, returns y s.t.
  • yltO(f(x))
  • f(x) ? O(poly(x))
  • f() poly-time computable
  • Verification problem V(H)?I???P,F

50
Verifiers for Some NP-Hard Problems
  • Reduction from H to V(H)
  • Given x, compute f(x)
  • Try all possible values for y?0 f(x)
  • If ltltx,ygt, Pgt?V(H) then ACCEPT
  • If not accepts, REJECT
  • Reduction is poly-time
  • Thus, V(H) is NP-Hard (unless PNP)
  • Examples min vertex cover, max clique, chromatic
    number, max cycle length, etc.

51
Low-Hanging Fruit Outline
  • Gregs Theorem
  • Himanshu Gupta
  • Checkers for optimization problems (primal-dual)
  • Max Flow
  • Min-cost Flow
  • Unweighted Weighted bipartite matching
  • All Shortest Paths
  • Existence proofs for verifiers
  • Verifiers for NP-complete and NP-hard languages
  • Approximate verifiers

52
Maximization Problems
  • Given NP-Hard Maximization problem H?I??
  • Output size of maximum solution to problem
  • For each H ? NP-Complete Decision problem HD
  • Input ltx,ygt
  • ACCEPT iff maximal solution to problem is ?y
  • Can use binary search to find solve H using log
    x calls to HD

53
Approximate Algorithms
  • Given NP-Hard Maximization problem H?I??
  • AH is (a,c) approximation algorithm for H if
  • On input x, outputs y
  • y ? max ? a?y c
  • max is maximal solution of H
  • Runs in poly time
  • (a,c) approximation verifier AVH
  • Input ltx,ygt
  • Returns PASS iff y ? max ? a?y c

54
Approximation Verifiers NP
  • Thm If poly-time (a,c) approximation algorithm
    AH exists for NP-Hard maximization problem H,
    Then approximation verifier AVH is NP-Complete
  • Proof Reduce sibling decision problem HD to AVH
  • Calls AH
  • Uses AVH as oracle

55
Reducing HD
  • Input ltx,ygt
  • ACCEPT iff y?max
  • max size of maximal solution to input x
  • Step 1 call AH
  • Get m?AHs approximation of max
  • Step 2 work
  • If y ? m?, ACCEPT
  • If y gt m?
  • If y gt a?m? c, REJECT
  • Else, use AVH Oracle to see if ltx,ygt?L(AVH)
  • If ltx,ygt?L(AVH), ACCEPT
  • Else, REJECT

56
Reducing HD
  • Given input ltx,ygt to HD
  • Decided whether y?max by calling
  • AH approximation algorithm
  • AVH approximation verifier
  • AH runs in poly time
  • Reduction is poly-time, so can run AH as part of
    reduction
  • ltx,ygt transformed to input to AVH
  • Thus, AVH at least as hard as HD

57
Proven
  • NP-Hard Maximization problem AND
  • Poly-time approximation algorithm for it
  • Approximation verifier for approximation
    algorithm is NP-Complete

58
Algorithm-specific Verifiers
  • If poly-time approximation algorithm exists,
    trivial approximation verifier
  • On input ltx,ygt
  • Run fault-free version of approximation algorithm
    on x
  • See if returns y
  • Verifier still poly-time
  • Only works for specific approximation algorithm
  • Wont work for different approximators of same
    problem

59
Low-Hanging Fruit Summary
  • Covered several basic theorems
  • Give idea of what is possible/impossible
  • Fairly basic results
  • Equivalent to homework in graduate theory class

60
Outline
  • Low-hanging fruit
  • Himanshu Gupta. "Result Verification Algorithms
    for Optimization Problems", 1995
  • Checkers for optimizations programs
  • Existence of checkers for certain complexity
    classes
  • Certification Trails
  • Gregory F. Sullivan, Dwight S. Wilson and Gerald
    M. Masson, Certification of Computational
    Results, 1995.
  • Fundamentals of program testing correcting
  • Manuel Blum, Michael Luby and Ronitt Rubinfeld.
    "Self-Testing Correcting with Applications to
    Numerical Problems"

61
Certification Trails
  • Algorithm runs on input
  • Produces
  • Regular output
  • Certification Trail short proof that output
    matches input
  • Certifier runs on ltinput, trailgt
  • Produces same output or return FAIL
  • Additional info from trail speeds recomputation
  • If trail wrong, may still output correctly
  • Certifiers presented here will just FAIL

62
Basic Definitions
  • D set of inputs
  • S set of valid outputs
  • T set of certification trails
  • Original program
  • P D ? S?T
  • Accepts input, returns ltoutput, trailgt
  • Certifier
  • C D?T ? S?FAIL
  • Accepts ltinput, trailgt, returns output or FAIL

63
Focus of Certification Trails
  • Paper comes from Software Engineering background
  • Thus, focus on detecting programmer errors
  • Argue that P and C different algorithms
  • Thus, implementations different
  • Low probability of same errors in P and C
  • Hardware faults also mentioned

64
Certification Trails Outline
  • Paper presents checkers for several common
    algorithms
  • Sorting
  • Convex Hull
  • Neat approach
  • Problem-specific and very manual
  • Little insight into general procedure
  • Though, can create fault tolerant libraries

65
Sorting
  • Given list of numbers, return the numbers in
    sorted order
  • Trivial check
  • Given allegedly sorted output
  • Check that order non-decreasing
  • This doesnt work
  • Output must be permutation of input
  • Above checker would
  • On input 2, 4, 6, 8
  • Accept 0, 0, 0, 0, 0, 0

66
Correct Certifier
  • Certification Trail list of indexes
  • All input elements get ID
  • ith element gets ID i
  • At spot j of trail ID of element in sorted
    position j

0 12
1 45
2 9
3 26
4 33
5 17
6 11
7 92
IDs Data
2 9
6 11
0 12
5 17
3 26
4 33
1 45
7 92
IDs Sorted Data
IDs in Trail
2
6
0
5
3
4
1
7
67
Checking Permutation
  • Certifier gets
  • Input numbers
  • Their IDs in sorted order
  • Uses ID list to reorder input numbers
  • ID list serves as cheat sheet
  • Shows correct sort decisions

Data
12
45
9
26
33
17
11
92
IDs in Trail
2
6
0
5
3
4
1
7
2 9
6 11
0 12
5 17
3 26
4 33
1 45
7 92
Sorted(?) IDs Sorted(?) Data
68
Checking Permutation
  • Two things to check
  • Onto For each sorted element, ? input element
  • 1-1 All sorted elements refer to different
    input elements

Data
12
45
9
26
33
17
11
92
IDs in Trail
2
6
0
5
3
4
1
7
2 9
6 11
0 12
5 17
3 26
4 33
1 45
7 92
Sorted(?) IDs Sorted(?) Data
69
Checking Permutation
  • Onto For each sorted element, ? input element
  • Check that all IDs in trail are valid
  • 1-1 All sorted elements refer to different
    input elements
  • If two IDs in trail equal then some input element
    not touched
  • Not copied to sorted list
  • Use touch counters

0 1 2 3 4 5 6 7
Input
Sorted
0 1 3 3 4 5 6 7
70
Checking Sort
  • To check sort
  • Traverse reordered list
  • Ensure non-decreasing
  • Sorter time O(n?log n)
  • Certifier time O(n)
  • Asymptotically faster
  • Big trick
  • Trail summarizes decisions made by sorter
  • Enough info to quickly recompute
  • Can any problem be cast into set of big decisions?

71
Certification Trails Outline
  • Paper presents checkers for several common
    algorithms
  • Sorting
  • Convex Hull
  • Neat approach
  • Problem-specific and very manual
  • Little insight into general procedure
  • Though, can create fault tolerant libraries

72
Convex Hull Problem
Given set of points on 2D plane, find subset
that forms convex hull around all points.
73
Convex Hull Step 1
Points sorted in order of increasing slope
relative to P1
P1 is the point with the least x-coordinate.
74
Convex Hull Invariant
All the points not on Hull are inside triangle
formed by P1 and two successive points on Hull
P6
P8
P5
P7
P4
P3
P1
P2
75
Convex Hull Invariant
P3 not Hull point ? clockwise angle between
lines P2P3 and P3P4 180º
P6
P8
P5
P7
P4
180º
P3
P1
P2
76
Convex Hull Invariant
P6
P8
P5
P7
P4
P1
lt 180º
P3
P2
77
Convex Hull Algorithm Outline
  • Walk through P2 to Pn in slope order
  • Keep adding points to hull
  • If find point generating angle 180º
  • Back up, remove all such points
  • Until added Pn
  • P1, P2 and Pn must be on convex hull

78
Convex Hull Algorithm
  • Add P1, P2 and P3 to the Hull
  • For Pk P4 to Pn
  • (... trying to add Pk to the Hull )
  • Let QA and QB be the two points most recently
    added to the Hull
  • While the angle formed by QAQB and QBPk 180
  • remove QB from the Hull since it is inside the
    triangle P1, QA, Pk.
  • Add Pk to the Hull

79
Trail for Convex Hull
  • Augment Program to output
  • h1, h2, ..., hm indexes of points on hull
  • For each point Pi not on hull, proof of why not
  • Tuple (xi, hj, hk, hk) s.t. xi in triangle hjhkhl
  • xi internal point
  • hj, hk, hk hull points

P5
P3
P1
P4
P2
80
Convex Hull Checker
  • Checker checks that
  • There is 1-1 correspondence between input points
    and x1, x2, ..., xmUh1, h2, ..., hr.
  • xi internal points
  • hi hull points
  • Each point in triangle proofs lies in given
    triangle
  • Basic error checking
  • Assures that all points accounted for

81
Convex Hull Checker
  • Checker checks that
  • For each triple of consecutive hull points
    hihi1hi2 lines hihi1 and hi1hi2 form
    counter-clockwise angle ?180?
  • i.e. form convex corners of hull
  • ? unique locally maximal point on hull
  • There exists theorem saying shape with above
    properties is convex hull

P8
P6
?180?
?180?
P1
?180?
82
Proving Correctness
  • If hull correct, will return PASS
  • Since we check properties true of convex hulls
  • If hull not correct
  • If bad encoding of proof
  • (i.e. points dont refer to input)
  • First two checks will detect
  • If internal points should be on hull
  • Will not be in any triangle
  • If hull points not convex
  • Convexity assured by theorem, given points were
    valid (ensured by first check)

83
Checking Convex Hull
  • Original Algorithm O(n?log n) time
  • Dominated by initial sort of points
  • Checker O(n) time
  • Asymptotically faster
  • Very different algorithms
  • Little chance of similar errors in both
  • Hardware errors will affect both differently

84
Checking Convex Hull Big Trick
  • Trail contains key facts discovered by algorithm
  • Containment triangles
  • Certifier checks main invariant
  • Convexity of hull

85
Outline
  • Low-hanging fruit
  • Himanshu Gupta. "Result Verification Algorithms
    for Optimization Problems", 1995
  • Checkers for optimizations programs
  • Existence of checkers for certain complexity
    classes
  • Certification Trails
  • Gregory F. Sullivan, Dwight S. Wilson and Gerald
    M. Masson, Certification of Computational
    Results, 1995.
  • Fundamentals of program testing correcting
  • Manuel Blum, Michael Luby and Ronitt Rubinfeld.
    "Self-Testing Correcting with Applications to
    Numerical Problems"

86
Goals
  • Aimed at implementation errors
  • Given implementation P of function f
  • Testing must see if P(x)f(x) for most inputs
  • If correct for all inputs, output PASS
  • If incorrect on too many inputs x, output FAIL
  • Correcting if ? inputs x for which P(x)?f(x)
    then
  • Figure out correct P(x)
  • Assuming P correct on most inputs
  • Tester and corrector may call P

87
Added Reliability
  • Tester and corrector usually simpler than P
  • Use simple additions, etc.
  • Smaller probability of bugs
  • Smaller running time ? smaller prob of random
    fault
  • Tester and corrector run faster than P
  • (Counting calls to P as constant time)
  • Thus, very likely different implementations
  • Small probability of same bug in tester/corrector
    and P

88
Added Reliability
  • One tester/corrector pair per function
  • Reusable across implementations
  • Can design testers/correctors for specific
    libraries
  • Applications using lots of library code (ex
    Java, C) mostly covered
  • Thus, can spend more time debugging
  • Bottom Line
  • P may be faulty
  • Tester/Corrector assumed reliable

89
General Technique Testing
  • Will test linearity property
  • Shared by many functions
  • f(xy) f(x) f(y)
  • Will show
  • If linearity holds for implementation P on inputs
  • Then ? linear function g s.t. P(x)g(x) for most
    inputs x
  • (i.e. if P mostly linear then close to actually
    linear function)
  • Since P(x)f(x) on many inputs, g f

90
General Technique Correcting
  • Focus on random self-reducible functions
  • Can express f(x) as f(y1), f(y2), f(yk) where
    y1, y2, yk random
  • Assumes simple reduction function R()
  • To get P(x), run several times
  • Pick random y1, y2, yk
  • P(x) R(P(y1), P(y2), P(yk))
  • If on input x P(x) wrong, now correct with high
    probability

91
True Definitions
  • error(f, P, D) probability that P(x)?f(x) with
    x randomly chosen via distribution D
  • Probabilistic oracle program
  • Probabilistic program M
  • Makes calls to external oracle program A
  • Calls counted as constant time
  • Syntax MA

92
True Definition Self-Tester
  • Let 0 ? ?1 lt ?2 ? 1
  • Confidence parameter ?gt0
  • (?1, ?2) self-testing program for f
  • Probabilistic oracle program Tf
  • If error(f, P, D) ? ?1 then TfP returns PASS with
    Prob ? 1-?
  • If error(f, P, D) ? ?2 then TfP returns FAIL with
    Prob ? 1-?

?1
?2
FAIL
PASS
0
1
error(f, P, D)
93
True Definition Self-Corrector
  • Let 0 ? ? lt 1
  • Confidence parameter ?gt0
  • ? self-correcting program for f
  • Probabilistic oracle program Cf
  • If error(f, P, D) ? ? then CfP(x) f(x) with
    Prob ? 1-?
  • x randomly selected from distribution D

94
Self-Tester/Corrector Pair
  • Let 0 ? ?1 lt ?2 ? ? ? 1
  • Confidence parameter ?gt0
  • Tf (?1, ?2) self-tester for f
  • Cf ? self-corrector for f
  • Cf applied when Tf doesnt FAIL

NOT FAIL
?1
?2
?
FAIL
0
1
error(f, P, D)
95
Outline
  • Self-correctors
  • Assume existence of self-testers
  • Simpler to prove correctness
  • Self-Testers
  • Correctness harder to prove
  • Requires a bit of group theory

96
Self-Correctors
  • Assume that for implementation Perror(f, P, D)
    known
  • Determined by self-tester
  • error(f, P, D) ? constant
  • Assume function f self-reducible
  • f(x) R(f(y1), f(y2), f(yk))
  • y1, y2, yk random
  • Not independent of each other

97
General Self-Corrector
  • For i 1 to q
  • Pick random y1, y2, yk
  • answeri R(P(y1), P(y2), P(yk))
  • Output majority answer out of answer1q
  • Note For any call to P we ensure output in
    correct range

98
Improving Correctness
  • Before P(x)?f(x) for some xs
  • P(x)?f(x) on constant ? ? inputs x
  • Now
  • Each iteration has prob of failure ? ??k
  • ProbR(P(x1), P(x2), , P(xk)) ?
    R(f(x1), f(x2), , f(xk)) f(x) ? k??
  • Since k calls to P
  • Goal compute correct f(x) with prob ? 1-?
  • Each self-reduction is independent
  • Thus, error probability goes down exponentially
  • Can get ? 1-? success prob after several
    repeats
  • Several ? ln(1/?)

99
Self-Reducibility of Mod
  • Want to compute f(x) (x mod R)
  • R fixed
  • x ? 0R?2n-1
  • Self-reducible
  • (x mod R) (x1 mod R) (x2 mod R)
  • x1 picked uniformly at random
  • x2 picked s.t. x x1 x2
  • Since (x1x2 mod R) (x1 mod R) (x2 mod R)

100
Self-Corrector for Mod
  • For i 1 to q
  • Pick random x1
  • Pick x2 s.t. xx1x2
  • answeri P(x1) P(x2)
  • Return majority among answer1q

101
Self-Reducibility of Modular Mult
  • Want to compute f(x,y) x?y mod R
  • R fixed
  • x, y ? 02n-1
  • Self-Reducible
  • x?y (x1?y1 mod R) (x1?y2 mod R)
    (x2?y1 mod R) (x2?y2 mod R)
  • x1, y1 picked uniformly at random
  • x2, y2 picked s.t. x x1 x2, y y1 y2
  • Since
  • x1?y1 x1?y2 x2?y1 x2?y2
  • x1?(y1 y2) x2?(y1 y2)
  • (x1x2) ? (y1 y2) x?y

102
Self-Corrector for Modular Mult
  • For i 1 to q
  • Pick random x1, y1
  • Pick x2 s.t. xx1x2
  • Pick y2 s.t. yy1y2
  • answeri P(x1,y1) P(x1,y2) P(x2,y1)
    P(x2,y2)
  • Return majority among answer1q
  • Corrector for integer multiplication similar

103
Self-Reducibility of Matrix Mult
  • Want to compute f(A,B) A?B
  • A, B matrices
  • Self-Reducible
  • A?B A1?B1 A1?B2 A2?B1 A2?B2
  • A1, B1 picked uniformly at random
  • A2 A A1, B2 B B1
  • Since
  • A1?B1 A1?B2 A2?B1 A2?B2
  • A1?(B1 B2) A2?(B1 B2)
  • (A1A2) ? (B1 B2) A?B

104
Self-Corrector for Matrix Mult
  • For i 1 to q
  • Pick random A1, B1
  • Pick A2 s.t. A2 A - A1
  • Pick B2 s.t. B2 B B1
  • answeri P(A1,B1) P(A1,B2) P(A2,B1)
    P(A2,B2)
  • Return majority among answer1q
  • Polynomial multiplication works same way

105
Self-Correctors Summary
  • Can design self-corrector for any random
    self-reducible function
  • Unknown how large this set is
  • Self-Correctors very simple
  • Finite bound loop
  • Simple reduction function
  • Thus, low probability of error in self-corrector
  • Number of calls to P increases by log factor

106
Outline
  • Self-correctors
  • Assume existence of self-testers
  • Simpler to prove correctness
  • Self-Testers
  • Correctness harder to prove
  • Requires a bit of group theory

107
Moving to Self-Testing
  • Can now self-correct
  • Self-correctors only work if error(f, P, D) ?
    constant
  • But how can we know?
  • Self-testing tells us

108
Self-Testing
  • Given implementation P of function f
  • Want to know error(f, P, D) ProbP(x)?f(x) for
    x chosen with distribution D
  • Assume P(x) f(x) for many xs
  • Will present self-testers for linear functions
  • Linear f(x1x2) f(x1) f(x2)
  • Tests will bound error(f, P, D)

109
Linearity
  • Modf(x1x2) (x1x2) mod R
    (x1 mod R)(x2 mod R)
  • Modular Multf(x1x2) ((x1x2)?y) mod R
    (x1?y mod R) (x2?y mod R)

110
Self-Testing Technique
  • Central Claim If P(x) is linear in many
    spotsThen ? linear function g s.t. low prob
    of P(x)?g(x)
  • If P mostly linear then P very close to actually
    linear function
  • Since assumed for most x P(x) f(x)Then the
    linear function P is close to is f()

111
Self-Testing Technique
  • Thus self-tester repeatedly
  • Picks random inputs
  • Verifies P(x1x2) P(x1) P(x2)
  • Verifies P(x11) P(x1) 1

112
Intro to Group Theory
  • To explain why this works, must introduce group
    theory
  • Group set of numbers with operation ltS, gt
  • Set S
  • Binary Operation
  • Given two group elements, produces another group
    element
  • Examples
  • Infinite group Integers with
  • Finite group Even integers 098, with
    mod 100

113
Identity Element
  • Special element 0
  • ?h?S. 0 h h
  • Called Identity Element
  • Ex Rationals, with ?
  • x ? y another rational
  • 1 is identity since 1 ? x x
  • Inverse x-1 s.t. x x-1 0

114
Group Generators
  • Powers hn h h h h (n times)
  • an am anm
  • Each group has elements g1,,gc s.t.?h?S.
    ?n1n2nc. h
  • Thus, all elements expressible in terms of g1,gc
  • They generate the group

115
Subgroups
  • Given group ltS, gt
  • Subgroup ltT, gt if T?S
  • Ex
  • S Even Integers 098 mod 100
  • T Multiples of 4 ? 098
  • Each subgroup has own generators
  • ltS, gt generated by 2
  • ltT, gt generated by 4
  • T 0 always subgroup

116
Subgroups Example
  • Let G ltintegers 014, mod 15gt
  • Group generator 1
  • Subgroups
  • 0 generated by 0
  • 0, 3, 6, 9, 12 generated by 3
  • 0, 5, 15 generated by 5
  • All subgroups generated by some power of 1
  • In general, every subgroup generated by some
    combo of group generators

117
Functions on Groups
  • is function from group to itself
  • Can define others
  • Ex given additive group, can define
    multiplication
  • Functions from one group to another
  • G1 even integers 0...98
  • G2 powers of 4 0186
  • f G1?G2
  • f(x?G1) (x?2) ? G2

118
Homomorphisms
  • Homomorphism function on groups s.t.
  • f GA?GB
  • GB ltSA, Agt
  • GB ltSB, Bgt
  • f(x1 A x2) f(x1) B f(x2)
  • i.e. Linear function on groups
  • Example modular exponentiation
  • GA Integers 0...R?2n, with
  • GB Integers 0...R, with ?
  • fa(x) ax mod R
  • fa(x1x2) ax1x2 mod R (ax1 mod R) ? (ax2 mod
    R) f(x1) ? f(x2)

119
Homomorphisms and Subgroups
  • Fact homomorphisms map subgroups to subgroups
  • f G1?G2
  • ?x?H1, H1 subgroup of G1f(x) are all the members
    of some H2, subgroup of G2

G1
G2
Homomorphism
120
Self-Testing Technique
  • Given implementation P of function ff GA?GB
  • Linear TestFor i1 to n
  • Randomly pick x1, x2
  • Check if
  • If GB has no finite subgroups (besides 0B) then
    done

121
Self-Testing Technique
  • Given implementation P of function ff GA?GB
  • If GB has finite subgroups (besides
    0B)Neighbor TestFor i1 to n
  • For each generator gi of group GA
  • Randomly pick z
  • Check if
  • is f(z)s neighbor in target
    subgroup of GB

122
Proof
  • Method
  • Assume that linearity and neighbor tests passed
  • i.e. ? error for tests bounded by constant
  • Prove that P mostly equal to some linear g()
  • Steps
  • Define discrepancy function disc(x) g(x)
    P-1(x)
  • g() some linear function, operating on same
    groups as f()
  • P(x)g(x) for most x ? disc(x)0 for most x
  • disc(x) is homomorphism ? linearity test
    passes for x
  • Thus, if linearity test passed on many xs, disc
    is probably homomorphism

123
Proof
  • f() maps finite group to (possibly infinite)
    group
  • Mod
  • Integer Mult
  • Modular Mult
  • g() and disc() map same groups

124
Proof
  • Recall homomorphisms map subgroups to subgroups
  • Thus, homomorphism disc() can only map domain
    group to finite subgroup of range group
  • Question Does range group have finite subgroups?
  • (besides 0B)

125
Proof
  • Range group no finite subgroups besides 0B
  • Then disc must map domain group to 0B
  • disc(x) 0B
  • (On most xs, since disc probably homomorphism)

Finite Group
Infinite Group
0
Homomorphism
126
Proof
  • Range group has real finite subgroups
  • Each subgroup must have generators
  • Those generators are products of the groups
    generators
  • Neighbor test
  • For each generator gi
  • Ensures any such subgroup must be 0B

127
Proof
  • Thus, disc(x) 0B (with high probability)
  • Recall disc(x) g(x) P-1(x)
  • Thus, P(x) g(x) (with high probability)
  • g() still undefined linear function
  • Assumption P(x) f(x) on many inputs
  • Fact if two linear functions equal on gt1 point,
    they are same function
  • Thus, g f and P(x)f(x) (with high probability)

128
Summary of Self-Testers
  • Can verify if P(x)f(x) with high probability
  • Useful information to know
  • Required for self-correctors to work reliably
  • Simple implementation
  • Constant bound loop
  • A few additions, multiplications, etc.
  • Can be self-corrected themselves

129
Summary
  • Paper presents the basics of broader theory of
    self-testing/-correcting
  • Some additional related papers
  • S. Ravikumar and D. Sivakumar. "Efficient
    Self-Testing of Linear Recurrences".
  • Expands these testers to test linearity in bulk
    fashion for linear recurrences
  • Ronitt Rubinfeld and Madhu Sudan. "Robust
    Characterizations of Polynomials with
    Applications to Program Testing", SIAM Journal on
    Computing, 1996.
  • Funda Ergun, S. Ravi Kumar and Ronnitt Rubinfeld.
    "Checking Approximate Computations of Polynomials
    and Functional Equations", IEEE Conference on
    Foundations of Computer Science, 1996.
Write a Comment
User Comments (0)
About PowerShow.com