Title: Using Implications for Online Error Detection
1. Using Implications for Online Error Detection
- Nuno Alves, Jennifer Dworak, R. Iris Bahar
- Division of Engineering
- Brown University
- Providence, RI 02912
- Kundan Nepal
- Electrical Engineering Dept.
- Bucknell University
- Lewisburg, PA 17837
NATW 2008
2. Online error detection
- Purpose: detect transient faults that may occur in a circuit during operation
- Critical as circuits scale to smaller sizes
- Easy in memory logic
- Not so easy in circuit logic
3. Common online detection techniques
- Storing pre-computed test vectors in hardware
- Duplicating the computation on disjoint hardware elements and voting on the result
- Use of check bits
4. Our approach
- Find invariant relationships in a circuit
- Violations of these expected relationships can
identify errors
5. Error detection implementation
6. Invariant relationships in circuits
- These relationships are logic implications
7. Error detection with implications
[Figure: checker logic with output labeled ERROR]
n51 n81 will generate an error in checker logic
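The check described on this slide can be sketched in a few lines of Python. This is a minimal illustration, not the authors' checker hardware: the `Implication` type, the `violated` helper, and the AND-gate example (nodes `a`, `b`, `c`) are all assumptions made for the example. An implication "src = src_val implies dst = dst_val" is violated exactly when the source node has its trigger value and the destination node does not have the implied value.

```python
from typing import Dict, NamedTuple


class Implication(NamedTuple):
    """Hypothetical record for 'src == src_val implies dst == dst_val'."""
    src: str
    src_val: int
    dst: str
    dst_val: int


def violated(imp: Implication, values: Dict[str, int]) -> bool:
    """Return True when the observed node values break the implication."""
    return values[imp.src] == imp.src_val and values[imp.dst] != imp.dst_val


# Assumed example: for an AND gate c = a & b, c = 1 implies a = 1.
imp = Implication(src="c", src_val=1, dst="a", dst_val=1)

fault_free = {"a": 1, "b": 1, "c": 1}   # consistent with c = a & b
faulty = {"a": 0, "b": 1, "c": 1}       # a transient fault has flipped c to 1

error_fault_free = violated(imp, fault_free)
error_faulty = violated(imp, faulty)
```

In hardware, the same test would be a small gate on the two nodes; here `error_faulty` plays the role of the ERROR signal.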
8. How we find implications
9. We have implications. Now what?
10. Why should we remove implications?
- With all implications, we can generate checker logic for each implication.
- Inefficient!
- A circuit can contain thousands of implications
- Generating separate checker logic for each implication could more than double circuit size.
- We want to check only the most important implications.
11. Removing redundant implications
[Figure: diagram with nodes n1 through n13]
12. Removing low-coverage implications
- We only want implications that
- Detect many faults
- Identify hard-to-detect faults
- Cover faults not detected by other implications
- Finding these important implications requires fault analysis to determine the specific fault coverage for each implication
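One way to picture the selection criteria above is a greedy pass over per-implication fault coverage. The slides do not say which selection algorithm is used, so the greedy strategy, the `greedy_select` function, and the coverage data below are all assumptions for illustration. Each step keeps the implication that covers the most faults not yet covered by earlier picks, which naturally skips implications whose faults are already detected by others.

```python
from typing import Dict, List, Set, Tuple


def greedy_select(coverage: Dict[str, Set[int]],
                  max_count: int) -> Tuple[List[str], Set[int]]:
    """Pick up to max_count implications, each adding the most new faults."""
    chosen: List[str] = []
    covered: Set[int] = set()
    remaining = dict(coverage)
    while remaining and len(chosen) < max_count:
        # Sort names so ties are broken deterministically.
        best = max(sorted(remaining), key=lambda k: len(remaining[k] - covered))
        gain = remaining[best] - covered
        if not gain:
            break  # every remaining implication is redundant for coverage
        chosen.append(best)
        covered |= gain
        del remaining[best]
    return chosen, covered


# Hypothetical fault coverage: implication name -> IDs of faults it detects.
coverage = {
    "i1": {1, 2, 3},
    "i2": {3, 4},
    "i3": {1, 2},   # fully covered by i1, so never selected
    "i4": {5},
}

chosen, covered = greedy_select(coverage, max_count=2)
```

With this data, `i3` is never chosen because `i1` already covers its faults, which mirrors the "cover faults not detected by other implications" criterion.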
13. Reducing the number of implications
14. Covering faults with implications
- For each random input vector and each fault, the operation of the circuit with implication checking falls into one of the following 4 categories

                              Case 1  Case 2  Case 3  Case 4
Error propagates to output      ?       ?       ?       ?
An implication is violated      ?       ?       ?       ?
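The four categories are the four combinations of "error propagates to an output" and "an implication is violated". The sketch below tallies them for a toy circuit. Everything in it is an assumption for illustration: the circuit `out = (a & b) | c`, the injected fault (a bit flip on internal node `n`), the single implication `n = 1 implies a = 1`, and the descriptive labels, which are not tied to the slide's Case 1 to Case 4 numbering. It also enumerates all input vectors instead of sampling random ones so that the result is deterministic.

```python
from collections import Counter
from itertools import product


def simulate(a: int, b: int, c: int, flip_n: bool = False) -> dict:
    """Evaluate out = (a & b) | c, optionally flipping internal node n."""
    n = a & b
    if flip_n:
        n ^= 1  # model a transient fault on node n
    return {"a": a, "b": b, "c": c, "n": n, "out": n | c}


def classify(propagates: bool, violated: bool) -> str:
    """Map the two yes/no observations onto one of four categories."""
    if propagates and violated:
        return "propagates, implication violated"
    if propagates:
        return "propagates, no violation"
    if violated:
        return "masked, implication violated"
    return "masked, no violation"


counts = Counter()
for a, b, c in product((0, 1), repeat=3):
    good = simulate(a, b, c)
    bad = simulate(a, b, c, flip_n=True)
    propagates = good["out"] != bad["out"]
    # Assumed implication for this toy circuit: n = 1 implies a = 1.
    violated = bad["n"] == 1 and bad["a"] != 1
    counts[classify(propagates, violated)] += 1
```

The category "propagates, no violation" is the one that matters most: an observable error that the checker misses.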
15. Average distribution of the 4 scenarios
16. How often do we detect errors?
17. Implications with fixed HW budgets
- Given a fixed HW budget, by how much can we
reduce the probability of an undetected error?
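The question above can be made concrete with a small calculation. The sketch below is an assumption-laden illustration, not the authors' experiment: the `events` list is made-up data, each event records whether the error reached an output and which implications were violated, and the hardware budget is approximated by the number of implications kept rather than by real area.

```python
from typing import FrozenSet, List, Set, Tuple

Event = Tuple[bool, FrozenSet[str]]  # (error propagates?, violated implications)


def undetected_probability(events: List[Event], selected: Set[str]) -> float:
    """Fraction of events whose error is observable but flagged by no checker."""
    undetected = sum(
        1 for propagates, violated in events
        if propagates and not (violated & selected)
    )
    return undetected / len(events)


# Hypothetical (input vector, fault) outcomes.
events: List[Event] = [
    (True, frozenset({"i1"})),
    (True, frozenset({"i1", "i2"})),
    (True, frozenset({"i2"})),
    (True, frozenset()),
    (False, frozenset({"i1"})),
    (False, frozenset()),
    (False, frozenset()),
    (False, frozenset()),
]

p_no_checkers = undetected_probability(events, set())
p_one_checker = undetected_probability(events, {"i1"})
p_two_checkers = undetected_probability(events, {"i1", "i2"})
```

Plotting this probability against the number (or area) of checkers kept gives the kind of budget trade-off the slide asks about.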
18. Conclusions
- Practical online error detection alternative based on implication validation
- No modification of targeted logic
- Checker logic is added off the critical path and runs in parallel with the rest of the circuit.
- For several circuits, we can detect almost 90% of all errors that propagate to a primary output.
- With only a 10% area overhead, the probability of an error being both observable and undetected is reduced to 11% on average