New Results in Reversible Logic. - PowerPoint PPT Presentation

About This Presentation
Title:

New Results in Reversible Logic.

Description:

In like fashion, with Q(n) and NOT gates, we can devise a circuit that ... As an illustration, two reversible 4-bit carry-look-ahead adders in 0.8 m c-MOS have ... – PowerPoint PPT presentation

Number of Views:1965
Avg rating:3.0/5.0
Slides: 120
Provided by: webCe
Learn more at: http://web.cecs.pdx.edu
Category:

less

Transcript and Presenter's Notes

Title: New Results in Reversible Logic.


1
New Results in Reversible Logic.
  • Marek Perkowski

LDL seminar, January 18, 2002
2
Plan
  • 1. recent results in Reversible Logic obtained by
    our group.
  • 2. review the work of DeVos, Storme and others.
  • 3. new reversible gates in CMOS.
  • 4. reversible fuzzy logic.
  • 5. methods to create reversible gates of higher
    number of inputs/outputs.
  • 6. new regular structures and synthesis methods
    for them.
  • 7. an approach to synthesize arbitrary
    multi-output functions with no garbage.
  • 8. the system of EDA tools for reversible/quantum
    logic.
  • 9. binary, data mining, multiple-valued and
    quantum benchmarks

3
Think about a gate as an input/output constraint
A B X Y 0 0 0 0 0 1 0 1 1 0
1 1 1 1 1 0
R(input, output) R(ltA,Bgt,ltX,Ygt)
R(ltA,Bgt,ltX,Ygt) Permute(2,3)
ltlt1,0gt , lt1,1gtgt YES
lt0,0gt gt lt0,0gt
lt1,0gt lt lt1,1gt
ltlt1,1gt , lt0,1gtgt NO
4
Reversible computation.
  • Charles Bennett, IBM, 1973.
  • Logical reversibility of computation,
  • IBM J. Res. Dev. 17, 525 (1973).

This principle applies also to combinational
circuits that we build, but is this a best way,
question for us to answer
5
Landauer's principle
  • Landauer's principle logic computations that are
    not reversible, necessarily generate heat
  • i.e. kTlog(2), for every bit of information that
    is lost.
  • where k is Boltzmann's constant and T
    the temperature.
  • For T equal room temperature, this package of
    heat is small, i.e. 2.9 x 10 -21 joule, but
    non-negligible.
  • In order to produce zero heat, a computer is
    only allowed to perform reversible computations.
  • Such a logically reversible computation can be
    undone' the value of the output suffices to
    recover what the value of the input has been'.
  • The hardware of a reversible computer cannot be
    constructed from the conventional gates
  • On the contrary, it consists exclusively of
    logically reversible building blocks.

6
  • Tomasso Toffoli, 1980 There exists a reversible
    gate which could play a role of a universal gate
    for reversible circuits.

Q(3) (x,y,z)gt(x,y,z_at_xy)
_at_ denotes EXOR
Fredkin and Toffoli created the first (3,3)
universal gate
A B
A A XOR B
reversible
Toffoli Gate
7
Quantum XOR
  • Of special interest is the controlled-NOT or
    reversible XOR gate
  • XOR (x y) gt (x, x _at_
    y)
  • by _at_ we denote EXOR (modulo-2 sum)

These notations were introduced by physicists and
they are inconsistent with standard electrical
engineering notations, however it will be
convenient for us to use both notations.
8
Swapping bits using XOR cascade
With the circuit
Constructed from three Quantum XORs, we can swap
two bits
(x,y) gt (x,x_at_y) gt (y,x_at_y) gt (y,x)
Conclusion in quantum logic you pay for crossing
wires!!!
cascade
  • Of importance in quantum, quantum dot, but not
    CMOS

9
Use of Toffoli Gate
  • From three-bit Toffoli-Gate Q(3)

The first step is to show that from the three-bit
Toffoli Gate Q (3) we can construct an n-bit
Toffoli Gate Q (n). The n-bit gate works as
follows
(x1,x2,,x n-1, xn)gt(x1,x2,,x n-1 y_at_x1 x 2x
n-1 )
The construction requires one extra bit of
scratch space. For example, we construct Q (4)
circuit from Q (3) circuits as follows
Q 4
Q 4 from three Q 3
Scratch space
10
Use of Toffoli Gate
  • The purpose of the last Q (3) gate is to reset
    the scratch bit back to its original value 0.
  • With one more gate we can obtain an
    implementation of Q (4) that works irrespective
    of the initial value of the scratch bit

We can eliminate the last gate if we dont mind
flipping value of the scratch bit
11
Use of Toffoli Gate
  • When we need to construct Q (k) , any available
    extra bit will do, since the circuit returns the
    scratch bit to its original value.
  • The next step is to note that, by conjugating
    Q(n) with NOT gates, we can in effect modify the
    value of the control string that triggers" the
    gate.
  • For example, the circuit

flips the value of y if x1 x2 x3 010, and it
acts trivially otherwise.
12
Use of Toffoli Gate
  • Flips the value of y if x1 x2 x3 010, and it
    acts trivially otherwise.
  • Thus this circuit transposes the two strings 0100
    and 0101.
  • In like fashion, with Q(n) and NOT gates, we can
    devise a circuit that transposes any two n-bit
    strings that differ in only one bit.
  • (The location of the bit where they differ is
    chosen to be the target of the Q (n) gate.)

13
P
Q
R
Q
R
P
0
1
0
1
A
0
1
A
C
B
B
C
A circuit from two multiplexers
Its schematics
This is a reversible gate, one of many
Notation for Fredkin Gates
14
Margolus Gate
15
Toffoli Gate
  • The 3 3 Toffoli gate is described by these
    equations
  • P A,
  • Q B,
  • R AB ? C,
  • Toffoli gate is an example of two-through gates,
    because two of its inputs are given to the
    output.

P
R
Q


A
B
C
16
P
Q
(b)
Feynman
Toffoli
P
R
Q


1
0



C
A
B
Kerntopf Gate
17
Kerntopf Gate
  • The Kerntopf gate is described by equations
  • P 1 _at_ A _at_ B _at_ C _at_ AB,
  • Q 1 _at_ AB _at_ B _at_ C _at_ BC,
  • R 1 _at_ A _at_ B _at_ AC.
  • When C1 then P A B, Q A B, R !B, so
    AND/OR gate is realized on outputs P and Q with
    C as the controlling input value.
  • When C 0 then P !A ! B, Q A !B, R A
    _at_ B.
  • 18 different cofactors!

18
Kerntopf Gate
  • As we see, the 33 Kerntopf gate is not a
    one-through nor a two-through gate.
  • Despite theoretical advantages of Kerntopf gate
    over classical Fredkin and Toffoli gates, so far
    there are no published results on optical or CMOS
    realizations of this gate.

19

How to build garbage-less circuits
D
Feynman
Fredkin
A
Toffoli
B
C

Feynman
inputs reconstructed
F1 from spy
F2 from spy
2 outputs no garbage width 4 delay 9
A,B,C,D are original inputs
This process is informationally reversible It can
be in addition thermodynamically reversible
20
Observations
  • We reduced garbage at the cost of delay and
    number of gates
  • We were not able to reduce the width of the
    scratchpad register

21
0 1 1 0 1
1 1 0 1 0
Conservative circuit the same number of ones in
inputs and outputs
X3 xor3
Variable
Davio
Majority
X2 xor2
Shannon
Examples of balanced functions half of Kmap are
ones
22
YZ
YZ
X
X
- - - -
0 1 1 1
garbage
0
1
i
fg
garbage
1
0
0
1
23
X
Y
Z
24
Lattice Structure for Multiple-valued and Binary
Logic
  • Realizes every binary symmetric function
  • Realizes every non-symmetric function by
    repeating variables
  • Realizes piece-wise linear multivalued functions

Patented by Pierzchala and Perkowski 1994/1999
25
Lattice Structure for Binary Logic
F S 1,3 (A,B,C)
A
0
1
B
C
0
1
1
0
S0
S1
S2
S3
26
YZ
YZ
X
X
- - - -
0 1 1 1
garbage
0
1
i
fg
garbage
1
0
0
1
27
(No Transcript)
28
Control gates as building blocksfor reversible
computers
  • A. De Vos 1 , B. Desoete 2 , F. Janiak 3 , and A.
    Nogawski 3
  • 1 Universiteit Gent and Imec v.z.w., B-9000 Gent,
    Belgium 2 Universiteit Gent, B-9000 Gent, Belgium
    3 Politechnika L odzka, PL-90-924 L odz, Poland

29
  • As an illustration, two reversible 4-bit
    carry-look-ahead adders in 0.8 ?m c-MOS have been
    built.

30
2 Simple control gates
  • 2.1 Definition
  • A gate with k inputs (A1, A2 ,, Ak) and k
    outputs (P1, P2,,... ,, Pk), satisfying Pi Ai
    for all i ? 1, 2,.., k
  • Pk f(A1,A2,, Ak-1) ? Ak
  • with f an arbitrary boolean function of k
    boolean arguments, is called a simple control
    gate.
  • The number k is called the width of the gate.
  • The logic inputs A1, A2, ..., Ak-1 are named the
    controlling bits, whereas the input Ak is the
    controlled bit.
  • Finally, the function f is called the control
    function.

31
2.2 Properties
  • Any simple control gate is reversible.
  • We cascade two identical simple control gates,
    yielding
  • Pk f(A1, A2,, A k-1) ? f(A1, A2,,
    Ak-1) ? Ak
  • and thus Pi Ai for all i, because of the
    two boolean identities X ? X 0 and 0 ? Y Y .
  • The result is thus the k-bit follower.
  • In other words any simple control gate is its
    own inverse, and thus is necessarily reversible.

32
  • Cascading two arbitrary simple control gates (one
    with control function f and one with control
    function f) results in a new simple control
    gate, with control function f ? f.
  • Therefore the simple control gates of width k
    together with the operation cascading form a
    group.
  • The group has
    elements.

33
  • and thus can be built into a square geometry,
    provided we use dual line electronics, i.e. any
    signal X is accompanied by its counterpart NOT X.
  • Fig. 1 shows Pk is connected to Ak if f 0 but
    is connected to Ak (short notation for NOT Ak)
    if f 1.

34
  • Because a boolean function f(A1, A2,.,A k-1) can
    always be written
  • either as an OR of ANDs' (often referred to as
    sum of minterms')
  • or as an AND of ORs' (often referred to as
    product of maxterms'),
  • we can implement Pk f(A1, A2,.,A k-1) ? A k
    in the square, using
  • either four parallel connections of series
    connections of switches
  • or four series connections of parallel
    connections of switches
  • or a combination of both.

35
  • Each switch is composed of one n-MOS transistor
    in parallel with one p-MOS transistor (forming
    together a transmission gate).
  • This leads to a reversible electronic
    implementation in dual-line pass-transistor
    logic so-called r-MOS technology 9.
  • Such logic is naturally suited for adiabatic
    addressing 10 11 12.
  • All energy supplied to the outputs Pk and Pk
    comes from the inputs Ak and Ak, i.e. not from
    separate power lines.
  • 9. A. De Vos Reversible computing. Progress in
    Quantum Electronics 23 (1999) 1-49
  • 10. B. Desoete and A. De Vos Optimal charging of
    capacitors. In A. Trullemans and J. Spars½
    (eds.) Proc. 8 th Int. Workshop Patmos, Lyngby
    (Oct. 1998) 335-344
  • 11. A. De Vos and B. Desoete Equipartition
    principles in finite-time thermodynamics. Journal
    of Non-Equilibrium Thermodynamics 25 (2000) 1-13
  • 12. A. De Vos, B. Desoete, A. Adamski, P.
    Pietrzak, M. Sibi nski, and T. Widerski Design
    of reversible logic circuits by means of control
    gates. In D. Soudris, P. Pirsch, and E. Barke
    (eds.) Proc. 10 th Int. Workshop Patmos,
    G.ottingen (Sept. 2000) 255-264

36
Fig. 1. Implementation of the function f ? Ak,
with the help of four switches
  • f ? Ak

37
Example of extending functions to reversible
functions
38
(No Transcript)
39
Fig. 2. Implementation of boolean Table 2 using
12 switches
40
Fig. 2. Implementation of boolean Table 2 using
28 switches
  • An alternative approach makes use of standard
    cells, where the particular function f(A, B, C),
    to be XORed with D, is hardwired by the vias
    between the Metal1 and Metal2 layers of the chip.
  • These vias are displayed as small black squares
    in Fig. 2b.
  • The programmable gate however needs 2 k1 - 4
    28 switches.

41
Fig. 3. Decomposition of a control gate into
simple control gates
  • Because each output is only one boolean function
    f away from the inputs, its logic depth is only
    1.
  • We call such gates control gates, as each output
    Pi is either equal to the controlled bit Ai or to
    its inverse Ai, depending on the value of its
    k-1 controlling inputs A1, A2, .., Ai-1.

3. Control gates
3.1 Definition When we cascade k simple control
gates (one of width k, one of width k-1,..., and
one of width 1), in the way of Fig. 3, we have a
new gate of width k.
42
  • We thus come to the definition of a control gate
  • a logic gate with k inputs (A1, A2,.,A k) and k
    outputs (P1, P2,.,P k), satisfying
  • Pi fi (A1, A2,.,A k) ? Ai for all i ?
    1,2,,k
  • with fi arbitrary boolean functions of (i-1)
    arguments, is called a control gate.
  • Note that a control gate with width k has (k-1)
    controlling bits (A1, A2,.,A k) as well as
    (k-1) controlled bits (A2,.,A k)
  • We remark that the above definition is somewhat
    more general than the preliminary definition
    presented at Patmos 2000 12.

43
3.2 Properties
  • As any control gate is composed of simple control
    gates and any simple control gate is reversible,
    the control gate is thus also reversible.
  • The inverse of a simple control gate is equal to
    itself.
  • This is not the case with an arbitrary control
    gate.
  • The inverse of the control gate of Fig. 3
    consists of first putting the simple control gate
    f1, then the simple control gate f2, etc.
  • Now the cascading of two simple control gates of
    different width is not commutative.
  • Thus an arbitrary control gate and its reverse
    are not necessarily equal, the simple building
    blocks appearing in opposite order.

44
  • Two control gates (one with control functions f
    and one with control functions f), when
    cascaded, form a new control gate,
  • with control functions fi (A1, A2,.,A i-1) f0
  • ? Ai fi(A1 A2 Ai-1) f0 i
  • (A1, A2,.,A i-1) ? f
  • (A1, A2,.,A i-1) ? f
  • (A1 A2 Ai -1) XOR f00 i ( f0 1 () XOR A1
    f0 2 (A1) XOR A2 f0-1 (A1 A2 Ai-2)
    XOR Ai 1 )

45
  • Thus, the control gates of width k, together with
    the cascading operation, form a group.
  • This group has 2(2 k-1) elements and is
    solvable, but not abelian.

46
4 Carry-look-ahead adder
  • To demonstrate the flexibility of using control
    gates, we present here, as an example, a 4-bit
    carry-look-ahead adder, as an alternative to the
    classical, i.e. ripple adder.
  • An n-bit ripple adder consists of 2n gates of
    type CONTROLLED NOT and 2n gates of the
    CONTROLLED CONTROLLED NOT type 12.
  • Its logic depth increases with increasing n.
  • In order to make the calculation less deep, and
    thus faster, we replace the ripple adder by a
    carry-look-ahead adder.
  • For the carry-look-ahead (or c.l.a.) 13, we
    first need to implement the calculation of the n
    generator bits Gi and the n propagator bits Pi
    from the n addend bits Ai and the n augend bits
    Bi
  • Gi Ai AND Bi
  • Pi Ai XOR Bi

47
4 Carry-look-ahead adder
  • Next we need to calculate the n carry-out bits Ci
    from the single carry-in bit C0, the n generator
    bits, and the n propagator bits. In its simplest
    form, the 4-bit carry-look-ahead adder implements
    the following equations
  • C1 G0 OR (C AND C0)
  • C2 G1 OR (P1 AND (G0 OR (P0 AND C0)))
  • C3 G2 OR (P2 AND (G1 OR (P1 AND (G0 OR (P0 AND
    C0)))))
  • C4 G3 OR (P2 AND (G2 OR (P2 AND (G1 OR (P1 AND
    (G0 OR (P0 AND C0)))))))
  • This can be performed by a control gate with 2n
    1 bits controlling n other bits (i.e. k 3n
    1).
  • The electronic implementation of this gate
    consists of n squares, counting 8n(n 2)
    transistors.
  • In the third and final step, the adder calculates
    the n sum bits
  • Si Pi XOR Ci

48
5 Results
  • Putting the three parts (calculation of (Gi Pi),
    of Ci1, and of Si) together, we see that the
    logic depth d of the resulting n-bit c.l.a. adder
    is 3, independent of n.
  • Note that we consider the NOT as a gate of zero
    depth.
  • Indeed, in dual line hardware, the NOT gate is
    merely an interchange of the two lines and thus
    costs neither silicon area, nor time delay, nor
    power dissipation.
  • Fig. 4 shows the 4-bit c.l.a. adder.
  • For sake of clarity, the 8 preset input lines and
    the 12 garbage output lines are not shown, nor
    are the inverters (i.e. the NOT gates).
  • Each logic gate has an equal number of logic
    inputs and logic outputs, a number we call the
    width w of the gate.
  • The full circuit has depth d 3, width w 17,
    and transistor count t 320. For an arbitrary n,
    we have d 3, w 4n 1 and t 8n(n 6).
  • For comparison the ripple adder has d n 1, w
    3n 1 and t 48n.
  • Thus for any number n gt 2, the c.l.a. adder is
    less deep (and thus faster) than its ripple
    counterpart.
  • At the other side, for any number n, the c.l.a.
    circuit is more complex than the ripple circuit,
    the hardware overhead becoming quite substantial
    for large n.

49
  • Fig. 5 shows the 4-bit implementation in the 0.8
    ? m c-MOS n-well technology CYE of Austria Mikro
    Systeme.
  • The n-MOS transistors have length L equal to 0.8
    ?m and width W equal to 2 ? m.
  • The p-MOS transistors have L 0.8 ? m and W 6
    ? m.
  • The threshold voltages are 0.85 volt (n-MOS) and
    0.75 volt (p-MOS).
  • -Vss ranging from 1 volt to 3 volts.
  • The whole chip (bonding pads included) measures
    1.9 mm 1.2 mm.
  • The chip has been tested successfully, with power
    supply voltage Vdd -Vss ranging from 1 volt to 3
    volts.

50
  • A c.l.a. chip, applying hardware-programmed
    control gates, is designed.
  • It contains as many as t 64/3 (4 n 3n - 1)
    5696 transistors and measures 2.5 mm 2.0 mm.
  • In the recent literature, other 4-bit c.l.a.
    adders 14 15, and even an 8-bit 16 c.l.a.
    adder with adiabatic/reversible gates have been
    presented.
  • Our design should not at all be considered as
    just one more such a circuit.
  • Our c.l.a. adders should be regarded as specific
    examples of the design philosophy we have
    developed reversible control gate logic.

51
Fig. 4. Schematic diagram of reversible
carry-look-ahead four-bit adder
52
Fig. 5. Microscope photograph of c-MOS reversible
carry-look-ahead four-bit adder
53
  • Fig. 6a shows the experimental transient output
    C4 for augend B 1101 and addend A changing
    quasi-adiabatically from 0010 to 0011 with
    charging time ? 50 ?s.
  • Fig. 6b shows the power dissipation estimated by
    Spectre simulations (including parasitics) for
    Vdd - Vss 2 V, as a function of ? .

54
References
  • 1. R. Landauer Irreversibility and heat
    generation in the computing process. I.B.M.
    Journal of Research and Development 5 (1961)
    183-191
  • 2. C. Bennett and R. Landauer The fundamental
    physical limits of computation. Sc. American 253
    (July 1985) 38-46
  • 3. R. Landauer Information is physical. Physics
    Today 44 (May 1991) 23-29
  • 4. T. Toffoli Reversible computing. In J. De
    Bakker and J. Van Leeuwen (eds.) 7 th Colloquium
    on Automata, Languages and Programming, Springer,
    Berlin (1980) 632644
  • 5. E. Fredkin and T. Toffoli Conservative logic.
    Int. Journal of Theoretical Physics 21 (1982)
    219-253
  • 6. L. Storme, A. De Vos, and G. Jacobs Group
    theoretical aspects of reversible logic gates.
    Journal of Universal Computer Science 5 (1999)
    307-321
  • 7. R. Feynman Quantum mechanical computers.
    Optics News 11 (1985) 11-20

55
References
  • 8. R. Feynman Feynman lectures on computation
    (A. Hey and R. Allen, eds.). Addison-Wesley,
    Reading (1996)
  • 9. A. De Vos Reversible computing. Progress in
    Quantum Electronics 23 (1999) 1-49
  • 10. B. Desoete and A. De Vos Optimal charging of
    capacitors. In A. Trullemans and J. Spars½
    (eds.) Proc. 8 th Int. Workshop Patmos, Lyngby
    (Oct. 1998) 335-344
  • 11. A. De Vos and B. Desoete Equipartition
    principles in finite-time thermodynamics. Journal
    of Non-Equilibrium Thermodynamics 25 (2000) 1-13
  • 12. A. De Vos, B. Desoete, A. Adamski, P.
    Pietrzak, M. Sibi nski, and T. Widerski Design
    of reversible logic circuits by means of control
    gates. In D. Soudris, P. Pirsch, and E. Barke
    (eds.) Proc. 10 th Int. Workshop Patmos,
    G.ottingen (Sept. 2000) 255-264

56
References
  • 13. H. Taub Digital circuits and
    microprocessors. Mc Graw Hill, Auckland (1982)
  • 14. S. Kim and M. Papaefthymiou True
    single-phase energy-recovering logic for
    low-power, high-speed VLSI. In Proc. 1998 Int.
    Symposium on Low Power Electronics Design,
    Monterey (August 1998) 167-172
  • 15. S. Kim and M. Papaefthymiou Pipelined DSP
    design with true single-phase energy-recovering
    logic style. In Proc. I.E.E.E. Alessandro Volta
    Memorial Workshop on Low Power Design, Como
    (March 1999) 135-143
  • 16. S. Kim and M. Papaefthymiou Low-energy adder
    design with a single-phase source-coupled
    adiabatic logic. In J. Spars½ and D. Soudris
    (eds.) Proc. 9 th Int. Workshop Patmos, Kos
    (Oct. 1999) 93-102

57
  • Example 1
  • In the vector space of two-variable Boolean
    functions, the Reed-Muller basis functions are 1,
    a, b, and ab, while the minterms are a b, ab,
    ab, and ab.
  • The transition from the minterm basis to the
    Reed- Muller basis is given by

a b ab b
a b ab a
58
(No Transcript)
59
  • As it can be observed, the rows of the transition
    matrices are linearly independent.
  • In general, in the space of Boolean functions,
    all nonsingular matrices of dimension 2 n provide
    the transition matrices for all possible bases.
  • These are the bases of Universal XOR forms (UXF).

60
  • Denition 3 Let be a vector space of n-variable
  • Boolean functions over GF(2). A Universal XOR
  • form (UXF) is a basis in this vector space.
  • If a basis function in a UXF can be realized as a
    product of
  • literals, it is called a monoterm.
  • In general, a term in a UXF is called a uxf-term
    of f.

61
(No Transcript)
62
  • By Theorem 1, there exist 20160/4! 840
    different XOR canonical forms for a 2-variable
    function alone.
  • This number for a 3-variable function is around
    1.326 10 14 .

63
Figure 1 Example of a Complex Maitra Logic Array
64
Figure 2 An Example of a Complex Plane of a CMLA
65
(No Transcript)
66
(No Transcript)
67
(No Transcript)
68
(No Transcript)
69
(No Transcript)
70
Group Properties
71
Buffer
  • Corresponds to shaded parts in (c)

Reversible Energy Recovery Logic
72
Reversible Pipeline Connection
  • A shaded arrow in (b) indicates the direction and
    path of energy charging or discharging
  • F,G, and H are forward functions of each logic
    stage
  • F-1,G-1 and H-1 are their backward functions,
    respectively
  • A solid line in the buffer symbol of c
    indicates the energy flow
  • Clocks connected to the isolation switches are
    not shown explicitly in the symbol

73
Symbol for a buffer chain
74
(No Transcript)
75
Symmetry Analysis by De Vos
  • Logic gates with three input bits and three
    output bits have a privileged position within
    fundamental computer science.
  • They are a sufficient building block for
    constructing arbitrary reversible boolean
    networks and therefore are the key to reversible
    digital computers.
  • Such computers can, in principle, operate without
    heat production.
  • There exist as many as 8! 40,320 different
    3-bit reversible truth tables
  • The question which ones to choose as building
    blocks.
  • Because these gates form a group with respect to
    the operation cascading', we can apply group
    theoretical tools, in order to make such a
    choice.
  • We will study permutations

76
4 Calculation with three bits
  • There exist 88 16,777,216 different truth
    tables with 3 inputs and 3 outputs.
  • Among them, only 8! 40,320 are reversible.
  • However, 48 of these truth tables fall apart into
    three separate 1-bit reversible tables and
    another 288 fall apart into one 1-bit and one
    (true) 2-bit reversible gate.
  • Thus, among the 40,320 reversible 3-bit gates,
    only 39,984 are true 3-bit gates.

77
4 Calculation with three bits
  • Two well-known examples are
  • the Fredkin gate
  • the Feynman's CONTROLLED CONTROLLED NOT gate
  • The truth table of the Feynman gate is given in
    Table 3c.
  • TheFredkin gate is shown in Table 5a.

Feynman Gate
78
4 Calculation with three bits
  • Both have a particular property each is a
    universal primitive.
  • This means that any boolean function of any
    finite number of logic input variables can be
    implemented by combining a finite number of such
    building blocks.
  • The proof consists of two steps
  • one first proves that the building block suffices
    to implement the NAND function (Table 1b),
  • then one refers to the fact that the NAND
    function is a universal primitive.

79
4 Calculation with three bits
  • The latter step is a well-known theorem.
  • The former step is demonstrated by introducing a
    so-called preset we keep one or two inputs fixed
    and look how the three outputs are function of
    the remaining input(s).
  • Among the 39,984 reversible true 3-bit gates,
    many have the universality property.
  • It is clear, however, that the number 39,984 is
    too large to allow manual' inspection.
  • We have to recur to computer-algebra software
    specially dedicated to group theory, such as GAP
    and Magma
  • In the present study, we have chosen the GAP
    approach, because of GAP's built-in commands
    DoubleCoset and DoubleCosets.

80
Table 4 The three generators of the 2-bit
reversible gates (a) EXCHANGER, (b) NOT, (c)
CONTROLLED NOT.
81
Table 5 Truth tables (a) Fredkin's conservative
gate, (b) a pseudo-inverting' gate.
82
5 Groups and subgroups
  • Because of the universality property of some of
    the 3-bit reversible gates, we now continue the w
    3 case in more detail.
  • When a reversible 3-bit gate x is cascaded by a
    reversible 3-bit gate y (i.e. when the P output
    of gate x is connected to the A input of y,
    etc.), then a new reversible 3-bit gate is
    formed, denoted xy.
  • The 40,320 reversible truth tables of width 3
    therefore form a group, say R, which is
    isomorphic to the symmetric group S8 .

83
5 Groups and subgroups
  • The identity element of the group is the 3-bit
    follower (P A, Q B, and R C). In GAP, each
    element of R is denoted by its permutation
    notation.
  • E.g. the follower is denoted (), whereas the
    CONTROLLED CONTROLLED NOT is written (7,8),
    because the seventh and the eighth line of the
    truth table (i.e. Table 3c) are interchanged.

84
Groups E, F and G
  • In order to classify the large number of elements
    of the group R, we will introduce in the
    following paragraphs three different important
    subgroups, namely E with 6 elements, F with 48
    elements, and finally G with 1,344 elements.
  • They are ordered as
  • E lt F lt G lt R
  • where lt denotes is proper subgroup of'.
  • By means of each of these three subgroups, we
    will partition R into double cosets, which will
    serve as equivalence classes in the application
    of Section 6.

85
Table 6 The subgroup generators (a) EXCHANGER,
(b) EXCHANGER, (c) NOT, (d) CONTROLLED NOT. From
top to bottom, four different representations
schematic, set of logic equations, truth table,
and permutation. Gates (a) and (b) together
generate subgroup E Gates (a), (b), and (c)
together generate subgroup F Gates (a), (b),
(c), and (d) together generate subgroup G.
86
e1 and e2
  • An important subgroup of R is formed by the
    follower together with the five elements
    representing mere relabellings.
  • Table 6a shows the example e1, satisfying P A,
    Q C, and R B, i.e. performing an exchange of
    B and C. In permutation notation, gate e1 is
    written (2,3)(6,7).
  • A second example is e2 or (3,5)(4,6). See Table
    6b.

Table 6 The subgroup generators (a) EXCHANGER,
(b) EXCHANGER, (c) NOT, (d) CONTROLLED NOT. From
top to bottom, four different representations
schematic, set of logic equations, truth table,
and permutation. Gates (a) and (b) together
generate subgroup E Gates (a), (b), and (c)
together generate subgroup F Gates (a), (b),
(c), and (d) together generate subgroup G.
87
  • An important subgroup of R is formed by the
    follower together with the five elements
    representing mere relabellings.
  • Table 6a shows the example e1, satisfying P A,
    Q C, and R B, i.e. performing an exchange of
    B and C. In permutation notation, gate e1 is
    written (2,3)(6,7).
  • A second example is e2 or (3,5)(4,6).
  • See Table 6b.

88
Subgroup E of exchangers
  • Together these two elements generate the whole
    subgroup of exchangers.
  • The importance of this subgroup E comes from the
    fact that these gates are trivial to implement in
    any technology.
  • E.g. in electronics, they consist merely of
    cross-overs of metal lines.
  • The subgroup E of exchange gates contains six
    elements.
  • It is denoted G6 (SW) by Rayner and Newman and
    is isomorphic to the symmetric group S3 .

89
Subgroup E of Exchangers
  • Results from GAP show us that E partitions the
    full group R into 1,172 distinct double cosets.
  • A double coset of an element g consists of all
    elements e ge , where both e and e are
    elements of the subgroup E.
  • This means that, although there exist 40,320
    different reversible gates, there are only 1,172
    really different' ones, as soon as we consider
    exchangers as for free'.
  • From these 1,172 gates, all other 39,148 gates
    can be fabricated by merely adding one
    relabelling gate to the left and one to the
    right.

90
Enlarging subgroup E
  • In a next step, we can enlarge the above subgroup
    E, by introducing the inverter or NOT gate.
  • One can either invert A (i.e. realize the gate P
    NOT A, Q B, and R C), or invert B (i.e.
    realize P A, Q NOT B , and R C), or invert
    C (P A, Q B, and R NOT C ).
  • As an example, the cycle notation of the last
    gate is (1,2)(3,4)(5,6)(7,8).
  • These three inverters (denoted i1, i2 , and i3 )
    generate a subgroup of order 23 8, isomorphic to
    Z23, where Z2 is the cyclic group of order 2.

91
EIF
  • Together, the subgroup E of exchangers and the
    subgroup I of inverters generate a new subgroup F
    of order 48, isomorphic to S3Z23, the
    semi-direct product of S3 and Z23.
  • The elements of F are exactly the 48 gates
    mentioned in Section 4, i.e. the 3-bit gates that
    fall apart into three distinct 1-bit gates.

92
52 distinct double cosets
  • Using GAP, we find that the subgroup F partitions
    the full group R into 52 distinct double cosets.
  • This means that, although there exist 40,320
    different reversible gates, there are only 52
    really different' ones, if we consider both
    exchangers and inverters as for free'.
  • From these 52 gates, corresponding to
    representatives of the 52 distinct double cosets
    of F, all other 40,268 gates can be fabricated by
    merely adding one free gate to the left and one
    to the right.

93
The list of all double cosets
  • Table 7 gives a list of all 52 double cosets ki .
    Note that GAP gives them in a specific order,
    which has no a priori meaning for the user. We
    also get a representative ri of class ki.

94
Table 7 The double cosets ki of F in R.
95
  • Again GAP's way of choosing this representative
    is not transparent to the user.
  • The different double cosets constructed with F
    sometimes have different size.
  • Table 7 gives ni , i.e. the number of elements in
    the double coset.
  • At first sight, it may be a surprise that a
    double coset may contain less than 482 2,304
    members.
  • This is caused by the fact that different
    products fgf (with g a member of R and both f
    and f members of F) can lead to equal results.

96
  • It is possible to prove that each double coset
    contains a number of elements which is a multiple
    of 48.
  • Double coset k1 is the subgroup F itself, with
    the follower () as representative r1 .
  • We remark that Feynman's gate (7,8) is the
    representative r4 of class k4 .
  • If we take the elements of subgroup F and add the
    representative ri of ki

97
Table 7 The double cosets ki of F in R.
98
  • If we take the elements of subgroup F and add the
    representative ri of ki , then these 49 elements
    together generate a subgroup.
  • Such a subgroup is called the closure of F and
    ri . Its order we denote by mi in Table 7.
  • From GAP, we learn that
  • Sometimes mi is as large as 40,320, meaning that
    the closure of F and ri is the full group R. In
    other words, any element of R can then be written
    as a finite product of form fri fri f ri
    f..., i.e. a finite cascade of ri gates
    separated by merely exchangers and/or inverters.
    In this case, we call ri universal.
  • Sometimes mi is as small as ni 48, meaning that
    ki together with k1 forms a subgroup. Any product
    of the form fri fri fri f ... then
    generates either an element of k1 or an element
    of ki. The only double cosets with this property
    are k3 and k31 .

99
lattice of all subgroups containing F and
contained in R.
  • In order to get more insight into the 52 double
    cosets in which R is divided, we construct the
    lattice of all subgroups containing F and
    contained in R.
  • This yields a set of partially ordered subgroups
    Figure 2.

Figure 2 The lattice.
100
Figure 2 The lattice.
101
We note ten different subgroups
  • k1 F
  • k1 k3 of order 192 4 x 48
  • k1 k31 of order 192 4 x 48
  • k1 k2 k3 of order 384 8 x 48
  • k1 k31 k40 of order 576 12 x 48
  • k1 k8 k31 k40 k49 of order 1,152 24 x 48
  • k1 k3 k36 k38 of order 1,344 28 x 48
  • k1 k3 k19 k21 k31 k33 k34 of order 1,344 28 x
    48
  • k1 k3 k5 k7 k9 k11 k13 k15 k18 k19 k21 k23 k25
    k28 k31 k33 k34 k36 k38 k40 k41 k43 k45 k46 k48
    k50, forming the subgroup of all even
    permutations and thus isomorphic to the
    alternating group A8 of order 20,160
  • the whole group R of order 40,320 itself.

102
  • Some of these subgroups have a particular
    interpretation.
  • E.g. the subgroup k1 k8 k31 k40 k49 is the
    closure of subgroup F and the subgroup of the 36
    conservative gates.
  • A conservative gate is a gate where the output
    (P, Q, R) always has the same number of 1's as
    the input (A, B, C).
  • Fredkin's gate (2,3) is an example.

103
  • The subgroup k1 k2 k3 is the closure of F and the
    subgroup of the 16 pseudo-inverting gates.
  • We call a pseudo-inverting' gate a gate where
    the output (P, Q, R) always is equal to either
    the input (A, B, C) or to (NOT A, NOT B, NOT C).
  • Its permutation notation consists merely of
    transpositions (i,9-i).
  • Table 5b shows an example. The meaning of the
    subgroup k1 k3 k19 k21 k31 k33 k34 will become
    clear below.

104
  • In a final step, we can enlarge subgroup F, by
    adding a Feynman CONTROLLED NOT.
  • See Table 6d.
  • The resulting G is a subgroup of R and a
    supergroup of F.
  • It is isomorphic to 23L3 (2), the semi-direct
    product of 23, i.e. the additive group of all
    binary vectors of length 3, and L3(2), i.e. the
    multiplicative group of all non-singular binary 3
    x 3 matrices.

105
  • In detail, G is isomorphic to the multiplicative
    group of all non-singular binary 4 x 4 matrices
    of the form
  • with det(A) 0 and with a1, a2, a3 0,1.
  • It is of order 1,344 and divides the full group R
    into four double cosets see Table 8.
  • Double coset K1 is the subgroup G itself,
    represented by representative R1 ().

106
Table 8 The double cosets Ki of G in R.
107
  • It is identical to the subgroup k1 k3 k19 k21 k31
    k33 k34 encountered above.
  • It contains
  • all 48 reversible 3-bit gates that fall apart
    into three distinct 1-bit gates,
  • all 288 reversible 3-bit gates that fall apart
    into one 1-bit gate and one true 2-bit gate,
  • another 1,008 gates, which are true 3-bit gates,
    but can be constructed by cascading two (or more)
    non-true 3-bit gates.

108
6 Application
  • Which of R's three subgroups (E, F, or G) is of
    importance, depends on the technological
    circumstances.
  • In almost each technology the exchangers are
    for free'.
  • E.g. in electronics, they are implemented by a
    mere metal cross-over.
  • In the special case of dual-line electronics,
    each logic variable is represented by two metal
    lines of opposite logic value, e.g. A together
    with NOT A.

109
6 Application
  • Therefore, in dual-rail electronics, also an
    inverter is free of charge' we only need a
    metal cross-over to exchange A with NOT A.
  • In this technology, the CONTROLLED NOT is not
    for free. Thus we are in the case of the free
    subgroup F.

110
6 Application
  • The following question arises 13.
  • We like to be able to synthetize any arbitrary
    member of R with the help of a limited number of
    generators.
  • In electronics, we say we like to implement any
    arbitrary reversible 3-bit gate with the help of
    a library with a limited number of standard
    cells.
  • If we denote by s1, s2, ..., sm the different
    members of the library, then an arbitrary member
    r of R must satisfy
  • r f sfsf...s (n) f (n1), (1)
  • where s, s, ..., s (n) are elements of the
    library s1, s2 , ..., sm and f, f, f ,
    ..., f(n1) are elements of F f1, f2, ...,
    f48
  • The number n is called the logic depth' of the
    implementation. In order to minimize the number
    of standard cells, we have to choose them from
    different double cosets and not from double coset
    k1 .
  • Thus the library will be a subset of the
    representatives r2, r3, ..., r52.
  • From Table 7, it follows that a library with a
    single building block is sufficient each of the
    representatives r4, r6, r10, r12, r14, r16, r17,
    r20, r22, r24, r26, r27, r29, r30, r 32, r35,
    r37, r39, r42, r44, r47, r51 and r52 have mi
    40,320 and are thus sufficient to generate the
    whole group R.

111
6 Application
  • But, these 23 solutions are not equivalent.
    Indeed, we not only want to limit the number p of
    different building blocks in the library, we also
    like to limit the number of times we have to use
    the blocks, i.e. we like to minimize the depth n
    of the products (1).
  • It turns out that with building block r14 all
    elements of R can be synthetized with n 4.
  • The elements of k1 need no r14 block (i.e. n
    0) the elements of k14 need only one r14 block
    (i.e. n 1) most elements of R need n 2 or n
    3 only the elements of k34 need four cascaded
    r14 blocks (n 4) on the average an arbitrary
    class of R needs 32/13 2.46 building blocks in
    cascade. Exactly the same results apply to the
    building block r44 and to r47 .
  • Table 9 compares these three optimal choices to
    the other twenty, i.e. less efficient, choices.
  • In particular, we see in the 18 th line that
    Feynman's gate r4 needs 0 n 6 (with expectation
    value 97/26 3.73) in order to generate all R.

112
  • None of the double cosets k14, k44 , and k47
    appears in the lattice of Figure 2, except at the
    top, in the parent group R itself.
  • Indeed, any element of any subgroup of Figure 2
    can only generate other elements of that
    particular subgroup.

113
  • The underlying reason is clear such elements
    show too much symmetry'.
  • E.g. conservative gates (such as Fredkin's gate)
    can, by cascading, only generate elements of k1
    k8 k31 k40 k49 , such that no finite depth n can
    generate the other elements of R.
  • Finally, it is remarkable that the optimum double
    cosets k14, k44, and k47 are among the largest
    double cosets of Table 7 n14 n44 n47
    2,304.

114
  • If we consider depth n 4 as too deep a cascade
    (too much silicon surface area), we can construct
    a larger library.
  • If we choose an p 2 library, there are four
    equivalent optimal combinations
  • r14 together with r18,
  • r14 together with r41,
  • r44 together with r48 , and
  • r44 together with r50 .
  • Now we have n 3, with expectation value 101/52
    1.94.
  • Enlarging the library to p 3 yields n 2 and
    average cascade depth 99/52 1.90.

115
Table 9. Cascade Depth n
116
7 Conclusion
  • The reversible gates of width w form a group,
    isomorphic to the symmetric group S2w .
  • Group theory in general, and double cosets in
    particular, are well suited to detect different
    classes within the (2w )! elements of the group.
  • This can lead to an optimized choice of a set of
    generators.
  • In electronics, this means an optimal set of
    hardware building blocks.
  • With the help of GAP, we identified optimal gates
    g that are able to generate all other elements of
    R by means of a product of the form
  • f gfgf ...
  • of minimal length.

117
References
  • 1 Bennett, C. "Logical reversibility of
    computation" I.B.M. Journal of Research and
    Development 17 (1973), 525-532.
  • 2 Bennett, C., Landauer, R. "The fundamental
    physical limits of computation" Scientific
    American 253 (July 1985), 38-46.
  • 3 Bosma, W., Cannon, J., Playoust, C. "The
    Magma Algebra System I the user language"
    Journal of Symbolic Computation 3-4 (1997),
    235-265.
  • 4 Conway, J.H., Curtis, R.T., Norton, S.P.,
    Parker, R.A., Wilson, R.A. "Atlas of finite
    groups" Oxford University Press, New York
    (1985), p. 22.
  • 5 De Vos, A. "Introduction to r-MOS systems"
    Proc. 4 th Workshop on Physics and Computation,
    Boston (1996), 92-96.
  • 6 De Vos, A. "Towards reversible digital
    computers" Proc. European Conference on Circuit
    Theory and Design, Budapest (1997), 923-931.
  • 7 De Vos, A. "Reversible computing" Progress
    in Quantum Electronics 23 (1999), 1-49.
  • 8 Feynman, R. "Quantum mechanical computers"
    Optics News 11 (1985), 11-20.
  • 9 Feynman, R. "Feynman lectures on
    computation" (A. Hey and R. Allen, eds)
    Addison-Wesley, Reading (1996).

118
References
  • 10 Fredkin, E., Toffoli, T. "Conservative
    logic" International Journal of Theoretical
    Physics 21 (1982), 219-253.
  • 11 http//www.can.nl/SystemsOverview/Special/Gro
    upTheory/GAP/index.html
  • 12 http//www.maths.usyd.edu.au8000/u/magma/ind
    ex.html
  • 13 Jacobs, G. "Algebra der reversibele
    logische schakelingen" M.Sc. thesis,
    Universiteit Gent, Gent (1998).
  • 14 Keyes, R., Landauer, R. "Minimal energy
    dissipation in logic" I.B.M. Journal of Research
    and Development 14 (1970), 153-157.
  • 15 Landauer, R. "Irreversibility and heat
    generation in the computational process" I.B.M.
    Journal of Research and Development 5 (1961),
    183-191.
  • 16 Rayner, M., Newman, D. "On the symmetry of
    logic" Journal of Physics A Mathematical and
    General 28 (1995), 5623-5631.
  • 17 Schönert, M. "GAP" Computer Algebra
    Nederland Nieuwsbrief 9 (1992), 19-28.
  • 18 Stix, G. "Riding the back of electrons"
    Scientific American 279 (September 1998), 20-21.
  • 19 Toffoli, T. "Reversible computing" in
    "Automata, languages and programming" (J. De
    Bakker and J. Van Leeuwen, eds) Springer, New
    York (1980), pp. 632-

119
Reversible gates for universal computation.
  • Fredkin Gate

AACB BB_at_AC CC
Write a Comment
User Comments (0)
About PowerShow.com