Principles of Adiabatic Processes

About This Presentation

Title:

Principles of Adiabatic Processes

Description:

Scattering of ballistic electrons from lattice imperfections, causing Ohmic resistance ... diodes in the charge-return path. Forgetting to obey one of the ... – PowerPoint PPT presentation

Number of Views:1096

Avg rating:3.0/5.0

Slides: 128

Provided by: Jam123

Learn more at: https://www.cise.ufl.edu

Category:

more less

Transcript and Presenter's Notes

Title: Principles of Adiabatic Processes

1
Principles of Adiabatic Processes
2
Adiabatic Processes - overview

Adiabatic steps in the reversible Carnot cycle
Evolution of the meaning of adiabatic
Time-proportional reversibility (TPR) of
quasi-adiabatic processes
Adiabatic theorem of quantum mechanics
Adiabatic transitions of a two-state system
Logic memory in irreversible and adiabatic
processes.

3
The Carnot Cycle

In 1822-24, Sadi Carnot analyzed the efficiency
of an ideal heat engine all of whose steps were
reversible, and furthermore proved that
Any reversible engine (regardless of details)
would have the same efficiency (TH?TL)/TH.
No engine could have greater efficiency than a
reversible engine w/o producing work from nothing
Temperature itself could be defined on a
thermodynamic scale based on heat recoverable by
a reversible engine operating between TH and TL

4
Steps of Carnot Cycle
P

Isothermal expansion at TH
Adiabatic (without flow ofheat) expansion TH?TL
Isothermal compression at TL
Adiabatic compression TL?TH

TH
TL
V
Adia-batic
Iso- thermal
Iso- thermal
Adia-batic
Reser-voir
Reser-voir
Reser-voir
Reser-voir
5
Carnot Cycle Terminology

Adiabatic (Latin) literally Without flow of
heat
I.e., no entropy enters or leaves the system
Isothermal At the same temperature
Temperature of system remains constant as entropy
enters or leaves.
Both kinds of steps, in the case of the Carnot
cycle, are examples of isentropic processes
at the same entropy
I.e., no (known) information is transformed into
entropy in either process
But, the usage of the word adiabatic in applied
physics has mutated to essentially mean
isentropic.

6
Old and New Adiabatic

Consider a closed system where you just lose
track of its detailed evolution
Its adiabatic (no net heat flow),
But its not adiabatic (not isentropic)
Consider a box containing some heat,flying
ballistically out of the system
Its not adiabatic, (no heat flow)
because heat is flowing out of the system
But its adiabatic (no entropy is generated)

The System
Box o Heat
7
Justifying the Modern Usage

In an adiabatic process following a desired
trajectory through configuration space,
No heat flows in or out of the subsystem
consisting of those particular degrees of freedom
whose variation carries out the motion along the
desired trajectory.
E.g., the computational degrees of freedom in a
computational process.
No heat flow ? no entropy flow
Heat is just energy whose configuration info. is
entropy
No entropy flow ? no sustained entropy generation
Since bounded systems have a maximum entropy

8
Quasi-Adiabatic Processes

Complete adiabaticity means absolutely zero rate
of entropy generation
Requires infinite degree of isolation of system
from uncontrolled external environment!
? Impossible to completely achieve in practice.
Real processes are only adiabatic to the extent
that their entropy generation approaches zero.
Term quasi-adiabatic emphasizes imperfection
Asymptotically adiabatic designs conceptually
approach 0 in the limit of variation of specified
technology design parameter(s)
E.g., low device frequency, large device size

9
Quasi-Adiabatic Processes

No real process is completely adiabatic
Because some outside system may always have
enough energy to interact with disturb your
systems evolution - e.g., cosmic ray, asteroid
? Evolution of system state is never perfectly
known
Unless you know the exact quantum state of the
whole universe
Entropy of your system always increases.
Unless it is already at a maximum (at
equilibrium)
Cant really be at complete equilibrium with its
surroundings
unless whole universe is at utterly stable heat
death state.
Systems at equilibrium are sometimes called
static.
Non-equilibrium, quasi-adiabatic processes are
sometimes also called quasi-static
Changing, but near a local equilibrium otherwise

10
Quantifying Adiabaticity

An appropriate metric for quantifying the degree
of adiabaticity of any process is just to use the
quality factor Q of that process.
Q isnt just for oscillatory processes any more
Q is generally the ratio Etrans / Ediss between
the
Energy Etrans involved in carrying out a process
transitioning between states along a trajectory
Amount Ediss of energy dissipated during the
process.
Normally also matches the following ratios
Physical information content / entropy generated
Quantum computation rate / decoherence rate
Decoherence time / quantum-transition time

11
Degree of Reversibility

The degree of reversibility (a.k.a.
reversibility, a.k.a. thermodynamic efficiency)
of any quasi-adiabatic process is defined as the
ratio of
the total free energy at the start of the process
? by the total energy spent in the process
Or, equivalently
the known, accessible information at the start
? by the amount that is converted to entropy
This same quantity is referred to as the
(per-cycle) quality factor Q for any resonant
element (e.g., LC oscillator) in EE.

12
The Adiabatic Principle

Claim Any ideal quasi-adiabatic process
performed over time t has a thermodynamic
efficiency that is proportional to t,
in the limit as t?0.
We call processes that realize this idealization
time-proportionally reversible (TPR) processes.
Note that the total energy spent (?Espent), and
the total entropy generated (?S), are both
inversely proportional to t in any TPR process.
The slower the process, the more energy-efficient.

13
Proving the Adiabatic Principle

(See RevComp memo M14)
Assume free energy is in generalized kinetic
energy of motion Ek of system through its
configuration space. Ek ½mv2 ? v2 (?/t)2 ?
t?2 for m, ? const.
Assume that every tf time, on average (mean free
time), a constant fraction f of Ek is thermalized
(turned into heat)
Whole process thermalizes energy f(t/tf)Ek ?
t?t?2 t?1. Constant in front is ½ fm?2/tf ?
??2, where ?½fm/tf is the effective viscosity.

14
Example Electrical Resistance

We know PspentI2R(Q/t)2R, or Espent Pt
Q2R/t. ? Note scaling with 1/t
Charge transfer through a resistor obeys the
adiabatic principle!
Why is this so, microscopically?
In most situations, conduction electrons have a
large Fermi velocity or thermal velocity relative
to drift velocity.
? Scatter off of lattice-atom cross-sections with
a mean free time tf that is fairly independent of
drift velocity
Each scattering event thermalizes the electrons
drift kinetic energy - a frac. f of currents
total Ek
Therefore assumptions in prev. proof apply!

15
The Adiabatic Theorem

A result in basic quantum theory
Proved in many quantum mechanics textbooks
Paraphrased A system initially in its ground
state (or more generally, its nth energy
eigenstate) will, after subjecting it to a
sufficiently slow change of applied forces,
remain in the corresponding state, with high
probability.
Result has been recently shown to be very
general.
Amount of leakage out of desired state is
proportional to speed of transition, at low
speeds.
?Quantum systems obey the adiabatic principle!

16
Some Loss-Inducing Interactions

For ordinary voltage-coded electronics
Interactions whose dissipation scales with speed
Parasitic EM emission from dynamic (C,L)
reactances
Scattering of ballistic electrons from lattice
imperfections, causing Ohmic resistance
Interactions having different scaling laws
Interference from outside EM sources
Thermally-activated leakage of electrons over
potential energy barriers
Quantum tunneling of electrons through narrow
barriers (sub-Fermi wavelength)
Losses due to intentional treatment of known
physical information as entropy (bit erasure)

17
Some Ways to Reduce Losses

EM interference / emission Add shielding, use
high-Q MEMS/NEMS oscillators
Scattering/resistance Ballistic FETs,
superconductors
Thermal leakage avoid low VT and/or high temps
Tunneling thick tunnel barriers, high-?
dielectrics, conductors w. low Fermi-level/high
electron affinity, vacuum-gap barriers?
Intentional bit erasure reduce voltages, use
mostly-reversible adiabatic logic designs

18
Adiabatic Circuits Reversible Computing

Myths, Controversies Misconceptions

19
Some Claims Against Reversible Computing Eventual Resolution of Claim
John von Neumann, 1949 Offhandedly remarks during a lecture that computing requires kT ln 2 dissipation per elementary act of decision (bit-operation). No proof provided. Twelve years later, Rolf Landauer of IBM tries valiantly to prove it, but succeeds only for logically irreversible operations.
Rolf Landauer, 1961 Proposes that the logically irreversible operations that can be seen to necessarily cause dissipation are irreducible. Landauers argument for irreducibility of logically irreversible operations was conclusively refuted by Bennetts 1973 paper (partially presaged by Lecerf).
Bennetts 1973 construction is criticized for using too much memory. Bennett devises a more space-efficient version of the algorithm in 1989.
Bennetts models criticized by various parties for depending on random Brownian motion, and not making steady forward progress. Fredkin and Toffoli at MIT, 1980, provide ballistic billiard ball model of reversible computing that makes steady progress.
Various parties including Zurek note that Fredkins original classical-mechanical billiard-ball model is chaotically unstable. Zurek, 1984, shows that quantum models can avoid the chaotic instabilities. (Though there are workable classical ways to fix the problem also.)
Various parties propose that classical reversible logic principles wont work at the nanoscale, for unspecified or vaguely-stated reasons. Drexler, 1980s, designs various mechanical nanoscale reversible logics and carefully analyzes their energy dissipation.
Carver Mead, CalTech, 1980 Attempts to show that the kT bound is unavoidable in electronic devices, via a collection of counter-examples. No general proof provided. Later he asked Feynman about the issue in 1985 Feynman provided a quantum-mechanical model of reversible computing.
Various parties point out that Feynmans model of reversible computing only supports serial computation. Margolus at MIT, 1990, demonstrates a parallel quantum model of reversible computingbut only with 1 dimension of parallelism.
People question whether the various theoretical models can be validated with a working electronic implementation. Seitz and colleagues at CalTech, 1985, demonstrate working energy recovery circuits using adiabatic switching principles.
Seitz, 1985Has some working circuits, but unsure if arbitrary logic is possible. Koller Athas, Hall, and Merkle (1992) separately devise general reversible combinational logics.
Koller Athas, 1992 Conjecture reversible sequential feedback logic impossible. Younis Knight _at_MIT do reversible sequential, pipelineable circuits in 1993-94.
Some computer architects (including anonymous ISCA reviewers) wonder whether the constraint of reversible logic leads to unreasonable design convolutions. Vieri, Frank and coworkers at MIT, 1995-99, refute these qualms by demonstrating straightforward designs for fully-reversible and scalable gate arrays, microprocessors, and instruction sets.
Some computer science theorists suggest that the algorithmic overheads of reversible computing might outweigh their practical benefits. Frank, 1997-2003, publishes a variety of rigorous theoretical analysis refuting these claims for the most general classes of applications.
Various parties point out that high-quality power supplies for adiabatic circuits seem difficult to build electronically. Frank, 2000, suggests microscale/nanoscale electromechanical resonators for high-quality energy recovery with desired waveform shape and frequency.
Frank, 2002Briefly wonders if synchronization of parallel reversible computation in 3 dimensions (not covered by Margolus) might not be possible. Later that year, Frank devises a simple mechanical model showing that parallel reversible systems can indeed be synchronized locally in 3 dimensions.
20
Adiabatic Circuits and Reversible Computing

Commonly Encountered Myths, Fallacies, and
Pitfalls
(in the Hennessy-Patterson tradition)

21
Myths about Adiabatic Circuits Reversible
Computing

Someone proved that computing with ltltkT
free-energy loss per bit-operation is
impossible.
Physics isnt reversible.
An energy-efficient adiabatic clock/power supply
is impossible to build.

True adiabaticity doesnt require reversible
logic.
Sequential logic cant be done adiabatically.
Adiabatic circuits require many clock/power
rails and/or voltage levels.
Adiabatic design is necessarily difficult.

22
Fallacies about Adiabatic Circuits and Reversible
Computing

Since speed scales with energy dissipation in
adiabatic circuits, they arent good for
high-performance computing.
If I tried and failed to invent an efficient
adiabatic logic, it must be impossible.

The algorithmic overheads of reversible
computing mean it can never be cost-effective.
Since leakage gets worse in nanoscale devices,
adiabatics is doomed.

23
Pitfalls in Adiabatic Circuits and Reversible
Computing

Using diodes in the charge-return path.
Forgetting to obey one of the transistor rules.
Using traditional models of computational
complexity.
Restricting oneself to an asymptotically
inefficient design style.

Assuming that the best reversible and
irreversible algorithms are similar.
Failing to optimize the degree of reversibility
of a design.
Ignoring charge leakage in low-power/adiabatic
design.

24
Adiabatic/Reversible Computing

Basic Models and Concepts

25
Bistable Potential-Energy Wells

Consider any system having an adjustable,
bistable potential energy surface (PES) in its
configuration space.
The two stable states form a natural bit.
One state represents 0, the other 1.
Consider now the P.E. well havingtwo adjustable
parameters
(1) Height of the potential energy
barrierrelative to the well bottom
(2) Relative height of the left and rightstates
in the well (bias)

(Landauer 61)
0
1
26
Possible Parameter Settings

We will distinguish six qualitatively different
settings of the well parameters, as follows

BarrierHeight
Direction of Bias Force
27
One Mechanical Implementation
Stateknob
Rightwardbias
Barrierwedge
Leftwardbias
spring
spring
Barrier up
Barrier down
28
Possible Adiabatic Transitions

Catalog of all the possible transitions in these
wells, adiabatic not...

(Ignoring superposition states.)
1states
1
1
1
leak
0
0states
0
leak
0
BarrierHeight
N
1
0
Direction of Bias Force
29
Ordinary Irreversible Logics

Principle of operation Lower a barrier, or not,
based on input. Series/parallel combinations of
barriers do logic.
Major dissipation
in at least one of the possible transitions.

1
Input changes, barrier lowered
0

Amplifies input signals.

Example Ordinary CMOS logics
Outputirreversiblychanged to 0
0
30
Ordinary Irreversible Memory

Lower a barrier, dissipating stored information.
Apply an input bias. Raise the barrier to latch
the new informationinto place. Remove
inputbias.

Retractinput
1
1
Dissipationhere can bemade as low as kT ln 2
Retractinput
Barrierup
0
0
Barrier up
(3)
(1)
Input1
Input0
ExampleordinaryDRAM
N
1
0
(2)
(2)
31
Input-Bias Clocked-Barrier Logic

Cycle of operation
(1) Data input applies bias
Add forces to do logic
(2) Clock signal raises barrier
(3) Data input bias removed

Can amplify/restore input signalin the
barrier-raising step.
(3)
1
1
(4)
Can reset latch reversibly (4) given copy
ofcontents.
(3)
0
0
(2)
(4)
(4)
(2)
(4)
Examples AdiabaticQDCA, SCRL latch, Rod logic
latch, PQ logic,Buckled logic
(1)
(1)
N
1
0
(4)
(4)
32
Input-Barrier, Clocked-Bias Retractile

Barrier signal amplified.
Must reset output prior to changing input.
Combinational logic only!

Cycle of operation
(1) Inputs raise or lower barriers
Do logic w. series/parallel barriers
Clock applies bias force, which changes state, or
not

0
0
0
(1) Input barrier height
ExamplesHalls logic,SCRL gates,Rod logic
interlocks
N
1
0
(2) Clocked force applied ?
33
Input-Barrier, Clocked-Bias Latching

? Cycle of operation
Input conditionally lowers barrier
Do logic w. series/parallel barriers
Clock applies bias force conditional bit flip
Input removed, raising the barrier locking in
the state-change
Clockbias canretract

1
(4)
(4)
0
0
0
(2)
(2)
(3)
(1)
Examples Mikes4-cycle 2-level adiabaticCMOS
logic (2LAL)
(2)
(2)
N
1
0
34
Full Classical-Mechanical Model

Claim The following components are sufficient
for a complete, scalable, parallel, pipelinable,
linear-time, stable, classical reversible
computing system
(a) Ballistically rotating flywheel driving
linear motion.
(b) Scalable mesh to synchronize local flywheel
phases in 3-D.
(c) Sinusoidal to flat-topped waveform shape
converter.
(d) Non-amplifying signal inverter (NOT gate).
(e) Non-amplifying OR/AND gate.
(f) Signal amplifier/latch.

Sleeve
(a)
(c)
(b)
(f)
(d)
Primary drawback Slow propagationspeed of
mechanical (phonon) signals.
(e)
cf. Drexler 92
35
Adiabatic electronics CMOS implementations
36
Conventional Gates are Irreversible

Logic gate behavior (on receiving new input)
Many-to-one transformation of local state!
Required to dissipate bT, by Landauer principle
Incurs ½CV2 dissipation in 2 out of 4 cases.

Transformation of local state
Example Static CMOS Inverter
in
out
37
(No Transcript)
38
(No Transcript)
39
Exact formulafor frequency reduction f ? RC/t
40
(No Transcript)
41
Common Mistakes to Avoid

In Adiabatic Design

42
Common Mistakes to Avoid

Dont use diodes in charge-return path!
The built-in voltage drop kills adiabaticity.
Dont disobey adiabatic transistor rules by
either
Turning on transistor with voltage across it
Turning off transistor with current thru it!
This one is often neglected!
Use mostly-reversible logic!
Optimize degree of reversibility for application
Dont over-constrain the design family!
Asymptotically efficient circuits should be
possible

43
Adiabatic Rules for Transistors

Rule 1 Never turn on a transistor if it has a
nonzero voltage across it!
I.e., between its source drain terminals.
Why This erases info. causes ½CV2 disspation.
Rule 2 Never apply a nonzero voltage across a
transistor even during any on?off transition!
Why When partially turned on, the transistor has
relatively low R, gets high PV2/R dissipation.
Corollary Never turn off a transistor if it has
a nonzero current going through it!
Why As R gradually increases, the VIR voltage
drop will build, and then rule 2 will be violated.

44
Adiabatic Rules, continued

Transistor Rule 3 Never suddenly change the
voltage applied across any on transistor.
Why So transition will be more reversible
dissipation will approach CV2(RC/t), not ½CV2.
Adiabatic rules for other components
Diodes Dont use them at all!
There is always a built-in voltage drop across
them!
Resistors Avoid moderate network resistances, if
poss.
e.g. stay away from range gt10 k? and lt1 M?
Capacitors Minimize, reliability permitting.
Note Dissipation scales with C2!

45
Transistor Rules Summarized
Legal adiabatic transitions in green. (For n- or
p-FETs.)Dissipative states and transitions in
red.
off
high
low
off
off
high
high
low
low
off
high
low
on
on
high
low
high
low
on
on
low
low
high
high
46
SCRL Split-level Charge Recovery Logic

The First Pipelined Fully-Adiabatic CMOS
Logic(Younis Knight, MIT, 94)

47
?
Transformation of local state
48
Input-Barrier, Clocked-Bias Retractile
Must reset outputprior to input.
Combinational logiconly!

Cycle of operation
Inputs raise or lower barriers
Do logic w. series/parallel barriers
Clock applies bias force which changes state, or
not

0
0
0
ExamplesHalls logic,SCRL gates,Rod logic
interlocks
Input barrier height
N
1
0
Clocked force applied ?
49
(No Transcript)
50
Retractile Logic w. SCRL gates

Simple combinational logic of any depth N
Requires N timing phases
Non-pipelined
No sequential reuse ofHW (even worse)
We needsequentiallogic!

Time ?
51
Sequential Retractile Logic

Approach 1 (Hall 92)
After every N stages, invoke an irreversible
latch
stores the output of the last stage
Then, retract all the stages,
and begin a new cycle
Problems
Reduces dissipation by at most a factor of N
Also reduces HW efficiency by order N!
In worst case, compared to a pipelined,
sequential circuit
Approach 2 (Knight Younis, 93)
The store output stage can also be reversible!
Gives fully-adiabatic, sequential, pipelined
circuits!
N can be as small 1 or 2 still have arbitrarily
high Q

52
Simple Reversible CMOS Latch

Uses a standard CMOS transmission gate
Sequence of operation
(1) input initially matches latch contents
(output)
(2) input changes?output changes (3) latch
closes (4) input removed

Before Input Inputinput arrived removedin out
in out in outa a a a a a b b a b
P
in
out
53
Resetting a Reversible Latch

Can reversibly unlatch data as follows (exactly
the reverse of the latching process)
(1) Data value d stored on memory node M.
(2) Present an exact copy of d on input.
(3) Open the latch (connecting input to M).
No dissipation since voltage levels match
(4) Retract the copy of d from the input.
Retracts copy stored in latch also.

54
Input-Bias Clocked-Barrier Logic

Cycle of operation
Data input applies bias
Add forces to do logic
Clock signal raises barrier
Data input bias removed

Can amplify/restore input signalin clocking step.
Retractinput
1
1
Retractinput
Clockbarrierup
Can reset latch reversibly givencopy of
contents.
0
0
Clock up
Input1
Input0
Examples AdiabaticQDCA, SCRL latch, Rod logic
latch, PQ logic,Buckled logic
N
1
0
55
(No Transcript)
56
(No Transcript)
57
SCRL 6-tick clock cycle
Initial state All gates off, all nodes neutral.
in
out
58
SCRL 6-tick clock cycle
Tick 1 Input goes valid, forward T-gate opens.
in
out
59
SCRL 6-tick clock cycle
Tick 2 Forward gate charges, output goes
valid.(Tick 1 of subsequent gate.)
in
out
60
SCRL 6-tick clock cycle
Tick 3 Forward T-gate closes, reverse gate
charges.
in
out
61
SCRL 6-tick clock cycle
Tick 4 Reverse T-gate opens, forward gate
discharges.
in
out
62
SCRL 6-tick clock cycle
Tick 5 Reverse gate discharges, input goes
neutral.
in
out
63
SCRL 6-tick clock cycle
Tick 6 Reverse T-gate closes, output goes
neutral.Ready for next input!
in
out
64
24 ticks/cyclein this version-includes
2-levelretractile stages
65
(No Transcript)
66
(No Transcript)
67
Some Timing Terminology

For sequential adiabatic circuits
1 Tick Time for a single ramp transition
adiabatic speed fraction f times the RC gate
delay.
1 Phase Latency for a data value to propagate
forward by 1 pipeline stage.
1 Cycle Minimum period for all timing
information to return back to its initial state.
Diadic Two retractile levels per gate
permits inverting or non-inverting logic.
Dual rail Two wires per logic value
permits universal logic with monodic gates

Monadiconly 1 level
68
Some Figures of Demerit

Some quantities we may wish to minimize
Ticks/phase
proportional to logic propagation latency
Ticks/cycle
reciprocal to rate of data throughput
Transistor-ticks/cycle
reciprocal to HW cost-efficiency
Number of required clock/power input signals
supplying these may be a significant component of
system cost
Number of distinct voltage levels required
may affect reliability/power tradeoff

69
Some Interesting Questions

About pipelined, sequential, fully-adiabatic CMOS
logic
Q Does it require an intermediate voltage level?
A No, you can get by with only 2 different
levels.
Q What is the minimum number of externally
provided timing signals you can get away with?
A ?4 (?12 if split levels are used)
Q Can the order-N different timing signals
needed for long retractile cascades be internally
generated within an adiabatic circuit?
A Yes, but not statically, unless N2 hardware is
used
where N is the number of stages per full
sequential cycle
We now demonstrate these answers.

70
Some Timing Examples

See next slide for some detailed timing diagrams.
N-level retractile cascades
2N ticks/phase 1 phase/cycle 2N ticks/cycle
3-phase fully-static diadic SCRL
8 ticks/phase 3 phases/cycle 24 ticks/cycle
2-phase fully-static monadic SCRL
5 ticks/phase 2 phases/cycle 10 ticks/cycle
2-phase fully-static diadic SCRL
6 ticks/phase 2 phases/cycle 12 ticks/cycle
6 tick/cycle dynamic SCRL detailed previously
1 tick/phase 6 phases/cycle 6 ticks/cycle

71
Some SCRL timing diagrams
72
Reversible / Adiabatic Chips Designed _at_ MIT,
1996-1999
By the author and other then-students in the MIT
Reversible Computing group,under AI/LCS lab
members Tom Knight and Norm Margolus.
73
2LAL 2-Level Adiabatic Logic

A Novel Alternative to SCRL

74
2LAL 2-level Adiabatic Logic
(Implementable using ordinary CMOS transistors)
P
P

Use simplified T-gate symbol
Basic buffer element
cross-coupled T-gates
Only 4 timing signals,4 ticks per cycle
?i rises during tick i
?i falls during tick (i2) mod 4

?
?1
Tick
in
0 1 2 3
?0
?1
out
?2
?0
?3
75
2LAL Cycle of Operation
Tick 0
Tick 1
Tick 2
Tick 3
?1?1
in?1
in?0
?1?0
out?1
in
?0?1
?0?0
?1?1
in0
out?0
out0
?0?1
?0?0
76
2LAL Shift Register Structure

1-tick delay per logic stage
Logic pulse timing propagation

?1
?2
?3
?0
in
out
?0
?1
?2
?3
0 1 2 3 ...
0 1 2 3 ...
in
in
77
More complex logic functions

Non-inverting Boolean functions
For inverting functions, must use quad-rail logic
encoding
To invert, justswap the rails!
Zero-transistorinverters.

?
?
A
B
A
A
B
A?B
AB
A 0
A 1
A0
A0
A1
A1
78
Hardware Efficiency issues

Hardware efficiency How many logic operations
per unit hardware per unit time?
Hardware spacetime complexity How much hardware
for how much time per logic op?
Were interested in minimizing( of
transistors) ( of ticks) / (gate cycle)
SCRL inverter, w. return path
(8 transistors) ? (6 ticks) 48 transistor-ticks
Quad-rail 2LAL buffer stage
(16 transistors) ? (4 ticks) 64 transistor-ticks

79
More SCRL vs. 2LAL

SCRL reversible NAND, w. all inverters
(23 transistors) ? (6 ticks) 138 T-ticks
Quad-rail 2LAL AND
(48 transistors) ? (4 ticks) 192 T-ticks
Result of comparison Although 2LAL minimizes
of rails, and ticks/cycle, it does not minimize
overall spacetime complexity.
The question of whether 6-tick SCRL minimizes
per-op spacetime complexity among pipelined
adiabatic CMOS logics is still open.

80
Minimizing Power-Clock Signals

How many external clock signals required?
N-level-deep retractile cascade logic
2N waveforms 1 phase 2N signals
6 tick/cycle, 6-phase dynamic SCRL
6 waveforms 6 phases 36 signals
24 tick/cycle, 3-phase static SCRL
12 waveforms 3 phases 36 signals
4 tick/cycle, 2LAL
1 waveform 4 phases 4 signals!
It turns out that 12 signals are sufficient to
implement any combination of 2-level or 3-level
logics (including retractile) on-chip!

81
How to Do It

Circular 2LAL shifter pulse-gated clocks

Tick
0 1 2 3
P0
P1
P2
P3
P0
P1
in
0
P2
P3
out
?0
P0
P1
P2
P3
?2
?1
?2
?2
?3
2
82
12-rail system pros cons

Pros
Completely solves adiabatic timing design problem
Enables mixtures of retractile, SCRL, and other
logic styles on 1 chip
Enables simple fully-adiabatic SRAM DRAM
Cons
Timing signals are dynamic
Known fully-static alternatives use order N2
gates and signals for N-tick-long cycles
N can be large in a chip that includes deep
retractile networks
Energy waste in driving the source/drain junction
capacitances of all the T-gates even when timing
pulse isnt present
SOI reduces these parasitics

83
GCAL General CMOS Adiabatic Logic

A general CMOS adiabatic design methodology
Currently under development at UF
Combines best features of SCRL, 2LAL, and
retractile logics
Permits designs attaining asymptotically optimal
cost-efficiency
For any combination of time, space, spacetime,
energy costs
Arbitrarily high degree of reversibility
Permits using minimal 2-level and 3-level
adiabatic gates
Requires only 4 externally supplied clock/power
signals for 2-level logic
And only 12 total for mixed 2-level 3-level
logic
Supports mixtures of fully-pipelined and
retractile logic.
Supports quiescent dynamic/static latches RAM
cells
Tools currently under development
A new HDL specialized for describing adiabatic
designs
Digital circuit simulator with adiabaticity
checker
Adiabatic logic synthesis tool, with automatic
legacy design converter

84
GCAL DRAM/SRAM cells

GCAL DRAM cell
4 transistors
4 word lines/row
2 bit lines/col (or 1)

GCAL SRAM cell
8 transistors
6 word lines/row
2 bit lines/col (or 1)

85
DRAM Cell Write Cycle

All nodes initially ½.
T-gate initially closed (off).
Transmission gate opens.
Internal node is connected to bit-line (at
matching voltage).
Bit line transitions to 0 or 1.
Pulls internal node to matching level.
Transmission gate closes.
Internal node latched to new level.
Bit line transitions back to ½.
Prepares for a new cycle.
Use the reverse sequence of operations to unwrite.

86
DRAM Cell Read Cycle

All external nodes initially ½.
T-gate initially off.
Internal node contains data.
Inverter rails split.
Bit line set to (inverted) data.
T-gate at end of column latches bit-line data.
Inverter rails merge.
Bit line restored to ½ level.
Can use the reverse sequence of operations to
unread copy of data available at end of column.

87
Fully-Adiabatic DRAM cell

6T, 6 lines/row, 1 line/column (in/out together)
Read cycle
Initially ? lines neutral, out neutral, R off
R for desired row turns on
? for desired row splits, driving out column
R turns off, out is read
? merges, out is reset
Write cycle
First, do read cycle.
in is set to out
W turns on
in changed to new value...

88
Fully-Adiabatic SRAM

10-T, 10 lines/row, 1 line/column
Operation similar to DRAM, except
Read-out
T2 off N2 retracts T3 on N2 asserts T2 on, T3
off
Write
T2 off N2 retracts N1 retracts, copy of M
presented on input T1 on inchanges T1 off,
N1asserts N2 asserts T2 on

N1
N2
M
T1
T2
T3
out
in
89
Limits of Adiabatics
90
Structured Systems

A structured system is defined as a system about
whose state we have some knowledge.
Some of its physical information is known.
? Its entropy is not at a maximum (by defn.).
? It is not at equilibrium (by defn.).
For states with a given energy E,
we say the systems energy is distributed among
those states, in proportion to their probability.

All statesof the abstractsystem havingenergy E
The systemsenergy isin here
States w.prob. gt 0
91
Desired Trajectories

Any structured systemwe build to servesome
purposehas somedesiredtrajectory, or set
oftrajectories, through its configuration space
that we would ideally like it to follow at all
times.
Think of any given state as having a specific
desirability at any given time.

Time
Config-uration
Desired trajectories
92
Energy Losses

Energy dissipation can be viewed as a departure
of part of the systems energy away from the
systems desired trajectory.
E.g., 1 of 106 electronsleaks out of aDRAM cell
systems energy hasdeparted from
desiredtrajectory (all 106 stay)by a small
amount

Time
Config-uration
Energy that hasdeparted from desiredtrajectories
93
Limits of Adiabatics IFriction
94
Generalized Friction

Any force leading to departure from desired
trajectory that obeys the adiabatic principle
I.e., force strength ( total energy loss) is
proportional to velocity along trajectory at low
velocities
Examples
Ordinary sliding friction
Fluid viscosity
Electrical resistance
Forces causing electromagnetic radiative losses
Forces causing losses in inelastic collisions

95
Ways to Quantify Friction

Normal friction measures referring to length,
mass, etc. may not apply to all processes.
For a given mechanism executing a specified
process (i.e., following a specified desired
trajectory or -ies) over a time t
Energy coefficient cE ?Elostt ?Elost/q
Energy dissipated from traj. per unit of
quickness
Note quickness q 1/t has units like Hz
Entropy coefficient cS ?Smadet ?Smade/q
New entropy generated per unit of quickness
Note that cE cST at temperature T.

What matters!
96
Energy Coefficient in Electronics

For charging capacitive load C by voltage V
through effective resistance R cE ?Elostt
(CV2RC/t)t C2V2R
If the resistances are voltage-controlled
switches with gain factor k controlled by the
same voltage V, then effective R ? 1/kV cE
C2V/k
In constant-field-scaled CMOS, k ? 1/dox ? ?, C ?
?, and V ? ?, so cE ? ?3/? ?4 ?Elost cE/t
? ?4/? ?3 (like CV2
energy)

97
Degree of Reversibility of CMOS

What is the Q of a min-size CMOS transistor?
Q Efree/?Ediss
Efree/(cE/t) ½CV2/(C2V2R/t)
½(t/RC) ½ s (s slowdown factor)
Note Using transistors wider than minimum-size
(larger C, smaller R) wouldnt change RC or Q,
and would increase overall dissipation by
increasing cE.

98
Lower Bounds on Friction?

No general (technology-independent) lower bounds
on friction coefficients for interesting types of
processes (e.g. computation) are currently known.
Clever engineering may eventually reduce the
friction in desired processes to values as small
as is desired.
Some ways
Reduce number of moving parts (or particles)
Isolate moving parts of system from unwanted
interactions w. environment

99
Entropy coefficients of some reversible logic
gate operations

From Frank, Ultimate theoretical models of
nanocomputers (Nanotechnology, 1998)
SCRL, circa 1997 1 b/Hz
Optimistic reversible CMOS 10 b/kHz
Merkles quantum FET 1.2 b/GHz
Nanomechanical rod logic .07 b/GHz
Superconducting PQ gate 25 b/THz
Helical logic .01 b/THz

How low can you go? We dont really know!
100
Is Adiabatic Limit Achievable?

Even if there is some lower bound on cS, it seems
we can have ?Smade? 0 as t ? ?.
What factors may prevent this?
Any lower bound gt0 on the number of irreversible
bit-operations performed. (Each has ?Smade ? 1.)
Fortunately, the lower bound can always be made
0.
Any lower bound on the rate of energy leakage,
even when system is completely stopped.
Any upper bound on the Q of the clocking
synchronization system.
The system dissipates Efree/Q on every cycle.
No technology-independent upper bounds on Q known

101
Some Synonyms

Leakage of energy or (equivalently) probability
mass out of a desired configuration or
trajectory.
Occurrence of errors in the desired analog or
digital state of a system. (Motion away from
desired states.)
Decay of structure of a structured system. (The
state departs from desired state.)

Leakage Error Decay
102
Perfect Mechanisms?

If a structured system is perfectly closed,
I.e. non-interacting with other systems, at all!
And if its internal interactions are perfectly
known,
Then, and only then, is its (von Neumann) entropy
going to be a constant.
Otherwise, its entropy will continuously increase
as we lose track of its state.
In this case, no mechanism is perfect, in that
some of its energy (i.e. some probability mass)
is always leaking away from the desired
trajector(y/ies) at some nonzero base rate, even
when the rate of systems progress along its
trajectory is zero.

103
Leakage Limits

Claim No real, structured system can have
absolutely zero rate of energy leakage out of its
desired trajectories, even if not moving.
However No general,technology-independentlower
bound onleakage ratesis known (otherthan
zero.)
Engineering advances mightmake leakage as small
as desired.

Time
Config-uration
Energy that hasleaked from desiredconfiguration
104
Quantifying Leakage

For a given structured system
Leakage power Pleak dEleak / dt
Spontaneous entropy generation rate Sleak
dSleak / dt
Again, note Pleak Sleak T at temperature T.

105
Ways to Decrease Leakage

Have high potential-energy barriers
slows down thermally excited leakage
exponentially
Have thick potential-energy barriers
slows down quantum tunneling exponentially
Example Older generations of CMOS!
Mechanical (clockwork) systems have high
potential energy barriers, for their size
Decay may require atoms to diffuse out of
tightly-bonded spots.
Mechanisms that avoid making/breaking contacts
(e.g. buckled logic) avoid losses due to stiction.

106
Limits of Adiabatics IILeakage
107
Minimum Losses w. Leakage
Etot Eadia Eleak
Eleak Pleaktr
Eadia cE / tr
108
Minimum Loss Derivation
109
Leakage in CMOS

In a given technology with constant-field
scaling, leakage becomes worse at smaller scales
because
Energy barriers between states are lower
Higher rates of thermally-induced leakage, at
given T
Higher rates of quantum tunneling
(temp.-independent)
Energy barriers between states are narrower
Higher rates of quantum tunneling
These effects get worse exponentially with
1/length (doubly-exponentially with time)
Need alt. technologies w. high energy barriers!

110
Future Techs. w. Low Leakage?

How can we achieve low entropy coefficients in
minimum-scale (atomic-scale) devices?
Need high energy barriers
Can achieve using atomic (not just electronic)
interactions
E.g. mechanical logics (rod logic, buckling
logic)
If strong bonds (e.g. C-C) are used in structure,
rates of unwanted bond breakage can made be very
low.
Rate for an atom passing through another one
(e.g. knobs in rod logic) is extremely low due to
height of barrier strength of Coulombic
fermionic repulsion between electrons,
width of barrier large number of particles
involved
Other possibilities?

111
Minimum Dissipation with Variable V

Notice that this function of V approaches 0
exponentially as V?8,
This is even true if we scale C?V!
Thus, there is no lower limit to the energy
dissipation of adiabatic field-effect circuits!
The key is to make devices larger, not smaller!
Device sizes need grow only logarithmically.

112
Maximum Q factor in terms of V

The maximum logic Q factor is the maximum ratio
between the energy involved in carrying out a
logical transition, and the energy dissipated by
the circuit during the transition.
We just calculated the minimum energy dissipated.
Thus, Qmax Einvolved/Ediss,min
½CV2/(2CV2exp(-V/2fT)) (1/4)exp(V/2fT)
Note that the maximum logic Q-factor goes up
exponentially with the logic-swing voltage V.

113
Minimum energy Roff/Ron ratio

(A simpler version of earlier derivation.) Note
that cE C2V2Ron and if the dominant leakage
mode is source/drain, then Pleak V2/Roff
So putting the two together cEPleak
C2V4(Ron/Roff) Emin 2(cEPleak)1/2
2CV2(Ron/Roff)1/2
So we can rederive the maximum logic Q as
follows Qmax ½CV2 / (2CV2(Ron/Roff)1/2)
¼(Roff/Ron)1/2 ¼(Imax/Ileak)1/2
¼(ron/off)1/2

114
Limits of Adiabatics IIIClock/Power Supplies

See transparencies.

115
Timing in Adiabatic Systems

When multiple adiabatic devices interact, the
relative timing must be precise, in order to
ensure that adiabatic rules are met.
There are two basic approaches to timing
Global (a.k.a. clocked, a.k.a. synchronous)
timing
This is the approach in nearly all conventional
irreversible CPUs.
Also is the basis for all practical adiabatic and
quantum computing mechanisms that have been
proposed to date.
Local (a.k.a. self-timed, a.k.a. asynchronous)
timing
Implemented in a few commercial irreversible
chips.
Feynman 86 showed that a self-timed serial
reversible computation was implementable in QM,
in principle.
Margolus 90 extended this to a 2-D model with
1-D of parallelism. - Can it still work in a full
3-D, fully-quantum-mechanical?
Indications from considering classical-mechanical
3D meshes of coupled oscillators is yes.

116
Global Timing

Examples of adiabatic systems designed on the
basis of global, synchronous timing
Adiabatic CMOS with external power/clock rails
Superconducting parametric quantron (Likharev)
Adiabatic Quantum-Dot Cellular Automaton (Lent)
Adiabatic mechanical logics (Merkle, Drexler)
All proposed quantum computers
A potential problem Synchronous timing may not
scale well to large machine sizes.
Work by Janzig others raises issues of possible
limits on timing systems due to quantum
uncertainty.
Issue is still unresolved.

117
Clock/Power Supply Desiderata

Here are some requirements for a good adiabatic
timing signal / power supply for driving
voltage-coded logic
Generate a trapezoidal voltage waveform with very
flat high/low regions.
Needed to avoid current through transistors when
turning them off
The flatness of the signal limits the maximum Q
factor of the logic.
Waveform during the high?low transitions should
ideally be linear,
But this does not affect the maximum logic Q,
only the energy coefficient.
So long as ramp slope scales down everywhere with
transition time.
Operate resonantly with the logic circuit, with a
high Q factor.
The power supplys Q will limit the overall
system Q
If possible, scale Q ? t (cycle time)
Required to be considered an adiabatic mechanism.
May conflict w. inductor scaling laws!
At the least, Q should still be high at
leakage-limited speed
Have a reasonable cost, compared to the logic it
powers.
Be scalable to large meshes of mutually
synchronized devices.

118
Supply concepts in my research

Superpose several sinusoidal signals from
phase-synchronized oscillators at harmonics of
fundamental frequency
Weight these frequency components as per Fourier
transform of desired waveform
Create relatively high-L integrated inductors via
vertical, helical metal coils
Only thin oxide layers between turns
Use mechanically oscillating, capacitive MEMS
structures in vacuo as high-Q (10k) oscillator
Use geometry to get desired wave shape directly

119
Early supply concepts

Inductors switches.
See transparency.
Stepwise charging.
See transparency.

120
Newer Supply Concepts

Transmission-line-based adiabatic resonators.
See transparency.
MEMS-based resonant power supply
See next couple of slides
Ideal adiabatic supplies - Can they exist?
Idealized mechanical model See transparency.
But, there may be quantum limits to
reusability/scalability of global timing signals.
This is a very fundamental issue!

121
MEMS/NEMS Resonators

A Novel Clock/Power Supply Technology for
Adiabatic Circuits

122
MEMS/NEMS Resonators

State of the art of technology demonstrated in
lab
Frequencies up to the 100s of MHz, even GHz
Qs gt10,000 in vacuum, several thousand even in
air!
Rapidly becoming technology of choicefor
commercial RF filters, etc., in
communicationsSoC (Systems-on-a-Chip) e.g. for
cellphones.

U. Mich., poly, f156 MHz, Q9,400
34 µm
123
A MEMS Supply Concept