MOUSETRAP%20Ultra-High-Speed%20Transition-Signaling%20Asynchronous%20Pipelines - PowerPoint PPT Presentation

About This Presentation
Title:

MOUSETRAP%20Ultra-High-Speed%20Transition-Signaling%20Asynchronous%20Pipelines

Description:

Clock power consumption is a significant proportion of total power consumption. ... All parts of the clocked circuits run the same frequency. Performance ... – PowerPoint PPT presentation

Number of Views:158
Avg rating:3.0/5.0
Slides: 40
Provided by: Aqua2
Category:

less

Transcript and Presenter's Notes

Title: MOUSETRAP%20Ultra-High-Speed%20Transition-Signaling%20Asynchronous%20Pipelines


1
MOUSETRAPUltra-High-Speed Transition-Signaling
Asynchronous Pipelines
  • Montek Singh Steven M. Nowick
  • Department of Computer Science
  • Columbia University, New York, NY 10027
  • 2001 IEEE

2
Agenda
  • Review
  • Introduction
  • MOUSETRAP
  • Preliminary Experiment Results
  • Conclusions

3
Review
  • Synchronous pipeline
  • Wave pipeline
  • Clock-delayed domino
  • Skew-tolerant domino
  • Self-resetting circuits
  • Asynchronous pipeline
  • Micropipeline
  • GasP
  • IPCMOS

4
Asynchronous circuits benefits
  • No clock skew problem
  • Low power consumption
  • Faster speed (average case)
  • Reduce global timing issues
  • Avoid variations in fabrication,temperature,etc.
  • Low EMI Noise

5
Low Power Consumption
  • On high-performance chips
  • Clock power consumption is a significant
    proportion of total power consumption.
  • Gated clocks reduce the wastage
  • Make clock skew worse
  • Incur some power cost
  • All parts of the clocked circuits run the same
    frequency

6
Performance
  • Synchronous design must be toleranced for worst
    case conditions
  • Fabrication, temperature, voltage, data values,
    Clock skew
  • Asynchronous circuits self-adjust to the
    operating and data conditions

7
Agenda
  • Review
  • Introduction
  • MOUSETRAP
  • Preliminary Experiment Results
  • Conclusions

8
Introduction
  • Asynchronous Design Styles
  • Protocol
  • Level signaling (four phase)
  • Transition signaling (two phase)
  • Logic
  • Bundled-data (ex signal-rail)
  • Self-timed (ex dual-rail)

9
Level signaling (four phase)
  • A send data to B (active)
  • Step 1A?put data in bus, set req 1
  • Step 2B?get data from bus, set ack 1
  • (return-to-zero phase)
  • Step 3A?set req 0
  • Step 4B?set ack 0

10
Transition signaling (two phase)
  • A send data to B (active)
  • Step 1A?put data in bus, set req 1
  • Step 2B?get data from bus, set ack 1
  • Step 3A? put data in bus , set req 0
  • Step 4B? get data from bus, set ack 0

11
Introduction
  • Asynchronous Design Styles
  • Protocol
  • Level signaling (four phase)
  • Transition signaling (two phase)
  • Logic
  • Bundled-data (ex signal-rail)
  • Self-timed (ex dual-rail)

12
C-element
  • ZnextABZ(AB)
  • When A1,B1 ? Znext1
  • When A0,B0 ? Znext0

13
Micropipeline
  • 4-phase latch FIFO

ack
req
14
Bundled-data
15
Self-timed
  • Generate Completion-Detection signal
  • Delay-Insensitive (DI) Coding
  • exdual-rail coding (two phase coding)
  • 00 -gt invalid value
  • 01 -gt 0
  • 10 -gt 1
  • 11 -gt no use

16
Self-timed (dual-rail coding)
17
Performance Comparison of Asynchronous Adders
Mark A. Franklin Tienyo Pan
18
Agenda
  • Review
  • Introduction
  • MOUSETRAP
  • Preliminary Experiment Results
  • Conclusions

19
Mousetrap
  • Minimal-Overhead Ultrahigh-SpEed
    Transition-signaling Asynchronous Pipeline

20
MOUSETRAP-FIFO
Latch delay is 110 ps XNOR delay is 65 ps
21
MOUSETRAPwith logic (bundled data)
22
Bundled data
  • Bundled data scheme
  • Reqn must arrive at stage N after the data
    inputs to that stage have stabilized.
  • Worst-case delay
  • Allow circuits to have hazards

23
Delay Buffer
  • Inverter chain
  • A chain of transmission gates
  • Duplicate the worst-case critical path
  • More accurate delay
  • More area-expensive

24
Timing-forward latency
25
Timing-Cycle time
26
Standard synchronous pipeline
  • Forward latency
  • Cycle time

27
MOUSETRAP-Setup time
28
MOUSETRAP-Hold time
29
Clocked-CMOS (C2MOS) logic
30
C2MOSs benefits
  • Smaller delay
  • Smaller area
  • Lower power consumption

31
MOUSETRAP- C2MOS
  • Forward latency
  • Cycle time

32
Handling wide datapaths
  • Datapath partitioning
  • Control kiting (buffer insertion)

33
Optimization
  • Sliding door
  • Change MOSs width (lower )

34
Non-Linear Pipeline-fork
35
Non-Linear Pipeline-join
36
experiment
  • 0.25µm TSMC
  • 2.5v , 300k
  • A pass-gate implementation of an XNOR/XOR
  • A standard 6 transistor pass-gate dynamic D-latch
  • 0.6µm HP
  • 3.3v ,300K
  • A pass-gate implementation of an XNOR/XOR
  • Clocked-CMOS style latch
  • 10 stage, 16-bit datapath
  • pre-layout simulation (HSPICE)

37
result
38
Conclusions
  • Use small fast latches
  • Low Latch controller overhead(XNOR)
  • Transition-signaling protocol(efficient
    concurrent)
  • Without complex timing design effort
  • Variable-speed environment(elasticity)

39
comparison
  • IPCMOS (asynchronous interlocked pipelined CMOS)
  • 3.34.5GHz
  • IBM 0.18µm
  • Post-layout simulation
Write a Comment
User Comments (0)
About PowerShow.com