Title: Giga-Scale System-On-A-Chip International Center on System-on-a-Chip (ICSOC)
1Giga-Scale System-On-A-ChipInternational Center
on System-on-a-Chip (ICSOC)
- Jason Cong
- University of California, Los Angeles
- Tel 310-206-2775, Email cong_at_cs.ucla.edu
- (Other participants are listed inside)
2Background Double Exponential Growth of
Design Complexity
- C1 complexity due to exponential increase of
chip capacity - More devices
- More power
- Heterogeneous integration,
- C2 complexity due to exponential decrease of
feature size - Interconnect delay
- Coupling noise
- EMI,
- Design Complexity ? C1 x C2
3Motivation Productivity Gap
10,000,000
100,000,000
1,000,000
10,000,000
58/Yr. Complexity growth rate
100,000
1,000,000
Logic Transistors/Chip (K)
Transistor/Staff-Month
10,000
100,000
1,000
10,000
21/Yr. Productivity growth rate
x
x
100
1,000
x
x
x
x
x
x
10
100
1
10
1998
2003
Chip Capacity and Designer Productivity
Source NTRS97
4Project Summary
- Develop new design methodology to enable
efficient giga-scale integration for
system-on-a-chip (SOC) designs - Project includes three major components
- SOC synthesis tools and methodologies
- SOC verification, test, and diagnosis
- SOC design driver network processor
5Research Team by Institutions
- US
- UCLA Jason Cong
- UC Santa Barbara Tim Cheng
- Taiwan
- NTHU Shi-Yu Huang, Tingting Hwang, J. K. Lee,
Youn-Long Lin, C. L. Liu, Cheng-Wen Wu, Allen Wu
- NCTU Jing-Yang Jou
- China
- Tsinghua Univ. Jinian Bian, Xianlong Hong, Zeyi
Wang, Hongxi Xue - Peking Univ. Xu Cheng
- Zhejiang Univ. Xiaolang Yan
6Current Research Team
- US
- UCLA Jason Cong
- UC Santa Barbara Tim Cheng
- Taiwan
- NTHU Shi-Yu Huang, Tingting Hwang, J. K. Lee,
Youn-Long Lin, C. L. Liu, Cheng-Wen Wu, Allen Wu
- NCTU Jing-Yang Jou
- China
- Tsinghua Univ. Jinian Bian, Xianlong Hong, Zeyi
Wang, Hongxi Xue - Peking Univ. Xu Cheng
- Zhejiang Univ. Xiaolang Yan
- Several new faculty members in the 7 institutions
- Guest members from National University of
Singapore, Purdue Univ., and UCLA (EE Dept)
7Thrust 1 -- SOC Synthesis Environment/Methodology
(Led by Jason Cong)
Design Spec VHDL/C
VHDL/C Co-Simulation
Design Partitioning
ASIC Synthesis
Interconnect-Driven High-level Synthesis
Code Generation for Retargetable Compiler and
Assembler Generator
DSP Synthesis and Optimization
FPGA Synthesis and Technology Mapping
Synthesis for IP Reuse
Physical Synthesis for Full-Chip Assembly
Embedded Processors
DSPs
Embedded FPGAs
Customized Logic
8Interconnect Bottleneck in Nanometer Designs
- Challenge Single-cycle full chip communication
is no longer possible - Not supported by the current CAD toolset
- ITRS01 0.07um Tech
- 5.63 G Hz across-chip clock
- 800 mm2 (28.3mm x 28.3mm)
- IPEM BIWS estimations
- Buffer size 100x
- Driver/receiver size 100x
- On semi-global layer (tier 3)
- Can travel up to 11.4 mm in one cycle
- Need 5 clock cycles from corner to corner
9Regular Distributed Register Architecture
- Use register banks
- Registers in each island are partitioned to k
banks for 1 cycle, 2 cycle, k cycle
interconnect communication in each island - Highly regular
10MCAS Architectural Synthesis for Multi-Cycle
Communication Using RDR Architecture
C program
CDFG generation
MCAS (Multi-Cycle Architectural Synthesis)
CDFG
Resource allocation Functional unit binding
ICG
Scheduling-driven placement
Locations
Placement-driven rescheduling rebinding
Register and port binding
Datapath FSM generation
Floorplan constraints
RTL VHDL
Multi-cycle path constraints
11MCAS flow vs. Synopsys Behavioral Compiler (on
Virtex-II)
- Synopsys Behavioral Compiler setting default
(optimizing latency) - Average latency ratio of MCAS vs. BC 69
Latency
Resource
12Optimality Study of Large-Scale Circuit Placement
- Construction of Placement Example with Known
Optimal (PEKO) C. Chang et al, 2003
?
13High Interest in the Community
- Three EE Times articles coverage
- Placement tools criticized for hampering IC
designs Feb03 - IC placement benchmarks needed, researchers say
April03 - FPGA placement performance Nov03
- More than 150 downloads from our website
- Cadence, IBM, Intel, Magma, Mentor Graphics,
Synopsys, etc - CMU, SUNY, UCB, UCSB, UCSD, UIC, UMichgan,
UWaterloo, etc - Used in every placement since its publication
http//ballade.cs.ucla.edu/pubbench
14Floorplanning Interconnect Planning
- Based on proposed Corner Block List (CBL)
representation propose several Extended Corner
Block List, ECBL, CCBL and SUB-CBL to speed up
floorplanning and handle more complicate L/T
shaped and rectilinear shaped blocks. - Propose floorplanning algorithms with some
geometric constraints, such as boundary,
abutment, L/T shaped blocks. - Propose integrated floorplanning and buffer
planning algorithms with consideration of
congestion . - Using research results from UCLA on interconnect
planning - About 30 papers published in DAC, ICCAD, ISPD,
ASPDAC, ISCAS and Transactions.
15 P/G Network Analysis Optimization
- Propose an Area Minimization of Power
Distribution Network Using Efficient Nonlinear
Programming Techniques (ICCAD2001, accepted by
IEEE Trans. On CAD) - Propose a decoupling capacitance optimization
algorithm for Robust On-Chip Power Delivery
(ASPDAC2004, ASICON2003)
16Parasitic R/L/C Etraction
- 3-D R/C Extraction using Boundary Element Method
(BEM) - Quasi-Multiple Medium (QMM) BEM algorithms
- Hierarchical Block BEM (HBBEM) technique
- Fast 3-D Inductance Extraction (FIE)
- Papers were published in ASPDAC, ASICON and IEEE
Transaction on MTT
17Thrust 2 -- SOC Verification, Test, and
Diagnosis(Led by Tim Cheng)
Verification and Testing
Enabling techniques for semi-formal functional
verification
Testing and diagnosis for heterogeneous SOC
Self-testing using on-chip programmable components
Self-testing for on-chip analog/mixed-signal
components
Automatic/semi-automatic functional vector
generation from HDL code
Scalable constraint-solving techniques
Integrated framework for simulation, vector
generation and model checking
New test techniques for deep-submicron embedded
memories
18Key Results - Verification
- Developed and released ATPG-based SAT solvers for
circuits (Univ. of California, Santa Barbara) - Integrating structural ATPG and SAT techniques
with new conflict learning - CSAT Fast combinational solver (released on
March 2003) - Demonstrated 10-100X speedup over
state-of-the-art SAT solvers on industrial test
cases (reported by Intel and Calypto) - Has been integrated into Intels FV verification
system and a startups verification engine - Publications DATE2003 and DAC2003
- Satori2 Fast sequential solver (released on Dec.
2003) - Demonstrated 10X-200X speedup over a commercial,
sequential ATPG engine on public benchmark
circuits - Publications ICCAD2003, HLDVT2003 and ASPDAC2004
19Key Results - Testing
A new Statistical Delay Testing and Diagnosis
framework consisting of five major components
(UCSB)
- Statistical timing analysis
- Statistical critical path selection
DAC02,ICCAD02 - Selecting statistical long true paths whose
tests maximize detection of parametric failures - Path coverage metric ASPDAC03
- Estimating the quality of a path set
- Selection/Generation of high quality tests for
target paths ITC01DATE 2004 - Identifying tests that activate longer delay
along the target path - Delay fault diagnosis based on statistical timing
model DATE03, VTS03, DAC03 - Ref Krstic, Wang, Cheng, Abadir, DATE03Best
Paper Award in Test
20Key Results - Testing
- On-Chip Jitter Extraction for Bit-Error-Rate
(BER) Testing of Multi-GHz Signal (UCSB) - Using on-chip, single-shot measurement unit to
sample signal periods for spectral analysis - Demonstrated, through simulation, accurate
extraction of multiple sinusoids and random
jitter components for a 3GHz signal - Publications ASPDAC2004 and DATE2004
21Thrust 3 Design Driver Network Security
Processor (Led by Prof. C. W. Wu Xu Cheng)
- Applications IPSec, SSL, VPN, etc.
- Functionalities
- Public key RSA, ECC
- Secret key AES
- Hashing (Message authentication) HMAC
(SHA-1/MD5) - Truly random number generator (FIPS 140-1,140-2
compliant) - Target technology 0.18?m or below
- Clock rate 200MHz or higher (internal)
- 32-bit data and instruction word
- 10Gbps (OC192)
- Power 1 to 10mW/MHz at 3V (LP to HP)
- Die size 50mm2
- On-chip bus AMBA (Advanced Microcontroller Bus
Architecture)
22Encryption Modules (PKEM)
- Public key encryption module
- Operations
- 32-bit word-based modular multiplication
- Multiplication over GF(p) and GF(2m)
- An RSA cryptography engine with small area
overhead and high speed - Scalable word-width
- TSMC 0.35µm
- 34K gates (1.71.8 mm2 )
- 100MHz clock
- Scalable key length
- Throughput
- 512-bit key 1.79Kbps/MHz
- 1024-bit key 470bps/MHz
23Encryption Modules (SKEM)
- Secret key encryption module
- Operations
- Matrix operations, manipulation
- AES cryptography
- 32-bit external interface
- 58K gates
- Over 200MHz clock
- Throughput 2Gbps
- Support key length of 128/192/256 bits
Technology TSMC 0.25?m CMOS
Package 128CQFP
Core Size 1,279 x 1,271 ?m2
Gate Count 63.4K
Max. Freq. 250MHz
Throughput 2.977 Gbps (128-bit key) 2.510 Gbps (196-bit key) 2.169 Gbps (256-bit key)
24International Collaborations
- Joint NSF/NSC workshop in Aug. 1999 on SOC
(Hsin-Chu, Taiwan) - First team preparation meeting for the proposed
center in Jan. 2000 (Yokohama, Japan) - 2nd planning meeting held in April 2000 (Hawaii,
US) - 3rd planning meeting in Aug. 2000 (Chengde,
China) - Proposal submitted to NSF in Aug. 2000 and funded
in Dec. 2000 - Workshops
- March 30-31, 2001 in Taipei, Taiwan.
- June 23-24, 2001 in Los Angeles, USA
- August 31-September 1, 2001 in HangZhou, China
- March 28-29, 2002, National Tsing Hua
University, Hsinchu, Taiwan - August 20-21, 2002, Peking University, Beijing,
China - November 15-16, 2002, University of California,
Santa Barbara - March 27-29, 2003, National Taiwan University,
Taipei, Taiwan - December 19-21, 2003, Yunnan University, Kunming,
China
25Publications
- 56 research publications up to this point
- 17 in top conferences/journals (DAC, ICCAD,
ASPDAC, ITC, etc.) in the field
26People Education
- Many interactions among participants from
different institutes - Two new IEEE fellows
- Prof. Xiaolang Hong, Tsinghua Univ.
- Prof. Cheng-Wen Wu, National Tsing Hua Univ.
- Involved many young faculty members and
researchers - Trained an army of graduate students