Automatic Measurement of Instruction Cache Capacity in XRay - PowerPoint PPT Presentation

About This Presentation

Title:

Automatic Measurement of Instruction Cache Capacity in XRay

Description:

... compiler to produce library. Examples: ATLAS, FFTW, SPIRAL, ... QEST'05 ... Require online manuals. Actual hardware values vs. number available for optimization ... – PowerPoint PPT presentation

Number of Views:50

Avg rating:3.0/5.0

Slides: 26

Provided by: kamen2

Learn more at: http://www.csc.lsu.edu

Category:

Tags: atlas | automatic | cache | capacity | instruction | measurement | online | xray

Transcript and Presenter's Notes

Title: Automatic Measurement of Instruction Cache Capacity in XRay

1
Automatic Measurement of Instruction Cache
Capacityin X-Ray

Kamen Yotov
kyotov_at_us.ibm.com
IBM T. J. Watson Research Center
Joint work with
Tyler Steele, Sandra Jackson,
Keshav Pingali, Paul Stodghill
Department of Computer Science
Cornell University

2
Motivation self-optimizing software

Goal portable performance
Self-optimizing software
Generates code with parameters whose optimal
values depend on the platform (hardware / OS /
compiler)
Determines experimentally optimal parameter
values
Uses native C compiler to produce library
Examples ATLAS, FFTW, SPIRAL,

3
Example Register Blocking for MMM

Hardware parameters
Number of FP registers (NR)
I-Cache Capacity (ICC)
A simple model for the register tile size for
MMM
Yotov et al. IEEE05
MU x NU MU NU Temp NR
KU (unroll of K loop)
does not depend on NR
depends on ICC
Need to know NR and ICC!

4
Why not consult the manuals?

Self-optimizing systems
Require online manuals
Actual hardware values vs. number available for
optimization
For software optimization, hardware values may
not be relevant
(e.g.) number of hardware registers may not be
equal to number of registers available for
holding program values (register 0 on SPARC)
Incomplete
Parameters like capacity and line size of
off-chip caches vary from model to model
Even same model of computer may be shipped with
different cache organizations
Not usually documented in processor manuals
Moving Target

5
Automatic Measurement Tools

lmbench
OS benchmark, some CPU / Memory benchmarks
Larry McVoy, BitMover, Inc.
Carl Staelin, HP
Calibrator
Memory hierarchy benchmark
Stefan Manegold
Centrum voor Wiskunde en Informatica
MOB
Memory hierarchy benchmark
Josep Blanquer, Robert Chalmers
University of California Santa Barbara

6
X-Ray

Set of micro-benchmarks in ANSI C89
Download and compile on any architecture
(portable)
Deduce hardware parameter values from timing
results
Some amount of O/S specific code
High-resolution timing routines
Super-page allocation
Currently support Linux
Windows and Solaris, IRIX, and AIX in the works
Paradox
Compiler optimizations may contaminate timing
results
Cannot afford to turn off all optimizations

7
Example Latency of Integer ADD(Step by Step)

t gettime()
r1 r2
return gettime() t

Problem hard to measure small time intervals
accurately
8
Step by Step (cont.)

t gettime()
while (--R) //R is number of repetitions
r1 r2
return gettime() t

Problem loop overhead
9
Step by Step (cont.)

t gettime()
i R / U
while (--i) //loop unrolled U times
r1 r2
r1 r2
........
r1 r2
return gettime() t

Problem compiler optimizations
10
Step by Step (cont.)

t gettime()
i R / U
switch (v)
case 0 loop
case 1 r1 r2
case 2 r1 r2
.................
case U r1 r2
if (--i)
goto loop
if (!v) return gettime() t else use(r1,r2)

Solution volatile int v 0
11
Latency of integer ADD nano-benchmark C code

Want to measure
r1r2
Generate C Code from specification
ltr1r2, ltr1, r2 intgtgt

volatile int v 0
volatile int vr 0
register int r1 vr
register int r2 vr
t gettime()
i R / U
switch (v)
case 0 loop
case 1 r1 r2
case 2 r1 r2
.................
case U r1 r2
if (--i)
goto loop
if (!v)
return gettime() t
else

12
X-Ray architecture
13
Instruction Throughput

Specification

Control Engine

N3, B1
14
Micro-benchmarks in X-Ray

CPU
Frequency
Instruction Latency
Instruction Throughput
Instruction Existence
FPU on embedded processors
FMA on general purpose processors
SMP and SMT
Memory Hierarchy
Number of Registers of various types (int, float,
SSE, )
Multilevel Caches, TLB
Associativity
Block Size
Capacity
Latency
Instruction Cache Capacity

15
Previous Approaches for Memory Hierarchy
Parameters

Saavedra Benchmark (Hennessy-Patterson)
Accesses elements of an array constant stride
apart
Measures average memory access time
Deficiencies
Considers all levels simultaneously
Works only for capacities that are powers-of-2
Suffers from a number of implementation level
deficiencies
Constant stride accesses
Loop overhead problems
Overlapping memory operations
Prone to compiler optimizations

16
ExampleIsolation of lower cache levels

Idea for Ln measurements
Use sequences as for L1 measurements
Make L1Ln-1 transparent to measurements
Unique in isolating the behavior of Ln so that
all higher levels miss
Approach
Use sequences of sequences
Convolution of sequences

?

17
Measuring I-Cache Capacity

Approach for Data Cache does not work
Array of pointers ? Code sequence with branches
Such branches are very predictable
Nearly impossible to get precise timing
Measure time to execute special code sequence of
size N statements
Find the biggest N for which there is no
significant increase in time per statement

18
Nano-benchmark

Similar to Instruction Throughput
Parameters (1, 4)
Grow length N
Code size computed
(char )finish (char )start

19
Sensitivity

Graph for Pentium M
9 more in the paper
Performance oscillates
Even after averaging out noise
Cannot wait for jump
Need more robust measurement

20
Control Engine Script

Start with N256
Compute
Mean
Standard deviation
For
Binary-search
Detect jump when time is more than

21
Experimental Results
22
Pentium 4

Does not cache ISA instructions, but uops
Trace cache
Measure the number of instructions
Smoothing in the nano-benchmark minimum of time
in

23
Conclusions

X-Ray A framework and tool
First to measure instruction cache capacity
Algorithms for precise measurements of some
important hardware parameters
Experimental results on many modern architectures
Other X-Ray resources
Memory Hierarchy parameter measurement appeared
at SIGMETRICS05
CPU parameter measurement appeared at QEST05
Improving X-Ray is work in progress

24
Current and Future Work

2-address vs. 3-address code
Out-of-Order execution
Number Physical registers
Number / Type Functional Units
Cache
bandwidth
write mode
sharedness
replacement policy

25
Thank you!

My E-Mail
kamen_at_yotov.org
kyotov_at_us.ibm.com
Cornell Group homepage
http//iss.cs.cornell.edu
This work emerged from a joint project with David
Paduas group at UIUC
http//polaris.cs.uiuc.edu/newframework.html
Download X-Ray!
http//iss.cs.cornell.edu/software/x-ray.aspx

Write a Comment

User Comments (0)

About PowerShow.com

Recommended Relevance Latest Highest Rated Most Viewed

Sort by:

Related More from user

CrystalGraphics Presentations

Introducing-PowerShowcom PowerPoint PPT Presentation

Introducing-PowerShowcom - Introducing-PowerShowcom (Without Music)

CrystalGraphics 3D Character Slides for PowerPoint PowerPoint PPT Presentation

CrystalGraphics 3D Character Slides for PowerPoint - CrystalGraphics 3D Character Slides for PowerPoint

Chart and Diagram Slides for PowerPoint PowerPoint PPT Presentation

Chart and Diagram Slides for PowerPoint - Beautifully designed chart and diagram s for PowerPoint with visually stunning graphics and animation effects. Our new CrystalGraphics Chart and Diagram Slides for PowerPoint is a collection of over 1000 impressively designed data-driven chart and editable diagram s guaranteed to impress any audience. They are all artistically enhanced with visually stunning color, shadow and lighting effects. Many of them are also animated. And they’re ready for you to use in your PowerPoint presentations the moment you need them. – PowerPoint PPT presentation

Related Presentations

CIMO Survey National Summaries of Methods and Instruments Related to Solid Precipitation Measurement at Automatic Weather Stations - Very Preliminary results - PowerPoint PPT Presentation

CIMO Survey National Summaries of Methods and Instruments Related to Solid Precipitation Measurement at Automatic Weather Stations - Very Preliminary results - - National Summaries of Methods and Instruments Related to Solid Precipitation Measurement at Automatic Weather Stations - Very Preliminary results - | PowerPoint PPT presentation | free to view

Automatic Pool Cleaners Market Breakdown Data by Manufacturers, Product, and End User PowerPoint PPT Presentation

Automatic Pool Cleaners Market Breakdown Data by Manufacturers, Product, and End User - A new report available with decisiondatabases.com on Automatic Pool Cleaners Market which provides an in-depth analysis during the forecast period. This report focuses on top manufactures with capacity, production, price, revenue and Market share. | PowerPoint PPT presentation | free to view

Automatic Harvester Market Future Forecast Report till 2025 PowerPoint PPT Presentation

Automatic Harvester Market Future Forecast Report till 2025 - A new report available with decisiondatabases.com on Automatic Harvester Market which provides an in-depth analysis during the forecast period. This report focuses on top manufactures with capacity, production, price, revenue and Market share. | PowerPoint PPT presentation | free to view

THINK ‘SMART’,THINK WOHR AUTOMATIC CAR PARKING PowerPoint PPT Presentation

THINK ‘SMART’,THINK WOHR AUTOMATIC CAR PARKING - As urban and semi-urban centres suffer from a severe shortage of parking space compounded by sky-rocketing land prices, multilevel car parking systems are the only solution. In large car parks, it's not always easy to find an unoccupied spot, and it takes time to park and retrieve vehicles. Moreover, in busy towns and cities, management of parking lots poses a serious challenge. Suitable manpower to guard, guide and manage the system is not only hard to find, but expensive. Automatic car parking systems can counter this problem, making daily life so much easier. | PowerPoint PPT presentation | free to view

THINK ‘SMART’,THINK WOHR AUTOMATIC CAR PARKING (1) PowerPoint PPT Presentation

THINK ‘SMART’,THINK WOHR AUTOMATIC CAR PARKING (1) - As urban and semi-urban centres suffer from a severe shortage of parking space compounded by sky-rocketing land prices, multilevel car parking systems are the only solution. In large car parks, it's not always easy to find an unoccupied spot, and it takes time to park and retrieve vehicles. Moreover, in busy towns and cities, management of parking lots poses a serious challenge. Suitable manpower to guard, guide and manage the system is not only hard to find, but expensive. Automatic car parking systems can counter this problem, making daily life so much easier. | PowerPoint PPT presentation | free to view

Fully Automatic Fly Ash Brick Making Machine PowerPoint PPT Presentation

Fully Automatic Fly Ash Brick Making Machine - Leading manufacturers of construction machines like automatic fly ash brick making machine - ABMH - 8SPDX, ABMH - 8SP and SPDX, ABMH - 8SP, hollow solid block making hydraulics machine, mosaic tiles machines, designer vibro forming machines, pan mixture and concrete mixture machines. | PowerPoint PPT presentation | free to view

Automatic Car Parking Solution from Wohr Parking Systems PowerPoint PPT Presentation

Automatic Car Parking Solution from Wohr Parking Systems - Around the world, more and more people are opting for Automatic Car Parking Systems. And the reasons are easy to understand. Maximise the number of parking spaces with minimum land usage. Provide convenience and safety for end-users. Provide parking for cars on multiple levels, stacked vertically. Cars are retrieved and parked automatically using a system of pallets, lifts and signalling devices. Elimination of ramps, driving lanes, pedestrian pathways and reduction in ceiling heights. Many systems utilize a steel framework (some use thin concrete slabs) rather than the monolithic concrete design of the multi-storey parking garage. These factors contribute to an overall volume reduction and further space savings for the Automatic Parking System. | PowerPoint PPT presentation | free to view

Automatic Driving School at New South Wales. PowerPoint PPT Presentation

Automatic Driving School at New South Wales. - Such a large number of individuals approach driver preparing with the essential point of breezing through the driving test. The insights on new drivers demonstrate that breezing through the test. There are genuine, quantifiable outcomes of poor driving abilities. Safe Driving School gives the Best Driving Lesson Parramatta and its trainees are instructed to drive securely and accurately so they are talented, certain and prepared to make due on the streets. Read More: http://www.safedrivingschool.com.au/ | PowerPoint PPT presentation | free to view

Automatic ice-cream characterization by electrical impedance spectroscopy PowerPoint PPT Presentation

Automatic ice-cream characterization by electrical impedance spectroscopy - The presentation discuss the use of Electrical Impedance Spectroscopy for automatic characterization of ice cream mixes. The feasibility to discriminate between milk based creamy and fruit based mixes by means of non desctructive analysis of the sample electrical parameters is shown. If you want to know more about this, please read the following paper: Marco Grossi, Massimo Lanzoni, Roberto Lazzarini, Bruno Riccò, “Automatic ice-cream characterization by impedance measurements for optimal machine setting”, Measurement 45, 2012, 1747-1754. https://www.researchgate.net/publication/229458285_Automatic_Ice-Cream_Characterization_by_Impedance_Measurements_for_Optimal_Machine_Setting | PowerPoint PPT presentation | free to view

Arduino based Automatic Temperature Controlled Fan Speed Regulator PowerPoint PPT Presentation

Arduino based Automatic Temperature Controlled Fan Speed Regulator - Using an analog temperature LM35 interfaced to the built in ADC of a programmed Arduino to develop varying duty cycle of PWM output for a driver IC to run a DC motor automatically according to the sensed temperature at different speed based on the temperature sensed. | PowerPoint PPT presentation | free to view

Global Automatic Guided Vehicles Market Research Report 2016 PowerPoint PPT Presentation

Global Automatic Guided Vehicles Market Research Report 2016 - The grandresearch report is about-The report firstly introduced the Automatic Guided Vehicles basics: definitions, classifications, applications and industry chain overview; industry policies and plans; product specifications; manufacturing processes; cost structures and so on. Then it analyzed the world's main region market conditions, including the product price, profit, capacity, production, capacity utilization, supply, demand and industry growth rate etc. In the end, the report introduced new project SWOT analysis, investment feasibility analysis, and investment return analysis. Visit here-http://www.grandresearchstore.com/instrument/global-automatic-guided-vehicles-market-research-report-2016 | PowerPoint PPT presentation | free to view

Urban, Suburban, or Rural? a GPS cache activity: by Hope Vincent Social Studies GLE 14 PowerPoint PPT Presentation

Urban, Suburban, or Rural? a GPS cache activity: by Hope Vincent Social Studies GLE 14 - Urban, Suburban, or Rural? a GPS cache activity: by Hope Vincent Social Studies GLE 14 Tell us about your community. Use Skype to visit with other children or adults ... | PowerPoint PPT presentation | free to view

Transparent Cache Market - Global Outlook and Forecast 2021-2027 PowerPoint PPT Presentation

Transparent Cache Market - Global Outlook and Forecast 2021-2027 - The global Transparent Cache market was valued at xx million in 2020 and is projected to reach US$ xx million by 2027, at a CAGR of xx% during the forecast period. We surveyed the Transparent Cache companies, and industry experts on this industry, involving the revenue, demand, product type, recent developments and plans, industry trends, drivers, challenges, obstacles, and potential risks. | PowerPoint PPT presentation | free to view

Automatic Wet Blasting Machines Market Professional Survey Report 2018 PowerPoint PPT Presentation

Automatic Wet Blasting Machines Market Professional Survey Report 2018 - This report studies Automatic Wet Blasting Machines in Global market, especially in North America, China, Europe, Southeast Asia, Japan and India, with production, revenue, consumption, import and export in these regions, from 2013 to 2018, and forecast to 2025. This report focuses on top manufacturers in global market, with production, price, revenue and market share for each manufacturer, covering AB SHOT TECNICS, S.L. Blastline CLEMCO INDUSTRIES VIXEN Wheelabrator Hodge Clemco | PowerPoint PPT presentation | free to view

Global Automatic Guided Vehicles Market Research Report 2016 PowerPoint PPT Presentation

Global Automatic Guided Vehicles Market Research Report 2016 - VISIT HERE @ http://www.grandresearchstore.com/instrument/global-automatic-guided-vehicles-market-research-report-2016 This Report Provided By GrandResearchStore Is About,Automatic Guided Vehicles basics: definitions, classifications, applications and industry chain overview; industry policies and plans; product specifications; manufacturing processes; cost structures and so on. Then it analyzed the world's main region market conditions, including the product price, profit, capacity, production, capacity utilization, supply, demand and industry growth rate etc. In the end, the report introduced new project SWOT analysis, investment feasibility analysis, and investment return analysis. | PowerPoint PPT presentation | free to view

Automatic Self-Piercing Rivets Market Trends, Emerging Opportunities, And Top Key Players PowerPoint PPT Presentation

Automatic Self-Piercing Rivets Market Trends, Emerging Opportunities, And Top Key Players - The mushrooming manufacturing of luxury vehicles is causing a sharp surge in the demand for automatic self-piercing rivets across the world. | PowerPoint PPT presentation | free to view

Automatic Number Plate Recognition Camera Market Research Report 2018 PowerPoint PPT Presentation

Automatic Number Plate Recognition Camera Market Research Report 2018 - Global Automatic Number Plate Recognition Camera Market Research Report 2018 provides a complete data analysis with Market value, Sales, Price, Industry Analysis and Forecast with the help of Industry Experts. | PowerPoint PPT presentation | free to view

Global Automatic Amino-acid Analyzor Market Research Report 2017 PowerPoint PPT Presentation

Global Automatic Amino-acid Analyzor Market Research Report 2017 - VISIT HERE @ https://www.grandresearchstore.com/diagnostic-and-biotech/global-automatic-amino-acid-analyzor-market-research-report-2017 This report provided by GrandResearchStore is about,Automatic Amino-acid Analyzor in Global market, especially in North America, Europe, China, Japan, Southeast Asia and India, focuses on top manufacturers in global market, with capacity, production, price, revenue and market share for each manufacturer, covering ZefSci Biochrom Hitachi | PowerPoint PPT presentation | free to view

Global Automatic Distillation Analyzer Industry Market 2016 Industry Key Trends, Demand, Growth, Size, Review, Share, Analysis to 2020 PowerPoint PPT Presentation

Global Automatic Distillation Analyzer Industry Market 2016 Industry Key Trends, Demand, Growth, Size, Review, Share, Analysis to 2020 - Avail more information from Sample Brochure of report @ https://goo.gl/Wo9HDM A detailed qualitative analysis of the factors responsible for driving and restraining growth of the Global Automatic Distillation Analyzer Industry Market and future opportunities are provided in the report. | PowerPoint PPT presentation | free to view

Automatic Tipper Market Analysis by Supply, Sales, Demand, Status and Forecasts 2016 to 2020 PowerPoint PPT Presentation

Automatic Tipper Market Analysis by Supply, Sales, Demand, Status and Forecasts 2016 to 2020 - The Automatic Tipper market 2016 report summarizes Automatic Tipper industry policy and plan, product specification, manufacturing process, cost structure etc. the report deeply analyzed the world's main region market conditions that including the product price, profit, capacity, production, capacity utilization, supply, demand and industry growth rate. View complete report at https://goo.gl/mJDK7y . | PowerPoint PPT presentation | free to view

Architectural Analysis of a DSP Device, the Instruction Set and the Addressing Modes PowerPoint PPT Presentation

Architectural Analysis of a DSP Device, the Instruction Set and the Addressing Modes - Architectural Analysis of a DSP Device, the Instruction Set and the Addressing Modes SYSC5603 (ELG6163) Digital Signal Processing Microprocessors, Software and ... | PowerPoint PPT presentation | free to view

Global Automatic Fat Extraction Apparatus Market Research Report 2017 PowerPoint PPT Presentation

Global Automatic Fat Extraction Apparatus Market Research Report 2017 - VISIT HERE @ https://www.grandresearchstore.com/diagnostic-and-biotech/global-automatic-fat-extraction-apparatus-market-research-report-2017 This report provided by GrandResearchStore is about,Automatic Fat Extraction Apparatus in Global market, especially in North America, Europe, China, Japan, Southeast Asia and India, focuses on top manufacturers in global market, with capacity, production, price, revenue and market share for each manufacturer, covering ANKON BUCHI Gerhardt Omega Scientific Pte Ltd | PowerPoint PPT presentation | free to view

Basic Caching Terminology PowerPoint PPT Presentation

Basic Caching Terminology - Certain terms that are most frequently used with regard to caching are origin server, cache hit ratio, freshness, stale content, validation and invalidation. | PowerPoint PPT presentation | free to view

Global Semi Automatic Test Equipment Market Research Report 2017 PowerPoint PPT Presentation

Global Semi Automatic Test Equipment Market Research Report 2017 - This Report provided by 24 Market Reports is about, Semi Automatic Test Equipment Report by Material, Application, and Geography Global Forecast to 2021 is a professional and in-depth research report on the world's major regional market conditions, focusing on the main regions (North America, Europe and Asia-Pacific) and the main countries. | PowerPoint PPT presentation | free to view

Europe Automatic Hospital Doors Industry 2016 - Market Size, Share, Trends & Forecast PowerPoint PPT Presentation

Europe Automatic Hospital Doors Industry 2016 - Market Size, Share, Trends & Forecast - The Europe Automatic Hospital Doors Industry 2016 report focuses on Europe major leading industry players providing information such as company profiles, product picture and specification, capacity, production, price, cost, revenue and contact information. Upstream raw materials and equipment and downstream demand analysis is also carried out. The Automatic Hospital Doors industry development trends and marketing channels are analyzed. | PowerPoint PPT presentation | free to view

Automatic Folder Gluer Machine Market Professional Survey Report 2018 PowerPoint PPT Presentation

Automatic Folder Gluer Machine Market Professional Survey Report 2018 - Automatic Folder Gluer Machine market in Global market, especially in North America, China, Europe, Southeast Asia, Japan and India, with production, revenue, consumption, import and export in these regions, from 2013 to 2018, and forecast to 2025. | PowerPoint PPT presentation | free to view

SVAVO 340ml Automatic Soap Dispenser Infrared Touchless Motion Bathroom Dispenser Smart Sensor Liquid Soap Dispenser for Kitchen PowerPoint PPT Presentation

SVAVO 340ml Automatic Soap Dispenser Infrared Touchless Motion Bathroom Dispenser Smart Sensor Liquid Soap Dispenser for Kitchen - Type: Liquid Soap Dispensers Feature: Liquid Soap Dispenser Liquid Soap Dispenser Type: Automatic Soap Dispenser Main Material: ABS Model Number: V-473 Brand Name: Svavo Material: ABS plastic Capacity: 350ml Voltage: DC6V Currective: 480mA Unit size: L84*W110*H240mm Effective induction distance: 0-10cm Install: Deck mounted Power: 4pcs "AAA" batteries (not included) Volume settings: 1.5ml/3ml/4.5ml Feature: Suitable for using liquid soap, shampoo,alcohol etc. ShareShare on Facebook TweetTweet on Twitter Pin itPin on Pinterest | PowerPoint PPT presentation | free to view