Title: Cache Memory III
1 Cache Memory III
Instructor: Koling Chang
Email: kchang@cs.ucdavis.edu
2 Space Overhead
- The three mapping functions introduce different space overheads
- Overhead decreases with increasing degree of associativity
- Several examples in the text
- 4 GB address space, 32 KB cache
3 Overhead Calculation
- Overhead = number of lines x (tag bits + valid bit), expressed as a fraction of the 32 KB of data storage
- 32K/32 x (32 - 5 + 1)/8 / 32K (32-byte block, fully associative) ≈ 10.9%
- 32K/32 x (32 - 5 - 10 + 2 + 1)/8 / 32K (32-byte block, 4-way set associative) ≈ 7.8%
- 32K/32 x (32 - 5 - 10 + 1)/8 / 32K (32-byte block, direct mapped) ≈ 7.0%
- 32K/4 x (32 - 2 + 1)/8 / 32K (4-byte block, fully associative) ≈ 96.9%
- 32K/4 x (32 - 2 - 13 + 2 + 1)/8 / 32K (4-byte block, 4-way set associative) ≈ 62.5%
- 32K/4 x (32 - 2 - 13 + 1)/8 / 32K (4-byte block, direct mapped) ≈ 56.3%
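A quick way to check the figures above is to compute them directly. The following sketch (an illustration in Python, not from the text) assumes one valid bit per line and no other status bits, matching the formulas above.

```python
from math import log2

def overhead(cache_bytes=32 * 1024, line_bytes=32, ways=None, addr_bits=32):
    """Tag + valid-bit storage as a fraction of the cache's data storage.
    ways=None means fully associative; ways=1 means direct mapped."""
    lines = cache_bytes // line_bytes
    offset_bits = int(log2(line_bytes))
    if ways is None:                         # fully associative: no index field
        index_bits = 0
    else:
        index_bits = int(log2(lines // ways))
    bits_per_line = (addr_bits - offset_bits - index_bits) + 1   # tag bits + valid bit
    return lines * bits_per_line / 8 / cache_bytes

for line in (32, 4):
    for label, ways in (("fully associative", None), ("4-way set assoc.", 4), ("direct mapped", 1)):
        print(f"{line:2d}-byte lines, {label:18s}: {overhead(line_bytes=line, ways=ways):.1%}")
```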
4 Outline
- Types of cache misses
- Types of caches
- Example implementations
- Pentium
- PowerPC
- MIPS
- Cache operation summary
- Design issues
- Cache capacity
- Cache line size
- Degree of associativity
5 Types of Cache Misses
- Three types
- Compulsory misses
- Due to first-time access to a block
- Also called cold-start misses or compulsory line fills
- Capacity misses
- Induced by the cache capacity limitation
- Can be reduced by increasing cache size
- Conflict misses
- Due to conflicts caused by direct and set-associative mappings
- Can be completely eliminated by fully associative mapping
- Also called collision misses
6 Types of Cache Misses (cont.)
- Compulsory misses
- Reduced by increasing block size
- We prefetch more
- Cannot increase block size beyond a limit
- Beyond that, cache misses increase again
- Capacity misses
- Reduced by increasing cache size
- Law of diminishing returns
- As a variable factor is added to fixed factors, after some point the marginal product of the variable factor declines
- Conflict misses
- Reduced by increasing the degree of associativity
- Fully associative mapping has no conflict misses (illustrated in the sketch below)
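The contrast between compulsory and conflict misses can be made concrete with a tiny simulation. The sketch below (assumed toy parameters, not from the text) runs the same reference stream through a direct-mapped cache and a fully associative cache of equal capacity; the two alternating blocks collide in the direct-mapped cache but coexist in the fully associative one.

```python
from collections import OrderedDict

BLOCK_SIZE = 32          # bytes per cache line (assumed)
NUM_LINES  = 8           # tiny cache: 8 lines = 256 bytes (assumed)

def direct_mapped_misses(addresses):
    lines = [None] * NUM_LINES                   # one tag slot per line
    misses = 0
    for addr in addresses:
        block = addr // BLOCK_SIZE
        index, tag = block % NUM_LINES, block // NUM_LINES
        if lines[index] != tag:                  # miss: the block's only slot holds another tag
            lines[index] = tag
            misses += 1
    return misses

def fully_associative_misses(addresses):
    lines = OrderedDict()                        # block number -> None, ordered by recency (LRU)
    misses = 0
    for addr in addresses:
        block = addr // BLOCK_SIZE
        if block in lines:
            lines.move_to_end(block)             # hit: refresh recency
        else:
            misses += 1
            if len(lines) == NUM_LINES:
                lines.popitem(last=False)        # evict the least recently used block
            lines[block] = None
    return misses

# Two blocks that map to the same direct-mapped line (0x100 = NUM_LINES * BLOCK_SIZE apart)
stream = [0x0000, 0x0100] * 100

print("direct mapped    :", direct_mapped_misses(stream))      # 200 = 2 compulsory + 198 conflict
print("fully associative:", fully_associative_misses(stream))  # 2 compulsory misses only
```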
7 Types of Caches
- Separate instruction and data caches
- Initial cache designs used unified caches
- Current trend is to use separate caches (for level 1)
8 Types of Caches (cont.)
- Several reasons for preferring separate caches
- Locality tends to be stronger
- Can use different designs for data and instruction caches
- Instruction caches
- Read-only, dominant sequential access
- No need for write policies
- Can use a simple direct-mapped cache implementation
- Data caches
- Can use a set-associative cache
- Appropriate write policy can be implemented
- Disadvantage
- Rigid boundaries between data and instruction caches
9 Types of Caches (cont.)
- Number of cache levels
- Most use two levels
- Primary (level 1 or L1)
- On-chip
- Secondary (level 2 or L2)
- Off-chip
- Examples
- Pentium
- L1 32 KB
- L2 up to 2 MB
- PowerPC
- L1 64 KB
- L2 up to 1 MB
10 Types of Caches (cont.)
- Two-level caches work as follows
- First attempts to get data from the L1 cache
- If present in L1, gets data from the L1 cache (L1 cache hit)
- If not, data must come from the L2 cache or main memory (L1 cache miss)
- In case of an L1 cache miss, tries to get the data from the L2 cache
- If the data are in L2, gets data from the L2 cache (L2 cache hit)
- The data block is written to the L1 cache
- If not, data come from main memory (L2 cache miss)
- The main memory block is written into both the L1 and L2 caches
- Variations on this basic scheme are possible; a sketch of the basic flow follows
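As an illustration only, the sketch below models this read path in Python; SimpleCache and its lookup/fill methods are hypothetical stand-ins rather than any processor's actual interface, and the L2-miss case fills both levels as described above.

```python
class SimpleCache:
    """Toy fully associative cache with FIFO eviction, just to make the sketch runnable."""
    def __init__(self, capacity_blocks):
        self.capacity = capacity_blocks
        self.store = {}                                  # block number -> data
    def lookup(self, block):
        return self.store.get(block)                     # None on a miss
    def fill(self, block, data):
        if block not in self.store and len(self.store) >= self.capacity:
            self.store.pop(next(iter(self.store)))       # evict the oldest block
        self.store[block] = data

def read(block, l1, l2, memory):
    """Return the data for 'block', filling the caches along the way."""
    data = l1.lookup(block)
    if data is not None:                                 # L1 hit
        return data
    data = l2.lookup(block)                              # L1 miss: try L2
    if data is not None:                                 # L2 hit
        l1.fill(block, data)                             # copy the block into L1
        return data
    data = memory[block]                                 # L2 miss: go to main memory
    l2.fill(block, data)                                 # block written into both levels
    l1.fill(block, data)
    return data

memory = {b: f"data{b}" for b in range(64)}
l1, l2 = SimpleCache(4), SimpleCache(16)
print(read(7, l1, l2, memory))   # L2 miss: fetched from memory, filled into L1 and L2
print(read(7, l1, l2, memory))   # L1 hit
```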
11 Types of Caches (cont.)
Virtual and physical caches
12 Example Implementations
- We look at three processors
- Pentium
- PowerPC
- MIPS
- Pentium implementation
- Two levels
- L1 cache
- Split cache design
- Separate data and instruction caches
- L2 cache
- Unified cache design
13 Example Implementations (cont.)
- Pentium allows each page/memory region to have its own caching attributes
- Uncacheable
- All reads and writes go directly to main memory
- Useful for
- Memory-mapped I/O devices
- Large data structures that are read once
- Write-only data structures
- Write combining
- Not cached
- Writes are buffered to reduce accesses to main memory
- Useful for video frame buffers
14 Example Implementations (cont.)
- Write-through
- Uses the write-through policy
- Writes are delayed as they go through a write buffer, as in write-combining mode
- Write-back
- Uses the write-back policy
- Writes are delayed as in the write-through mode
- Write protected
- Inhibits cache writes
- Writes are done directly to memory
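To make the write-through/write-back distinction concrete, the sketch below contrasts the two policies for a single cache line. It is a simplified illustration with hypothetical WriteThroughLine and WriteBackLine classes, not the Pentium's implementation.

```python
# 'memory' is a plain dict of block -> value standing in for main memory.

class WriteThroughLine:
    def __init__(self, memory):
        self.memory = memory
        self.block, self.value = None, None
    def write(self, block, value):
        self.block, self.value = block, value
        self.memory[block] = value                 # every write also updates main memory

class WriteBackLine:
    def __init__(self, memory):
        self.memory = memory
        self.block, self.value, self.dirty = None, None, False
    def write(self, block, value):
        if self.dirty and self.block != block:
            self.memory[self.block] = self.value   # write the old block back only on eviction
        self.block, self.value, self.dirty = block, value, True

memory_wt, memory_wb = {}, {}
wt, wb = WriteThroughLine(memory_wt), WriteBackLine(memory_wb)
for i in range(5):
    wt.write(0, i)                                 # five writes to the same block
    wb.write(0, i)
print("write-through memory:", memory_wt)          # {0: 4} -- memory updated on every write
print("write-back memory   :", memory_wb)          # {}     -- block still dirty, nothing written back yet
```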
15 Example Implementations (cont.)
- Two bits in control register CR0 determine the mode
- Cache disable (CD) bit
- Not write-through (NW) bit
(Table: cache operating modes, including write-back, determined by the CD and NW bit combinations)
16 Example Implementations (cont.)
- PowerPC cache implementation
- Two levels
- L1 cache
- Split cache
- Each is 32 KB, eight-way set associative
- Uses pseudo-LRU replacement
- Instruction cache read-only
- Data cache read/write
- Choice of write-through or write-back
- L2 cache
- Unified cache as in Pentium
- Two-way set associative
17 Example Implementations (cont.)
- Write policy type and caching attributes can be set by the OS at the block or page level
- L2 cache requires only a single bit to implement LRU
- Because it is 2-way associative
- L1 cache implements a pseudo-LRU
- Each set maintains seven PLRU bits (B0-B6), as sketched below
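The seven bits implement a binary-tree approximation of LRU. The sketch below shows the generic tree-PLRU scheme for one eight-way set; the bit numbering and update rules follow the textbook convention and are not necessarily the PowerPC's exact assignment of B0-B6.

```python
class TreePLRU8:
    """Tree pseudo-LRU for one 8-way set, using 7 bits arranged as a binary tree."""
    def __init__(self):
        # b[0] is the root; b[1], b[2] its children; b[3]..b[6] the leaf-level nodes.
        # Each bit points toward the (pseudo) less recently used subtree: 0 = left, 1 = right.
        self.b = [0] * 7

    def touch(self, way):
        """After 'way' (0-7) is accessed, make every node on its path point away from it."""
        node = 0
        for level in range(3):
            bit = (way >> (2 - level)) & 1     # branch taken toward 'way'
            self.b[node] = 1 - bit             # point to the other (less recently used) subtree
            node = 2 * node + 1 + bit          # descend toward 'way'

    def victim(self):
        """Follow the bits from the root to pick the way to replace."""
        node, way = 0, 0
        for _ in range(3):
            bit = self.b[node]
            way = (way << 1) | bit
            node = 2 * node + 1 + bit
        return way

# Starting from the reset state and touching each victim after it is filled,
# the policy cycles through all eight ways (0 4 2 6 1 5 3 7) before reusing any.
plru = TreePLRU8()
for _ in range(8):
    v = plru.victim()
    print("replace way", v)
    plru.touch(v)
```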
18 Example Implementations (cont.)
(Figure: PowerPC placement policy, including PLRU)
19 Example Implementations (cont.)
- MIPS implementation
- Two-level cache
- L1 cache
- Split organization
- Instruction cache
- Virtual cache
- Direct mapped
- Read-only
- Data cache
- Virtual cache
- Direct mapped
- Uses write-back policy
- L1 line size: 16 or 32 bytes
20 Example Implementations (cont.)
- L2 cache
- Physical cache
- Either unified or split
- Configured at boot time
- Direct mapped
- Uses write-back policy
- Cache block size
- 16, 32, 64, or 128 bytes
- Set at boot time
- L1 cache line size ≤ L2 cache line size
- Direct mapping simplifies replacement
- No need for LRU type complex implementation
21 Cache Operation Summary
- Various policies are used by caches
- Placement of a block
- Direct mapping
- Fully associative mapping
- Set-associative mapping
- Location of a block
- Depends on the placement policy
- Replacement policy
- LRU is the most popular
- Pseudo-LRU is often implemented
- Write policy
- Write-through
- Write-back
22 Design Issues
- Several design issues
- Cache capacity
- Law of diminishing returns
- Cache line size/block size
- Degree of associativity
- Unified/split
- Single/two-level
- Write-through/write-back
- Logical/physical
23 Design Issues (cont.)
Last slide