Title: Caching IV
1. Caching IV
- Andreas Klappenecker
- CPSC 321 Computer Architecture
2. Virtual Memory
- Processor generates virtual addresses
- Memory is accessed using physical addresses
- Virtual and physical memory is broken into blocks of memory, called pages
- A virtual page may be
  - absent from main memory, residing on the disk
  - or may be mapped to a physical page
3. Virtual Memory
- Main memory can act as a cache for the secondary storage (disk)
- Virtual address generated by processor (left)
- Address translation (middle)
- Physical addresses (right)
4. Pages: virtual memory blocks
- Page faults: if the data is not in memory, retrieve it from disk
  - huge miss penalty, thus pages should be fairly large (e.g., 4 KB)
  - reducing page faults is important (LRU is worth the price)
  - can handle the faults in software instead of hardware
  - using write-through takes too long, so we use write-back
- Example: page size 2^12 bytes = 4 KB, 2^18 physical pages
  - main memory: 1 GB, virtual memory: 4 GB
5. Page Faults
- Incredibly high penalty for a page fault
- Reduce the number of page faults by optimizing page placement
- Use fully associative placement
  - a full search of pages is impractical
  - pages are located by a full table that indexes the memory, called the page table
  - the page table resides within the memory
6. Page Tables
The page table maps each page to either a page in main memory or to a page stored on disk
7. Page Tables
8. Making Memory Access Fast
- Page tables slow us down
- Memory access will take at least twice as long:
  - access the page table in memory
  - access the page
- What can we do?
- Memory access is local, so use a cache that keeps track of recently used address translations, called the translation lookaside buffer
9. Making Address Translation Fast
- A cache for address translations: the translation lookaside buffer (TLB)
10. Translation Lookaside Buffer
- Some typical values for a TLB:
  - TLB size: 32-4096 entries
  - Block size: 1-2 page table entries (4-8 bytes each)
  - Hit time: 0.5-1 clock cycle
  - Miss penalty: 10-30 clock cycles
  - Miss rate: 0.01%-1%
11. TLBs and Caches
12. More Modern Systems
- Very complicated memory systems
13. Some Issues
- Processor speeds continue to increase very fast, much faster than either DRAM or disk access times
- Design challenge: dealing with this growing disparity
- Trends:
  - synchronous SRAMs (provide a burst of data)
  - redesign DRAM chips to provide higher bandwidth or processing
  - restructure code to increase locality
  - use prefetching (make the cache visible to the ISA)
14. Where can a Block be Placed?

| Name              | Number of sets                    | Blocks per set                |
|-------------------|-----------------------------------|-------------------------------|
| Direct mapped     | Blocks in cache                   | 1                             |
| Set associative   | (Blocks in cache) / Associativity | Associativity (typically 2-8) |
| Fully associative | 1                                 | Number of blocks in cache     |
15. How is a Block Found?

| Associativity     | Location method                      | Comparisons required    |
|-------------------|--------------------------------------|-------------------------|
| Direct mapped     | Index                                | 1                       |
| Set associative   | Index the set, search among elements | Degree of associativity |
| Fully associative | Search all cache entries             | Size of the cache       |
| Fully associative | Separate lookup table                | 0                       |
16. Algorithm for Success
- Read Chapters 5-7
  - get the big picture
- Read again
  - focus on the little details
  - do calculations
  - work problems
- Get enough sleep!
- What should be reviewed?
17. Project
- Provide a working solution
  - it is better to submit a working solution implementing a subset of instructions
  - if you submit a faulty version, comment your bugs
- Have test programs that exercise all instructions
- Have a full report that explains your design
  - it should include a table of control signals