Virtual Memory - PowerPoint PPT Presentation

1 / 78
About This Presentation
Title:

Virtual Memory

Description:

Virtual Memory Prof. Sin-Min Lee Department of Computer Science – PowerPoint PPT presentation

Number of Views:110
Avg rating:3.0/5.0
Slides: 79
Provided by: Lee149
Category:

less

Transcript and Presenter's Notes

Title: Virtual Memory


1
Virtual Memory
CS147 Lecture 18
  • Prof. Sin-Min Lee
  • Department of Computer Science

2
(No Transcript)
3
(No Transcript)
4
(No Transcript)
5
Fixed (Static) Partitions
  • Attempt at multiprogramming using fixed
    partitions
  • one partition for each job
  • size of partition designated by reconfiguring the
    system
  • partitions cant be too small or too large.
  • Critical to protect jobs memory space.
  • Entire program stored contiguously in memory
    during entire execution.
  • Internal fragmentation is a problem.

6
Simplified Fixed Partition Memory Table (Table
2.1)
7
Table 2.1 Main memory use during fixed
partition allocation of Table 2.1. Job 3 must
wait.
Job List J1 30K J2 50K J3 30K J4 25K
Original State
After Job Entry
100K
Job 1 (30K)
Partition 1
Partition 1
Partition 2
25K
Job 4 (25K)
Partition 2
25K
Partition 3
Partition 3
50K
Job 2 (50K)
Partition 4
Partition 4
8
Dynamic Partitions
  • Available memory kept in contiguous blocks and
    jobs given only as much memory as they request
    when loaded.
  • Improves memory use over fixed partitions.
  • Performance deteriorates as new jobs enter the
    system
  • fragments of free memory are created between
    blocks of allocated memory (external
    fragmentation).

9
Dynamic Partitioning of Main Memory
Fragmentation (Figure 2.2)
10
Dynamic Partition Allocation Schemes
  • First-fit Allocate the first partition that is
    big enough.
  • Keep free/busy lists organized by memory location
    (low-order to high-order).
  • Faster in making the allocation.
  • Best-fit Allocate the smallest partition that
    is big enough
  • Keep free/busy lists ordered by size (smallest
    to largest).
  • Produces the smallest leftover partition.
  • Makes best use of memory.

11
First-Fit Allocation Example (Table 2.2)
  • J1 10K
  • J2 20K
  • J3 30K
  • J4 10K
  • Memory Memory Job Job Internal
  • location block size number
    size Status fragmentation
  • 10240 30K J1 10K Busy 20K
  • 40960 15K J4 10K Busy 5K
  • 56320 50K J2 20K Busy 30K
  • 107520 20K Free
  • Total Available 115K Total Used 40K

Job List
12
Best-Fit Allocation Example(Table 2.3)
  • J1 10K
  • J2 20K
  • J3 30K
  • J4 10K
  • Memory Memory Job Job Internal
  • location block size number
    size Status fragmentation
  • 40960 15K J1 10K Busy 5K
  • 107520 20K J2 20K Busy None
  • 10240 30K J3 30K Busy None
  • 56230 50K J4 10K Busy 40K
  • Total Available 115K Total Used 70K

Job List
13
First-Fit Memory Request
14
Best-Fit Memory Request
15
Best-Fit vs. First-Fit
  • First-Fit
  • Increases memory use
  • Memory allocation takes less time
  • Increases internal fragmentation
  • Discriminates against large jobs
  • Best-Fit
  • More complex algorithm
  • Searches entire table before allocating memory
  • Results in a smaller free space (sliver)

16
Release of Memory Space Deallocation
  • Deallocation for fixed partitions is simple
  • Memory Manager resets status of memory block to
    free.
  • Deallocation for dynamic partitions tries to
    combine free areas of memory whenever possible
  • Is the block adjacent to another free block?
  • Is the block between 2 free blocks?
  • Is the block isolated from other free blocks?

17
Case 1 Joining 2 Free Blocks
18
Case 2 Joining 3 Free Blocks
19
Case 3 Deallocating an Isolated Block
20
Relocatable Dynamic Partitions
  • Memory Manager relocates programs to gather all
    empty blocks and compact them to make 1 memory
    block.
  • Memory compaction (garbage collection,
    defragmentation) performed by OS to reclaim
    fragmented sections of memory space.
  • Memory Manager optimizes use of memory improves
    throughput by compacting relocating.

21
Compaction Steps
  • Relocate every program in memory so theyre
    contiguous.
  • Adjust every address, and every reference to an
    address, within each program to account for
    programs new location in memory.
  • Must leave alone all other values within the
    program (e.g., data values).

22
Memory Before After Compaction (Figure 2.5)
23
Contents of relocation register close-up of Job
4 memory area (a) before relocation (b) after
relocation and compaction (Figure 2.6)
24
Virtual Memory
  • Virtual Memory (VM) the ability of the CPU and
    the operating system software to use the hard
    disk drive as additional RAM when needed (safety
    net)
  • Good no longer get insufficient memory error
  • Bad - performance is very slow when accessing VM
  • Solution more RAM

25
Motivations for Virtual Memory
  • Use Physical DRAM as a Cache for the Disk
  • Address space of a process can exceed physical
    memory size
  • Sum of address spaces of multiple processes can
    exceed physical memory
  • Simplify Memory Management
  • Multiple processes resident in main memory.
  • Each process with its own address space
  • Only active code and data is actually in memory
  • Allocate more memory to process as needed.
  • Provide Protection
  • One process cant interfere with another.
  • because they operate in different address spaces.
  • User process cannot access privileged information
  • different sections of address spaces have
    different permissions.

26
Virtual Memory
27
Levels in Memory Hierarchy
cache
virtual memory
Memory
disk
8 B
32 B
4 KB
Register
Cache
Memory
Disk Memory
size speed /Mbyte line size
32 B 1 ns 8 B
32 KB-4MB 2 ns 100/MB 32 B
128 MB 50 ns 1.00/MB 4 KB
20 GB 8 ms 0.006/MB
larger, slower, cheaper
28
DRAM vs. SRAM as a Cache
  • DRAM vs. disk is more extreme than SRAM vs. DRAM
  • Access latencies
  • DRAM 10X slower than SRAM
  • Disk 100,000X slower than DRAM
  • Importance of exploiting spatial locality
  • First byte is 100,000X slower than successive
    bytes on disk
  • vs. 4X improvement for page-mode vs. regular
    accesses to DRAM
  • Bottom line
  • Design decisions made for DRAM caches driven by
    enormous cost of misses

DRAM
Disk
SRAM
29
Locating an Object in a Cache (cont.)
  • DRAM Cache
  • Each allocate page of virtual memory has entry in
    page table
  • Mapping from virtual pages to physical pages
  • From uncached form to cached form
  • Page table entry even if page not in memory
  • Specifies disk address
  • OS retrieves information

Cache
Page Table
Location
0
On Disk

1
30
A System with Physical Memory Only
  • Examples
  • most Cray machines, early PCs, nearly all
    embedded systems, etc.

Memory
0
Physical Addresses
1
N-1
Addresses generated by the CPU point directly to
bytes in physical memory
31
A System with Virtual Memory
  • Examples
  • workstations, servers, modern PCs, etc.

Memory
Page Table
Virtual Addresses
Physical Addresses
0
1
P-1
Disk
Address Translation Hardware converts virtual
addresses to physical addresses via an OS-managed
lookup table (page table)
32
Page Faults (Similar to Cache Misses)
  • What if an object is on disk rather than in
    memory?
  • Page table entry indicates virtual address not in
    memory
  • OS exception handler invoked to move data from
    disk into memory
  • current process suspends, others can resume
  • OS has full control over placement, etc.

Before fault
After fault
Memory
Memory
Page Table
Page Table
Virtual Addresses
Physical Addresses
Virtual Addresses
Physical Addresses
CPU
CPU
Disk
Disk
33
Terminology
4
  • Cache a small, fast buffer that lies between
    the CPU and the Main Memory which holds the most
    recently accessed data.
  • Virtual Memory Program and data are assigned
    addresses independent of the amount of physical
    main memory storage actually available and the
    location from which the program will actually be
    executed.
  • Hit ratio Probability that next memory access is
    found in the cache.
  • Miss rate (1.0 Hit rate)

34
Importance of Hit Ratio
5
  • Given
  • h Hit ratio
  • Ta Average effective memory access time by CPU
  • Tc Cache access time
  • Tm Main memory access time
  • Effective memory time is
  • Ta hTc (1 h)Tm
  • Speedup due to the cache is
  • Sc Tm / Ta
  • Example
  • Assume main memory access time of 100ns and cache
    access time of 10ns and there is a hit ratio of
    .9.
  • Ta .9(10ns) (1 - .9)(100ns) 19ns
  • Sc 100ns / 19ns 5.26
  • Same as above only hit ratio is now .95 instead
  • Ta .95(10ns) (1 - .95)(100ns) 14.5ns

35
Cache vs Virtual Memory
6
  • Primary goal of Cache
  • increase Speed.
  • Primary goal of Virtual Memory increase Space.

36
Cache Replacement Algorithms
15
  • Replacement algorithm determines which block in
    cache is removed to make room.
  • 2 main policies used today
  • Least Recently Used (LRU)
  • The block replaced is the one unused for the
    longest time.
  • Random
  • The block replaced is completely random a
    counter-intuitive approach.

37
LRU vs Random
16
  • Below is a sample table comparing miss rates for
    both LRU and Random.

Cache Size Miss Rate LRU Miss Rate Random
16KB 4.4 5.0
64KB 1.4 1.5
256KB 1.1 1.1
  • As the cache size increases there are more blocks
    to choose from, therefore the choice is less
    critical ? probability of replacing the block
    thats needed next is relatively low.

38
Virtual Memory Replacement Algorithms
17
  • 1) Optimal
  • 2) First In First Out (FIFO)
  • 3) Least Recently Used (LRU)

39
Optimal
18
  • Replace the page which will not be used for the
    longest (future) period of time.

Faults are shown in boxes hits are not shown.
1 2 3 4 1 2 5 1
2 5 3 4 5
7 page faults occur
40
Optimal
19
  • A theoretically best page replacement algorithm
    for a given fixed size of VM.
  • Produces the lowest possible page fault rate.
  • Impossible to implement since it requires future
    knowledge of reference string.
  • Just used to gauge the performance of real
    algorithms against best theoretical.

41
FIFO
20
  • When a page fault occurs, replace the one that
    was brought in first.

Faults are shown in boxes hits are not shown.
1 2 3 4 1 2 5 1
2 5 3 4 5
9 page faults occur
42
FIFO
21
  • Simplest page replacement algorithm.
  • Problem can exhibit inconsistent behavior known
    as Beladys anomaly.
  • Number of faults can increase if job is given
    more physical memory
  • i.e., not predictable

43
Example of FIFO Inconsistency
22
  • Same reference string as before only with 4
    frames instead of 3.

Faults are shown in boxes hits are not shown.
1 2 3 4 1 2 5 1 2
5 3 4 5
10 page faults occur
44
LRU
23
  • Replace the page which has not been used for the
    longest period of time.

Faults are shown in boxes hits only
rearrange stack
1 2 3 4 1 2 5 1
2 5 3 4 5
1
2
5
5
1
2
2
5
1
9 page faults occur
45
LRU
24
  • More expensive to implement than FIFO, but it is
    more consistent.
  • Does not exhibit Beladys anomaly
  • More overhead needed since stack must be updated
    on each access.

46
Example of LRU Consistency
25
  • Same reference string as before only with 4
    frames instead of 3.

Faults are shown in boxes hits only
rearrange stack
1 2 3 4 1 2 5 1
2 5 3 4 5
1
2
1
2
5
4
1
5
1
2
3
4
2
5
1
2
3
4
4
4
7 page faults occur
47
Servicing a Page Fault
(1) Initiate Block Read
  • Processor Signals Controller
  • Read block of length P starting at disk address X
    and store starting at memory address Y
  • Read Occurs
  • Direct Memory Access (DMA)
  • Under control of I/O controller
  • I / O Controller Signals Completion
  • Interrupt processor
  • OS resumes suspended process

Processor
Reg
(3) Read Done
Cache
Memory-I/O bus
(2) DMA Transfer
I/O controller
Memory
disk
Disk
48
Handling Page Faults
  • Memory reference causes a fault called a page
    fault
  • Page fault can happen at any time and place
  • Instruction fetch
  • In the middle of an instruction execution
  • System must save all state
  • Move page from disk to memory
  • Restart the faulting instruction
  • Restore state
  • Backup PC not easy to find out by how much
    need HW help

49
Page Fault
  • If there is ever a reference to a page, first
    reference will trap to OS ? page fault
  • Hardware traps to kernel
  • General registers saved
  • OS determines which virtual page needed
  • OS checks validity of address, seeks page frame
  • If selected frame is dirty, write it to disk
  • OS brings schedules new page in from disk
  • Page tables updated
  • Faulting instruction backed up to when it began
  • Faulting process scheduled
  • Registers restored
  • Program continues

50
What to Page in
  • Demand paging brings in the faulting page
  • To bring in additional pages, we need to know the
    future
  • Users dont really know the future, but some OSs
    have user-controlled pre-fetching
  • In real systems,
  • load the initial page
  • Start running
  • Some systems (e.g. WinNT will bring in additional
    neighboring pages (clustering))

51
VM Page Replacement
  • If there is an unused page, use it.
  • If there are no pages available, select one
    (Policy?) and
  • If it is dirty (M 1)
  • write it to disk
  • Invalidate its PTE and TLB entry
  • Load in new page from disk
  • Update the PTE and TLB entry!
  • Restart the faulting instruction
  • What is cost of replacing a page?
  • How does the OS select the page to be evicted?

52
Measuring Demand Paging Performance
  • Page Fault Rate (p)
  • 0 lt p lt 1.0 (no page faults to every ref is a
    fault)
  • Page Fault Overhead
  • fault service overhead read page restart
    process overhead
  • Dominated by time to read page in
  • Effective Access Time
  • (1-p) (memory access) p (page fault overhead)

53
Performance Example
  • Memory access time 100 nanoseconds
  • Page fault overhead 25 millisec (msec)
  • Page fault rate 1/1000
  • EAT (1-p) 100 p (25 msec)
  • (1-p) 100 p 25,000,000
  • 100 24,999,900 p
  • 100 24,999,900 1/1000 25 microseconds!
  • Want less than 10 degradation
  • 110 gt 100 24,999,900 p
  • 10 gt 24,999,900 p
  • p lt .0000004 or 1 fault in 2,500,000 accesses!

54
Page Replacement Algorithms
  • Want lowest page-fault rate.
  • Evaluate algorithm by running it on a particular
    string of memory references (reference string)
    and computing the number of page faults on that
    string.
  • Reference string ordered list of pages accessed
    as process executes
  • Ex. Reference String is A B C A B D A D B C B

55
The Best Page to Replace
  • The best page to replace is the one that will
    never be accessed again
  • Optimal Algorithm - Beladys Algorithm
  • Lowest fault rate for any reference string
  • Basically, replace the page that will not be used
    for the longest time in the future.
  • If you know the future, please see me after
    class!!
  • Beladys Algorithm is a yardstick
  • We want to find close approximations

56
Page Replacement - FIFO
  • FIFO is simple to implement
  • When page in, place page id on end of list
  • Evict page at head of list
  • Might be good? Page to be evicted has been in
    memory the longest time
  • But?
  • Maybe it is being used
  • We just dont know
  • FIFO suffers from Beladys Anomaly fault rate
    may increase when there is more physical memory!

57
FIFO vs. Optimal
  • Reference string ordered list of pages accessed
    as process executes
  • Ex. Reference String is A B C A B D A D B C B
  • OPTIMAL
  • A B C A B D A D B C B

System has 3 page frames
5 Faults
toss A or D
A B C D A B C
FIFO A B C A B D A D B C B
toss ?
7 faults
58
Second Chance
  • Maintain FIFO page list
  • On page fault
  • Check reference bit
  • If R 1 then move page to end of list and clear
    R
  • If R 0 then evict page

59
Clock Replacement
  • Create circular list of PTEs in FIFO Order
  • One-handed Clock pointer starts at oldest page
  • Algorithm FIFO, but check Reference bit
  • If R 1, set R 0 and advance hand
  • evict first page with R 0
  • Looks like a clock hand sweeping PTE entries
  • Fast, but worst case may take a lot of time
  • Two-handed clock add a 2nd hand that is n PTEs
    ahead
  • 2nd hand clears Reference bit

60
Not Recently Used Page Replacement Algorithm
  • Each page has Reference bit, Modified bit
  • bits are set when page is referenced, modified
  • Pages are classified
  • not referenced, not modified
  • not referenced, modified
  • referenced, not modified
  • referenced, modified
  • NRU removes page at random
  • from lowest numbered non empty class

61
Least Recently Used (LRU)
  • Replace the page that has not been used for the
    longest time

3 Page Frames Reference String - A B C A B D A
D B C
LRU 5 faults
A B C A B D A D B C
62
LRU
  • Past experience may indicate future behavior
  • Perfect LRU requires some form of timestamp to be
    associated with a PTE on every memory reference
    !!!
  • Counter implementation
  • Every page entry has a counter every time page
    is referenced through this entry, copy the clock
    into the counter.
  • When a page needs to be changed, look at the
    counters to determine which are to change
  • Stack implementation keep a stack of page
    numbers in a double link form
  • Page referenced move it to the top
  • No search for replacement

63
LRU Approximations
  • Aging
  • Keep a counter for each PTE
  • Periodically check Reference bit
  • If R 0 increment counter (page has not been
    used)
  • If R 1 clear the counter (page has been used)
  • Set R 0
  • Counter contains of intervals since last access
  • Replace page with largest counter value
  • Clock replacement

64
Contrast Macintosh Memory Model
  • MAC OS 19
  • Does not use traditional virtual memory
  • All program objects accessed through handles
  • Indirect reference through pointer table
  • Objects stored in shared global address space

Handles
65
Macintosh Memory Management
  • Allocation / Deallocation
  • Similar to free-list management of malloc/free
  • Compaction
  • Can move any object and just update the (unique)
    pointer in pointer table

Handles
66
Mac vs. VM-Based Memory Mgmt
  • Allocating, deallocating, and moving memory
  • can be accomplished by both techniques
  • Block sizes
  • Mac variable-sized
  • may be very small or very large
  • VM fixed-size
  • size is equal to one page (4KB on x86 Linux
    systems)
  • Allocating contiguous chunks of memory
  • Mac contiguous allocation is required
  • VM can map contiguous range of virtual addresses
    to disjoint ranges of physical addresses
  • Protection
  • Mac wild write by one process can corrupt
    anothers data

67
MAC OS X
  • Modern Operating System
  • Virtual memory with protection
  • Preemptive multitasking
  • Other versions of MAC OS require processes to
    voluntarily relinquish control
  • Based on MACH OS
  • Developed at CMU in late 1980s

68
(No Transcript)
69
(No Transcript)
70
(No Transcript)
71
(No Transcript)
72
(No Transcript)
73
(No Transcript)
74
(No Transcript)
75
Page Replacement Policy
  • Working Set
  • Set of pages used actively heavily
  • Kept in memory to reduce Page Faults
  • Set is found/maintained dynamically by OS
  • Replacement OS tries to predict which page would
    have least impact on the running program

Common Replacement Schemes Least Recently Used
(LRU) First-In-First-Out (FIFO)
76
Page Replacement Policies
  • Least Recently Used (LRU)
  • Generally works well
  • TROUBLE
  • When the working set is larger than the Main
    Memory

Working Set 9 pages Pages are executed in
sequence (0?8 (repeat))
THRASHING
77
Page Replacement Policies
  • First-In-First-Out(FIFO)
  • Removes Least Recently Loaded page
  • Does not depend on Use
  • Determined by number of page faults seen by a page

78
Page Replacement Policies
  • Upon Replacement
  • Need to know whether to write data back
  • Add a Dirty-Bit
  • Dirty Bit 0 Page is clean No writing
  • Dirty Bit 1 Page is dirty Write back
Write a Comment
User Comments (0)
About PowerShow.com