Superficial Review of Uniprocessor Architecture and Related concepts

1
(Superficial!) Review of Uniprocessor Architecture and Related concepts
  • CS 433
  • Laxmikant Kale
  • University of Illinois at Urbana-Champaign
  • Department of Computer Science

2
Early machines
  • We will present a series of idealized and
    simplified models
  • Read more about the real models in architecture
    textbooks
  • The idea here is to review the concepts and
    define our vocabulary

[Diagram: Processor connected to Memory, shown as locations 0, 1, ..., k]
3
Early machines
  • Early machines: complex instruction sets, (let's
    say) no registers
  • Processor can access any memory location equally
    fast
  • Instructions
  • Operations: Add L1, L2, L3 (add the contents of
    location L1 to those of location L2, and store
    the result in L3)
  • Branching: Branch to L4 (note that some
    locations store program instructions)
  • Conditional branching: If (L1 > L2) goto L3
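The instruction semantics above can be sketched as a toy interpreter in Python (all names and the memory size are hypothetical, chosen only for illustration):

```python
# Toy model of the memory-to-memory instruction set: memory is a
# flat list, and every operand is a memory location.
memory = [0] * 16

def add(l1, l2, l3):
    """Add L1, L2, L3: contents of L1 plus contents of L2 into L3."""
    memory[l3] = memory[l1] + memory[l2]

def cond_branch(l1, l2, l3, pc):
    """If (L1 > L2) goto L3: return the next program counter."""
    return l3 if memory[l1] > memory[l2] else pc + 1

memory[0], memory[1] = 5, 7
add(0, 1, 2)                      # memory[2] = 12
pc = cond_branch(1, 0, 9, pc=3)   # 7 > 5, so control jumps to location 9
```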

[Diagram: Processor connected to Memory, shown as locations 0, 1, ..., k]
4
Registers
  • Processors are faster than memory
  • they can deal with data within the processor
    much faster
  • So, create some locations in processor for
    storing data
  • Called registers, often with a special register
    called the accumulator
  • Now we need new instructions for dealing with
    data in registers
  • Data movement instructions
  • Move from register to memory, memory to register,
    register to register, and memory to memory
  • Computation instructions
  • In addition to the previous ones, we now add
    instructions that allow one or more operands to
    be registers
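As a sketch, the new data-movement and register-operand instructions might be modeled like this (function names and sizes are invented for illustration):

```python
# Registers added alongside memory, with the four data-movement
# directions listed above: memory<->register, register<->register,
# and memory->memory.
memory = [0] * 16
registers = [0] * 4   # R0..R3; R0 might serve as the accumulator

def move_mem_to_reg(r, l):   registers[r] = memory[l]
def move_reg_to_mem(r, l):   memory[l] = registers[r]
def move_reg_to_reg(rd, rs): registers[rd] = registers[rs]
def move_mem_to_mem(ld, ls): memory[ld] = memory[ls]

# A computation instruction may now take register operands:
def add_reg(rd, rs):         registers[rd] += registers[rs]

memory[0] = 3
move_mem_to_reg(1, 0)   # R1 = 3
move_reg_to_reg(2, 1)   # R2 = 3
add_reg(2, 1)           # R2 = 6
move_reg_to_mem(2, 5)   # memory[5] = 6
```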

[Diagram: CPU with registers inside the processor, connected to Memory]
5
Load-Store architectures (RISC)
  • Do not allow memory locations to be operands
  • For computations as well as control instructions
  • Only instructions to reference memory are
  • Load R, L # move contents of L into register R
  • Store R, L # move contents of register R into
    memory location L
  • Notice that the number of instructions is now
    dramatically reduced
  • Further, allow only relatively simple
    instructions to do register-to-register
    operations
  • More complex operations implemented in software
  • Compiler has a bigger responsibility now
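To illustrate the compiler's bigger responsibility, here is a hypothetical sketch of how one memory-to-memory Add must be broken into load-store instructions (names and sizes invented):

```python
# In a load-store architecture, only Load and Store touch memory;
# Add works on registers only.
memory = [0] * 16
registers = [0] * 4

def load(r, l):      registers[r] = memory[l]                    # Load R, L
def store(r, l):     memory[l] = registers[r]                    # Store R, L
def add(rd, r1, r2): registers[rd] = registers[r1] + registers[r2]

memory[0], memory[1] = 10, 32
# The compiler turns "Add L0, L1, L2" into four instructions:
load(0, 0)      # R0 = memory[0]
load(1, 1)      # R1 = memory[1]
add(2, 0, 1)    # R2 = R0 + R1
store(2, 2)     # memory[2] = 42
```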

6
Caches
  • The processor still has to wait for data from
    memory
  • i.e., Load and Store instructions are slower
  • Although more often the CPU is executing
    register-only instructions
  • Load and store latency
  • Dictionary meaning: latency is the delay between
    a stimulus and its response
  • Or: the delay between a data-transfer instruction
    and the beginning of the data transfer
  • But, faster SRAM memory is available (although
    expensive)
  • Idea: just as with registers, put some more of
    the data in faster memory
  • Which data??
  • Principle of locality (empirical observation)
  • Data accessed correlates with past accesses,
    spatially and temporally
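A small back-of-the-envelope sketch of spatial locality (the line and element sizes below are assumptions, not from the slides): with 64-byte lines and 8-byte elements, sequential access reuses each line 8 times, while a stride of 8 elements touches a new line on every access:

```python
LINE_BYTES = 64   # assumed cache-line size
ELEM_BYTES = 8    # assumed element size (e.g., a double)

def lines_touched(indices):
    """Distinct cache lines covered by a sequence of element accesses."""
    return {(i * ELEM_BYTES) // LINE_BYTES for i in indices}

n = 1024
# Sequential scan: one cold miss per line, 8 elements per line.
sequential_misses = len(lines_touched(range(n)))          # 128 lines
# Stride of 8 elements: every access lands on a fresh line.
strided_misses = len(lines_touched(range(0, n * 8, 8)))   # 1024 lines
```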

7
Caches
The processor still issues load and store
instructions as before, but the cache controller
intercepts the requests and, if the location has
been cached, serves them from the cache. Data
transfer between the cache and memory is not seen
by the processor.
[Diagram: Processor, cache controller in front of the cache, and Memory]
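The intercept described above can be sketched as a toy simulation (the backing-store contents and the unbounded fully-associative cache are purely illustrative):

```python
# The processor only ever calls load(addr); the "controller" serves
# cached locations and fetches from memory, invisibly, on a miss.
memory = {a: a * 2 for a in range(100)}   # toy backing store
cache = {}                                 # address -> cached value

def load(addr):
    if addr not in cache:        # miss: controller fetches from memory
        cache[addr] = memory[addr]
    return cache[addr]           # hit, or freshly filled line

load(7)          # first access: miss, fills the cache
v = load(7)      # second access: hit, served without touching memory
```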
8
Cache Issues
  • Level 2 cache
  • Cache lines
  • Bring a bunch of data at once
  • exploit spatial locality
  • block transfers are faster
  • 64-128 byte cache lines typical
  • Trade-offs, or why larger and larger cache lines
    aren't good either
  • How is cache managed?
  • Its job: given an address, find whether it is in
    the cache, and return the contents if so
  • Also, write data back to memory when needed
  • and bring data from the memory when needed
  • Ideally, a fully associative cache would be good
  • Keep cache lines anywhere in the physical cache
  • But looking up an address is then hard

9
Cache management
  • Alternative scheme
  • Each cache line (i.e., address) has exactly one
    place in the cache memory where it can be stored
  • Of course, more than one cache line will have
    the same area of cache memory as its possible
    target
  • Why?
  • Only one cache line can live inside a cache block
    at a time
  • If you want to bring in a new one, the old one
    must be emptied
  • A trade-off: set-associative caches
  • Have each line map to more than one (say, 4)
    physical locations
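A hypothetical sketch of the direct-mapped scheme (slot count and line size invented): each line number maps to exactly one slot, so two lines with the same index evict each other even when the rest of the cache is empty:

```python
NUM_SLOTS, LINE = 4, 64
slots = [None] * NUM_SLOTS   # each slot holds one line number, or None

def access(addr):
    """Return 'hit' or 'miss'; a miss evicts whatever occupied the slot."""
    line = addr // LINE
    slot = line % NUM_SLOTS      # exactly one possible place in the cache
    if slots[slot] == line:
        return "hit"
    slots[slot] = line           # bring in the new line, evict the old
    return "miss"

access(0)            # line 0 -> slot 0: cold miss
access(0)            # hit
access(4 * LINE)     # line 4 -> slot 0: conflict, evicts line 0
r = access(0)        # miss again, even though slots 1-3 are empty
```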