Central Processing Unit - PowerPoint PPT Presentation

1 / 46
About This Presentation
Title:

Central Processing Unit

Description:

With the invention of complex ICs, hardware complexity became a non-issue. ... In the early 1980s, designers tried to balance that by moving towards simpler ... – PowerPoint PPT presentation

Number of Views:44
Avg rating:3.0/5.0
Slides: 47
Provided by: www24Br
Category:

less

Transcript and Presenter's Notes

Title: Central Processing Unit


1
Chapter 8
  • Central Processing Unit
  • Sample Realistic Designs

2
Major Components of the CPU
  • Every CPU consists of the three basic components
    shown in the figure below.
  • Registers hold the inputs of the ALU operations
    and eventually receive the results.
  • The control unitcontrols the operationof both
    ALU andregistersthrough the control signals.
  • The ALU performs the actualoperations.

Registers
Control Unit
ALU
3
Realistic Organization
  • With a large number of registers, dedicated
    connections are impossible.
  • Some form of BUS mechanism has to be used to
    organize the connections.
  • Most ALU operations require two pieces of data.
  • We can send them to temporary ALU registers one
    at a time.
  • Better yet, we can utilize a two bus system.

4
Register File Control
  • Once the results are ready, they have to be sent
    to the proper register for storage.
  • The instruction must specify the register and the
    control unit must enable the right control input.
  • Registers are usually given a code to reduce the
    size of the instructions.
  • This code can be decoded to create the needed
    load control inputs for the registers.

5
Controlling the ALU
  • The ALU is a multi-function unit.
  • The control unit needs to specify through control
    signals which operation to be performed.
  • Putting all of the above together, each
    microoperation needs the following information
  • Select inputs for BUS A.
  • Select inputs for BUS B
  • Destination register code.
  • ALU operation code.

6
Register Organization
Input Data
R1
R2
R3
R4
R5
R6
R7
Load Controls
S E L B
S E L A
MUX 1
MUX 2
3 X 8 Decoder
Bus B
Bus A
Arithmetic Logic Unit ALU
O P R
SELD
3
3
3
5
Control Word
SELA
SELB
SELD
OPER
7
Example of a Microoperation
  • The control word mentioned above has 14 bits.
    They will represent the different parts of the
    microoperations to be performed.
  • For example, the microoperation
  • R1 ? R2 R3
  • has the following control word
  • Field SELA SELB SELD OPER
  • R2 R3 R1 SUB
  • 010 011 001 00101
  • The control word is
  • 010 011 001 000101

8
Stack Organization
  • A useful feature that is included in almost every
    CPU.
  • The stack is a storage device that stores
    information in a Last In First Out (LIFO) manner.
  • The stack in digital computers is essentially a
    memory unit with a dedicated address register
    the Stack Pointer that continuously points to
    the upper most item in the stack.
  • Items are added to the stack using a PUSH
    operation and removed from it using a POP.
  • These operations are simulated through
    incrementing and decrementing the register.

9
Register Stack
  • If the microprocessor has enough registers, it is
    actually possible to implement the stack
    operation using registers.
  • The stack pointer register in this situation
    would contain the index of the register
    containing the item at the top of the stack.
  • With a register stack there would also be a need
    for a couple of flag registers to determine when
    the stack is completely full or completely empty.

10
Stack Based CPUs
  • There are several CPUs that were designed without
    general purpose registers.
  • Instead these CPUs had a fast memory stack that
    could be used instead of the registers.
  • In order to effectively use such a system,
    mathematical expressions have to be re-written in
    a slightly different manner.

11
Infix Notation
  • Common arithmetic expressions are written with
    the operator between its operands.
  • This causes a problem for programmers.
  • Consider the following expression
  • A B C D
  • The program must
  • Read the entire expression
  • Extract all of the operands
  • Extract all of the operations
  • Decide which operations to do first.

12
Prefix Notation
  • It is possible to re-write arithmetic expressions
    so that the operation is specified before its
    operands.
  • This way, there is no need to parse the entire
    expression.
  • Read an operation, scan forward until the its two
    operands are obtained, execute it, continue.
  • The previous expression can be re-written as
  • A B C D
  • We read the operator first.
  • Scan forward, we find an operation. Therefore,
    the first operand of the is the result of this
    . Perform the operation.
  • We find another operation. Perform it.
  • Now we have the two operands for the , perform
    it.

13
Reverse Polish Notation
  • The previous notation is known either as prefix
    notation or Polish Notation since it was
    defined by a Polish mathematician.
  • A more popular notation is actually the reverse
    of this one Reverse Polish Notation (RPN).
  • Postfix notation.
  • The operands are specified first, then the
    operator.
  • This notation is extremely popular with stack
    based CPUs.
  • Like the CPUs in HPs scientific calculators.

14
RPN
  • Our sample expression can be written as
  • A B C D
  • It can be evaluated as follows
  • Scan from the left, as soon as an operation is
    found, perform it on the two operands immediately
    to its left.
  • Replace the operation and its two operands with
    the result.
  • Continue forward.

15
RPN Evaluation Example
  • Evaluate the following expression
  • 1 2 3 4 5 6
  • First, re-write it in RPN
  • 4 5 6 2 3 1
  • We find 4 5 first. Evaluate that. 20. The
    expression now becomes
  • 20 6 2 3 1
  • Then we find 20 6 . Evaluate it. 120.
  • 120 2 3 1
  • Now we find 2 3 . Evaluate it. 6.
  • 120 6 1
  • Now we find 120 6 . Evaluate it. 126.
  • 126 1
  • Evaluate the last expression 126 1 .
  • The result is 127.

16
Conversion to RPN
  • We must follow the hierarchy of operations
  • Perform all operations inside inner parenthesis
    first, then outer ones.
  • Perform multiplication and division before
    addition and subtraction.
  • Example
  • Translate the following expression to RPN
  • (A B) C (D E) F
  • The result is
  • A B D E C F

17
Evaluating RPN Expressions with a Stack
  • Push all operands on the stack until the first
    operation.
  • Pop the first two elements off the stack and
    perform the operation.
  • Push the result back on the stack.
  • Continue.

18
Example
  • Evaluate the following expression using a stack
  • 3 4 5 6
  • Push the 3 on the stack.
  • Push the 4 on the stack.
  • Pop the 4 and the 3, perform the operation.
  • Push the result (12) on the stack.
  • Push 5 on the stack.
  • Push 6 on the stack.
  • Pop the 6 and the 5, perform the operation.
  • Push the result (30) on the stack.
  • Pop the 30 and the 12 off the stack, perform the
    operation.
  • Push the result (42) on the stack.

19
Example 2
  • Evaluate the following expression using a stack
  • 1 2 3 4 5 6
  • First, re-write it in RPN
  • 4 5 6 2 3 1

3
2
5
6
2
6
1
4
20
120
120
120
126
4
20
120
126
127
4
5

6

2
3


1

20
Section 8.8
  • Reduced Instruction Set Computer
  • (RISC)

21
Instruction Set vs. Architecture
  • The design of the instruction set is an important
    aspect of computer architecture.
  • The instruction set chosen determines the way
    machine language programs are constructed.
  • Early computers had small and simple instruction
    sets.
  • Due mainly to the need to reduce the hardware
    needed to implement them.

22
CISC - Complex Instruction Set Computers
  • With the invention of complex ICs, hardware
    complexity became a non-issue. This lead to the
    development of some highly complex architectures.
  • Architectures with instruction sets that
    contained more than 100 instructions became
    widely spread.
  • The trend was to move operations from software to
    hardware.
  • Machine instructions like COS, SIN and TAN
    started to appear.
  • Actually, some processors also had machine
    instructions for matrix operations.

23
RISC - Reduced Instruction Set Computers
  • Complex instruction sets had a large number of
    complex instructions.
  • The complex instructions required a long time to
    execute.
  • The instructions required a lot of memory
    accesses.
  • Some of the instructions were so specialized that
    they were used quite infrequently.
  • In the early 1980s, designers tried to balance
    that by moving towards simpler instruction sets.

24
CISC Characteristics
  • Designers wanted to simplify the process of
    compilation.
  • Rather than translate a high level language
    instruction into many machine language
    instructions, why not design machine language
    instructions that implemented them directly.
  • Complex machine language instructions.

25
CISC Characteristics
  • In order to be efficient with memory use,
    variable length instructions were used.
  • Register based instructions were short, 1-2
    bytes, while memory based instructions were long,
    up to 5 bytes.
  • Packing such variable length instructions into a
    fixed-length memory requires some very special
    decoding circuits.

26
CISC Characteristics
  • Instructions in typical CISC processors provide
    for direct manipulation of operands in memory.
  • This will require multiple memory references
    during the execution of the instruction.
  • The reason for including these instructions is to
    simplify the compilation of high-level language
    programs.
  • Remember, most variables in a high-level language
    program are implemented as memory locations.
  • As more instructions and addressing modes are
    added to a processor, more logic would be needed
    to support them.
  • Ultimately, this leads to a lower performance.

27
CISC Characteristics in Summary
  • A large number of instructions.
  • Typically 100 200 instructions.
  • Some instructions that perform specialized tasks
    and are used infrequently.
  • A large variety of addressing modes.
  • Variable length instruction formats.
  • Instructions that manipulate operands in memory
    directly.

28
RISC Characteristics
  • RISC tries to reduce execution time by
    simplifying the instruction set of the computer.
  • The basic characteristics of RISC processors are
  • Relatively few instructions.
  • Relatively few addressing modes.
  • Memory access limited to load and store
    instructions.
  • All operations are done within the registers of
    the CPU.
  • Fixed-length, easily decoded instruction formats.
  • Single cycle instruction execution.
  • Hardwired rather than micro-programmed control
    units.

29
RISC Characteristics
  • Mostly register to register operations.
  • Only simple load and store for memory access.
  • Operands are read into registers using a load
    instruction.
  • The operation is done between the registers.
  • Results are stored in memory using an explicit
    store instruction.
  • This simplifies the instruction set and forces
    the optimization of register usage.
  • This also removes the need for many complex
    addressing modes.

30
RISC Characteristics
  • Simple instruction formats.
  • The instruction length is fixed.
  • Instructions are aligned to memory words.
  • Easy to decode instruction formats.
  • This simplifies the control logic.
  • Hard-wired control is used to speed-up the
    generation of control signals.

31
Single Cycle Instruction Execution
  • A main feature of RISC processors is their
    ability to complete the execution of an
    instruction every clock cycle.
  • This is done by overlapping the fetch, decode and
    execute cycles of two or three instructions by
    using pipelining.
  • Most CISC processors today also depend on this
    important feature for speeding up their
    performance.

32
Additional Features of RISC Processors
  • A relatively large number of registers.
  • This would be useful for storing intermediate
    results.
  • Use of overlapped register windows.
  • This helps speed-up procedure calls.
  • Compiler support for efficient translation of
    high-level language programs to make use of these
    features.

33
Over-lapped Register Windows
  • When a function call in a high-level language
    program requires many operations to implement
  • Register values in the calling program must be
    saved.
  • Parameters must be placed into appropriate
    registers for the subroutine.
  • The subroutine is called.
  • On the return path, a similar set of operations
    are needed.
  • The subroutine has to save its return values in
    the appropriate registers.
  • Control is returned to the calling program.
  • Registers values in the calling program are
    restored.
  • All of these are very time consuming.

34
Over-lapped Register Windows
  • Given that function calls occur very often in
    high-level language programs, someway of speeding
    up this process has to be found.
  • Some processors use a separate register bank for
    each procedure.
  • No need to save and restore the calling
    procedures registers.
  • RISC processors do a similar thing but it is not
    dedicated.

35
Over-lapped Register Windows
  • Each procedure is allocated a group of registers.
  • When a function call is being executed, a set of
    registers are automatically assigned to the new
    procedure.
  • Therefore, there is no need to save and restore
    the calling procedures registers.
  • This new set of registers overlaps by a certain
    amount with the registers of the calling
    procedure.
  • This overlap is used for passing parameters.
  • When a function is terminated, the registers
    allocated to it are freed for later use by a
    different procedure.

36
Over-lapped Register Window Example
R0
  • Our CPU has a total of 74 registers.
  • When a program starts, 10 registers are allocated
    for global data.
  • The main program is allocated 10 registers for
    its local data.
  • The main program calls function A.
  • 6 registers are allocated for passing data back
    and forth between main and A.
  • 10 registers are allocated to A for local data.
  • A calls B.
  • 6 registers are allocated common to A and B.
  • 10 registers are allocated to B for local data.
  • Each procedure can access a total of 32 registers.

Global Reg.
R9
R10
Main Local Reg.
R19
R20
Shared Main, A
R25
R26
A Local Reg.
R35
R36
Shared A, B
R41
R42
B Local Reg.
R51
R52
Free Reg.
R73
37
Over-lapped Register Window Example
R15 R10 R73 R64
Common to D and A
Local to D
R63 R58
Common to C and D
R57 R48
Proc D
Local to c
R47 R42
Common to B and C
R41 R32
Proc C
Local to B
R31 R26
Common to A and B
R9 R0
R25 R16
Proc B
Common to all procedures
Local to A
R15 R10
Common to A and D
Proc A
Global registers
38
Effects on Programming
  • High level languages no effect.
  • All of this is done by the compiler.
  • Assembly language registers no longer have a
    set name.
  • If you write an instruction of the form
  • ADD R1, R2
  • there is no guarantee that you will actually use
    R1 and R2 of the processor.
  • The above instruction means, use the first
    register and the second register in my window.

39
Over-lapped Window Parameters
  • The number of registers allocated for each type
    is a parameter of the processor design.
  • G the number of global registers.
  • L the number of local registers.
  • C the number of common register.
  • W the number of windows
  • Window size L 2C G
  • Registers in the processors (L C)W G (reg.
    file)
  • all of these depend on the design of the CPU.
  • G 10, L 10, C 6 , W 4
  • 101210 32 reg. register file (106)x4 10
    74
  • In some processor designs, these parameters are
    decided dynamically.
  • Depending on the total number of procedures and
    the total number of registers, a procedures
    window size may change.
  • The operating system now has to be very
    intelligent.

40
Berkeley RISC I Example RISC CPU
  • 32-bit processor.
  • 32-bit address.
  • 8, 16, or 32-bit data.
  • 32-bit instruction format.
  • 31 instructions.
  • 12 Data Manipulation.
  • 11 Data Transfer.
  • 8 Program Control.
  • 3 addressing modes
  • Register.
  • Immediate.
  • Relative to PC.
  • 138 registers.
  • 10 Global registers.
  • 8 windows of 32 registers each.

41
Berkeley RISC I
  • Since only 32 registers are accessible at any
    point in time, only 5 bits are needed for
    register selection.
  • Instructions utilize a three address format.
  • Destination Register.
  • Source Register.
  • Second Source Register or Immediate Data.
  • Register R0 is a constant 0 all the time.
  • It can be used to fool the processor into
    performing additional addressing modes.

42
Instruction Formats
Register Mode
S2
Not Used
0
Rs
Rd
Opcode
5
8
1
5
5
8
S2 - Register
Register-Immediate Mode
S2
1
Rs
Rd
Opcode
13
1
5
5
8
S2 Immediate Data
Signed-extended 13 bit const
PC Relative Mode
Y
Cond
Opcode
19
5
8
Y Relative Displacement
43
Instruction Formats
  • Memory access instructions use Rs to specify a
    32-bit address in a register and S2 to specify an
    offset
  • R0 contains all 0s (zero quantity)
  • COND field replaces the Rd field for jump
    instructions (specify one of 16 possible branch
    conditions)

44
Instruction Formats
  • ADD R22,R21,R23 R23 ? R22 R21
  • ADD R22,150,R23 R23 ? R22 150
  • ADD R0,R21,R22 R22 ? R21 (MOVE)
  • ADD R0, 150,R22 R22 ? 150(Load immed)
  • ADD R22, 1,R22 R22 ? R22 1 (Inc)
  • LDL (R22) 150,R5 R5 ? MR22 150
  • LDL (R22) 0,R5 R5 ? MR22
  • LDL (R0) 500,R5 R5 ? M500

45
Data Transfer Operations
46
Data Manipulation Operations
Write a Comment
User Comments (0)
About PowerShow.com