Transmeta and Dynamic Code Optimization

About This Presentation

Slides: 16

Provided by: ashwinrb

Learn more at: https://cs.login.cmu.edu

Tags: code | dynamic | fetching | optimization | transmeta

Transcript and Presenter's Notes

1
Transmeta and Dynamic Code Optimization

2
Stuff Compilers Don't (Can't?) Do

3
Therefore: Dynamic Code Optimization

4
How Do You Implement This?

5
I-COP (Instruction Path Coprocessors)

What?
Add another processor that watches instructions as they retire and can perform operations on them
Why?
Performance!
Principles
Keep the optimizations out of the critical path
Avoid slowdown due to software

6
Structure

Multiple VLIW processor slices keep the I-COP simple while still letting it keep up with the main processor
I-COP slices have 10 special instructions for pattern matching, in addition to 12 normal RISC-type instructions

7
Applications of I-COP

8
The I-COP Processor

Multiple VLIW slices allow multi-level, statically scheduled, explicitly encoded parallelism
Predication and delay slots obviate branch prediction
32 integer registers, 8 predicate registers
22 instructions: 12 RISC-type and 10 special (pattern matching, bit manipulation, instrumentation)
Fill buffer collects instructions for analysis
Task queue acts as a FIFO scheduler
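
The fill buffer and task queue can be pictured with a small software model. This is only a sketch under stated assumptions: the class name `ICopModel`, the `fill_capacity` parameter, and the idea that a full buffer is handed off as one task are illustrative choices, not details given in the slides.

```python
from collections import deque


class ICopModel:
    """Toy model of the I-COP front end (hypothetical names throughout)."""

    def __init__(self, fill_capacity=4):
        self.fill_capacity = fill_capacity  # assumed buffer size
        self.fill_buffer = []               # collects retired instructions
        self.task_queue = deque()           # FIFO of traces awaiting analysis

    def retire(self, instruction):
        """Called as the main processor retires an instruction.

        Only appends to a buffer, so the main pipeline never waits
        on the coprocessor.
        """
        self.fill_buffer.append(instruction)
        if len(self.fill_buffer) == self.fill_capacity:
            # Hand the full buffer off as one task and start a new one.
            self.task_queue.append(tuple(self.fill_buffer))
            self.fill_buffer = []

    def run_one_task(self):
        """Take the oldest pending trace (FIFO order), or None if idle."""
        if not self.task_queue:
            return None
        return self.task_queue.popleft()
```

Because `retire` does nothing but buffer, analysis happens whenever the coprocessor gets around to calling `run_one_task`, which matches the principle of keeping optimizations out of the critical path.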

9
The I-COP Processor (cont.)

10
Examples Of Special Instructions

SearchReplace
Finds a given pattern and replaces it with another given pattern; returns the number of replacements made
Subset
Tests whether the bits set in a given register are a subset of those set in a second register
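
The semantics of these two instructions can be sketched in software. The slides do not give operand formats, so this example assumes that registers are plain integers and that SearchReplace works on a list of instruction words with non-overlapping, left-to-right matching; the function names are hypothetical.

```python
def subset(a: int, b: int) -> bool:
    """True if every bit set in a is also set in b."""
    return (a & ~b) == 0


def search_replace(buffer, pattern, replacement):
    """Replace each non-overlapping occurrence of pattern in buffer.

    Returns (new_buffer, number_of_replacements).
    """
    if not pattern:
        raise ValueError("pattern must be non-empty")
    out = []
    count = 0
    i = 0
    n = len(pattern)
    while i < len(buffer):
        if buffer[i:i + n] == pattern:
            out.extend(replacement)  # emit the replacement words
            count += 1
            i += n                   # skip past the matched words
        else:
            out.append(buffer[i])
            i += 1
    return out, count
```

For example, `subset(0b0101, 0b1101)` is true because both set bits of the first value are also set in the second, while `search_replace([1, 2, 3, 1, 2], [1, 2], [9])` yields `([9, 3, 9], 2)`.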

11
Transmeta Crusoe

12
So how do they do it?

13
Cheesy Marketing Image

14
Code-Morphing Software

Translates an entire basic block at once
Also performs instruction reordering, branch prediction, and register renaming
Translations are stored in a translation cache (part of main memory)
Instruments code to help with branch prediction and to detect candidates for heavier optimization
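
The translate-once, reuse-many-times idea behind the translation cache can be sketched as follows. This is a simplified illustration and not Transmeta's implementation: the class `TranslationCache`, the `translate` callback, and the `hot_threshold` value are assumptions, and a simple execution counter stands in for the instrumentation the slide mentions.

```python
class TranslationCache:
    """Toy translation cache keyed by block address (hypothetical API)."""

    def __init__(self, translate, hot_threshold=3):
        self.translate = translate          # callback: source block -> native block
        self.hot_threshold = hot_threshold  # assumed cutoff for "hot" blocks
        self.cache = {}                     # block address -> translated code
        self.exec_counts = {}               # block address -> times executed
        self.translations_done = 0

    def lookup(self, block_addr, block_code):
        """Return the translation for a block, translating on first use."""
        if block_addr not in self.cache:
            self.cache[block_addr] = self.translate(block_code)
            self.translations_done += 1
        # Instrumentation: count executions to find optimization candidates.
        self.exec_counts[block_addr] = self.exec_counts.get(block_addr, 0) + 1
        return self.cache[block_addr]

    def hot_blocks(self):
        """Addresses executed often enough to merit heavier optimization."""
        return [addr for addr, n in self.exec_counts.items()
                if n >= self.hot_threshold]
```

A block that runs many times pays the translation cost once; the counter then flags it as a candidate for a second, more aggressive optimization pass.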

15
Code-Morphing Software (cont.)

Also has some help from the hardware
Shadowed and working register sets
Alias hardware (load-and-protect operations)
Translated bit for each page table entry
Performance of systems with Crusoe: 2-3 times longer battery life, with performance comparable to Intel mobile processors
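
The shadowed and working register sets listed above can be pictured as a commit/rollback pair: translated code updates the working copy, and the shadow copy holds the last known-good state. The sketch below is a minimal model under that assumption; the class and method names are hypothetical and the register names are just examples.

```python
class ShadowedRegisters:
    """Toy model of working registers backed by a shadow copy."""

    def __init__(self, names):
        self.working = {name: 0 for name in names}  # updated by translated code
        self.shadow = dict(self.working)            # last committed state

    def commit(self):
        """Translation finished cleanly: make the working state official."""
        self.shadow = dict(self.working)

    def rollback(self):
        """Something went wrong (e.g. an exception): restore committed state."""
        self.working = dict(self.shadow)
```

With this arrangement, speculative or reordered work inside a translation can be discarded simply by calling `rollback`, after which execution can be retried more conservatively.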
