Scalable Program Analysis Using Boolean Satisfiability: The Saturn Project - PowerPoint PPT Presentation

1 / 52

About This Presentation

Title:

Scalable Program Analysis Using Boolean Satisfiability: The Saturn Project

Description:

Saturn. 8. A Parable Continued ... Saturn. 9. This Talk. An approach to achieving both precision and scalability ... Saturn. 10. The Main Idea. For precision, ... – PowerPoint PPT presentation

Number of Views:83

Avg rating:3.0/5.0

Slides: 53

Provided by: yiche2

Learn more at: https://www.cis.upenn.edu

Category:

more less

Transcript and Presenter's Notes

Title: Scalable Program Analysis Using Boolean Satisfiability: The Saturn Project

1
Scalable Program Analysis Using Boolean
SatisfiabilityThe Saturn Project

Alex Aiken
Stanford University

2
The (Current) Idea

Verify properties of large systems!

3
Well, No . . .

Some systems work on large programs
Millions of lines of code
Some systems verify properties
E.g., alias-aware type state
Some do both
But only in conference papers

4
Scaling vs. Precision

Scaling
Need to handle multi-million line programs
Why?
Because that is where automatic analysis does the
most good
Because they are there
Pushes towards low-complexity algorithms
Precision
High degree of automation a requirement
Little user input (few annotations)
Efficient to use output (few spurious warnings)
Pushes towards high-complexity algorithms

5
Set-up For A Story . . .

Alias Analysis
Basic to verification
Paradigmatic problem
x
y
Can x and y be aliases?

Dimensions of precision
sensitive,insensitive
Flow-
X 1
Y X 1
Context-
F() H()
G() H()

6
A Parable About Alias Analysis
Four KLOC of code from Linux . . .
The limit of (most) flow-sensitive,
context-sensitive alias analyses.
One KLOC of code from Linux . . .
One page of code from Linux . . .
7
A Parable Continued
200 KLOC
Context-sensitive, flow-insensitive alias
analysis to 600 KLOC
8
A Parable Continued
Flow-insensitive, context-insensitive alias
analysis scales to 2MLOC
But . . . Linux is 6MLOC Windows is 50MLOC
9
This Talk

An approach to achieving both precision and
scalability
Based on SAT and other constraint solvers
Some examples
A sound alias analysis
Unsound null dereference analysis
Unsound lock checker

10
The Main Idea

For precision, delay abstraction
Model function/loop bodies very precisely
(Almost) no abstraction intraprocedurally
For scalability, abstract at function boundaries
Summarize a functions behavior
Summaries designed per property
Analysis design summary design
Intuition Programmers also abstract at these
boundaries

11
Straight-line Code

void f(int x, int y)
int z x y
assert(z x)

x
z
y

R
12
Straight-line Code

void f(int x, int y)
int z x y
assert(z x)

Query Is-Satisfiable(? )
Answer Yes x 001 y 000 Negated
assertion is satisfiable. Therefore, the asserti
on may fail.
R
13
Control Flow Preparation

Our approach
Assume a loop free program
Treat loops as tail-recursive functions
Loops and functions handled the same way

14
Control Flow Example
if (c) x a else x
b
res x
G c, x a31a0 G ?c, x b31b0 G c ? ?
c, x v31v0
where vi (c?ai)?(?c?bi)
if (c)
?c
c
x a
x b
true
res x

Merges
preserve path sensitivity
select bits based on the values of incoming guards

15
Pointers Overview

May point to different locations
Thus, use points-to sets
p l1,,ln
but path sensitive
Use guards on points-to relationships
p (g1, l1), , (gn, ln)

16
Pointers Example
G true, p (true, x)
p x if (c) p y res p
if (c) res y else if (?c) res x
G c, p (true, y)
G true, p (c, y) (??c, x)
17
Pointers Recap