Empirical Software Engineering Strategies - PowerPoint PPT Presentation

1 / 59

About This Presentation

Title:

Empirical Software Engineering Strategies

Description:

Survey (interview, questionnaire) ex post facto ... Manipulate variables of interest (software, treatments, expertise) Small scale tasks ... – PowerPoint PPT presentation

Number of Views:67

Avg rating:3.0/5.0

Slides: 60

Provided by: catherinev9

Category:

more less

Transcript and Presenter's Notes

Title: Empirical Software Engineering Strategies

1
Empirical Software Engineering Strategies

Anneliese Amschler Andrews
Department of Computer Science
University of Denver
andrews_at_cs.du.edu

2
Types

Survey (interview, questionnaire)
ex post facto
Case Study (observation, measurement, statistical
analysis)
Experiment (highly controlled, manipulate one or
more variables)
quasi-experiment (no random
assignment)

3
Surveys

Retrospective
Understand population
Many variables possible
Purposes descriptive, explanatory, explorative
Questionnaires, Interviews

4
Case Studies

Single entity or phenomenon
Specific time and space
Data collection and analysis
Industrial evaluation (process improvement,
different techniques)
Comparison (sister project, partial application)
Validation
Multiple sites

5
Case study

Concern
Small case study may not scale
Lack of control
Influence of confounding factors
Generalizability
Not reproducible

6
Case Study Positives

Positives
Typical behavior
Realistic
Scale
Usually many variables

7
Experiments

Precise control
Manipulate variables of interest (software,
treatments, expertise)
Small scale tasks
High cost
Confirm theories, conventional wisdom
Explore relationships
Evaluate accuracy of models
Validate measures

8
Experiment process

Definition
Planning
Operation
Analysis and interpretation
Presentation and package

9
Strategies Comparison
10
(No Transcript)
11
(No Transcript)
12
(No Transcript)
13
(No Transcript)
14
(No Transcript)
15
(No Transcript)
16
Define Experiment- Goal definition template

Object of study
Purpose
Quality focus
Perspective
Context

17
Experiment context
18
Definition framework
19
Experiment Planning

Context selection
Hypothesis formulation
Variable selection
Selection of subjects
Experiment design
Instrumentation
Validity evaluation

20
Context Selection

Goal large, real projects, professional staff
Trade-offs necessary
Off-line vs. on-line
Students vs. professionals
Toy vs. real problems
Specific vs. general

21
Hypothesis Formulation

Null hypothesis, H0
only reason for different outcomes is accidental
Alternative hypothesis, Ha, H1
Reject null hypothesis
Formulation of hypothesis drives statistical tests

22
Errors in Hypothesis Testing

Type-I error
P(type-I error)P(reject H0H0 true)
Type-II error
P(type-II error)P(not reject H0 H0 false)
Power of test can it reveal true pattern in
collected data
powerP(reject H0H0 false)1-P(type-II error)

23
Variables Selection

Independent variables
Controllable
Influence dependent ones
Levels of measurement
Dependent variables
One or more
Direct or indirect (validate!)
Define measurement scales, range of values

24
Selection of Subjects

Influences generalization of results
Population sample
Probability or non-probability sampling

25
Probability Sampling Types

Simple random sampling
Systematic sampling
Stratifies random sampling
Convenience sampling
Quota sampling

26
Choosing Sample Size

Influences error, power of statistical test
Large variability gt large sample
Data analysis method influences sample size

27
Experiment Design

Influenced by
hypotheses,
necessary treatments,
power of statistical tests,
measurement scales,
objects and subjects

28
General Design Principles

Randomization (objects, subjects, order of tests,
population sampling)
Blocking (factor of no interest that influences
outcomes) group by factor level (e. g.
experience)
Balancing (equal number of subjects for each
treatment)

29
Standard Design Types

One factor, 2 treatments
One factor, gt2 treatments
Two factors, 2 treatments
More than two factors each with 2 treatments

30
1 factor, 2 treatments

Completely randomized
1 treatment (T1, T2) per subject
Needs more subjects
Random assignment of S to T1/T2
Paired Comparison
Subject has both treatments
Order of treatment randomized
Balanced sam number of subjects for each order
of treatments

31
1 factor , gt 2 treatments

Completely randomized
1 treatment per subject
Random assignment of treatments
Needs more subjects
Randomized complete block design
Variability between subjects large
Block subjects into groups
Assign order of treatments within block randomly

32
2 factors

22 factorial design
2 factors, each with 2 treatments
4 possible treatment pairs
Randomly assign subjects to each treatment pair
2 stage nested design
Connections between factors leading to related
treatments
Ex F1 PL (OO/F) F2 (not) fault-prone
Qu efficiency of unit testing

33
gt2 factors

2k factorial design
Factors k3, 2 treatments per factor
8 combinations of factors and treatments
Assign subjects randomly to each (balance!)
More than 2 treatments construct Latin square

34
2k fractional factorial design

Assumption higher order factor interactions
negligible
Sparsity of effect principle
Projection property
Sequential experimentation
One-half fractional factorial design
Choose half of full factorial design
Removing 1 factor results in full factorial
design for remainder

35
2k fractional factorial design (cont.)

One-quarter fractional factorial design
Choose one quarter of full fact. Des.
Assumes fewer important factor interactions
May be used sequentially to screen importance of
factor interactions

36
Instrumentation

Objects specs., code, designs
Need to know/control properties (e. g.
seeded/actual faults for code inspection)
Guidelines for participants
Written instructions
Training
Measurement instruments
Unobtrusive during task
Tests, forms, questionnaires
Instrumentation must not affect outcome of
experiment

37
Validity