Evaluation of Information Systems Introduction and Overview - PowerPoint PPT Presentation

1 / 64
About This Presentation
Title:

Evaluation of Information Systems Introduction and Overview

Description:

Then resume full scale development of the system using some other life cycle model ... The measurement scales form the French word for black, noir (as in 'film noir' ... – PowerPoint PPT presentation

Number of Views:53
Avg rating:3.0/5.0
Slides: 65
Provided by: gle9
Category:

less

Transcript and Presenter's Notes

Title: Evaluation of Information Systems Introduction and Overview


1
Evaluation of Information SystemsIntroduction
and Overview
  • INFO 630
  • Glenn Booker

2
Syllabus
  • This class focuses on understanding the types of
    measurements which can support a software
    development or maintenance project
  • We will use the statistics program SPSS to
    manipulate data and generate graphs
  • The Kan text is supplemented by optional readings

3
My Biases
  • DOD and FAA background
  • Systems Engineering approach - because software
    doesnt live in a vacuum!
  • Mostly work with long-lived systems, so
    maintenance issues get lots of attention
  • Metrics focus on supporting decision making
    during a project

4
Why So Many Military Sources?
  • They have vast experience with complex software
    and systems development and acquisition,
  • Which was paid for with tax dollars, so
  • Much of its FREE!

5
Who cares
  • about statistics and measuring software
    activities?
  • The main models for guiding a software project,
    ISO 9000 and the Capability Maturity Model
    Integration (CMMI), both recommend use of
    statistical process control (SPC) techniques to
    help predict future performance by an organization

6
Software Crisis
  • For every six new large-scale software systems
    put into operation, two others are canceled
  • Average software development project overshoots
    its schedule by 50
  • Three quarters of all large scale systems are
    operating failures that either do not function as
    intended or are not used at all

7
Software Crisis
  • Most computer code is handcrafted from raw
    programming languages by artisans using
    techniques they neither measure or are able to
    repeat consistently
  • There is a desperate need to evaluate software
    product and process through measurement and
    analysis
  • Thats why we have required this course!

8
Waterfall LifeCycle Model
9
Waterfall Model
  • Conceptual Development includes defining the
    overall purpose of the product, who would use it,
    and how it relates to other products
  • Requirements Analysis includes definition of WHAT
    the product must do, such as performance goals,
    types of functionality, etc.

10
Waterfall Model
  • Architectural Design, or high level design,
    determines the internal and external interfaces,
    component boundaries and structures, and data
    structures
  • Detailed Design, or low level design, breaks the
    high level design down into detailed
    requirements for every module

11
Waterfall Model
  • Coding is the actual writing of source code,
    scripts, macros, and other artifacts
  • Unit Testing covers testing the functionality of
    each module against its requirements
  • System Testing can include string or component
    tests of several related modules, integration
    testing of several major components, and full
    scale system testing

12
Waterfall Model
  • After system testing, there may be early release
    options, such as alpha and beta testing, before
    official release of the product
  • Early releases test the ability of your
    organization to deliver and support the product,
    respond to customer inquiries, and fix problems

13
Prototyping Life Cycle
  • When requirements are very unclear, an iterative
    prototyping approach can be used to resolve
    interface and feature requirements before the
    rest of development is done
  • Do preliminary requirements analysis
  • Iterate Quick Design, Build Prototype, Refine
    Design until customer is happy

14
Prototyping Life Cycle
  • Then resume full scale development of the system
    using some other life cycle model
  • Its critical to do quick development cycles
    during prototyping, or else youre just
    redeveloping the whole system over and over

15
Spiral Life Cycle
  • Used for resolving severe risks before
    development begins, the spiral life cycle uses
    more types of techniques than just prototyping to
    resolve each big risk
  • Then another life cycle is used to develop the
    system

16
Iterative Life Cycle
  • Many modern techniques, such as the Rational
    Unified Process (RUP) advocate an iterative life
    cycle
  • RUP has four major phases, defined by the
    maturity of the system rather than traditional
    life cycle activities
  • Inception, Elaboration, Construction, and
    Transition

17
Iterative Life Cycle
  • Like the spiral, iterative life cycles are driven
    by the need to resolve key risks, but here they
    are resolved all the way to implementation
  • Much more focus on early implementation of the
    core system, then building on it with each
    iteration

18
Cleanroom Methodology
  • The Cleanroom methodology is a severely rigorous
    approach to software development
  • Uses formal design specification, statistical
    testing, and no unit testing
  • Produces software with certifiable levels of
    reliability
  • Very rarely used

19
Life Cycle Standards
  • The IEEE Software Engineering Standards are one
    source of information on many aspects of software
    development and maintenance
  • The standard ISO/IEC 12207, Software Life Cycle
    Processes has collected all major life cycle
    activities into one overall guidance document

You can download ISO/IEC 12207 see IEEE
instructions on my web site
20
Process Maturity Models
  • Quality standards and goals are often embodied in
    process maturity standards, to guide
    organizations process improvement efforts
  • The primary software standard is the Software
    Engineering Institutes (SEIs) Capability
    Maturity Model Integration (CMMI)

21
CMMI
  • Describes five maturity levels
  • 1. Initial all processes are ad hoc, chaotic,
    not well defined. Do your own thing.
  • 2. Repeatable a project follows a set of defined
    processes for management and conduct of software
    development

22
CMMI
  • 3. Defined every project within the organization
    follows processes tailored from a common set of
    templates
  • 4. Managed statistical control over processes
    has been achieved
  • 5. Optimizing defect prevention and application
    of innovative new process methods are used

23
Other CMMs
  • CMMI is based on the original CMM for Software
    (SW-CMM)
  • The latter led to many other variations before
    the models were integrated circa 2000

24
Malcolm Baldrige
  • The Malcolm Baldrige National Quality Award
    (MBNQA) is a US-based quality award created in
    1988 by the Department of Commerce
  • Includes a broader scope, such as customer
    satisfaction, strategic planning, and human
    resource management

25
ISO 9000
  • The international standard for quality management
    of an organization is ISO 9000
  • Now applies to almost every type of business, but
    was first used for manufacturing
  • Hence includes activities like calibration of
    tools

26
ISO 9000
  • Is facility-based, whereas CMMI is
    organization-based
  • Was revised and republished in December 2000
  • Previous editions were dated 1987 and 1994

27
Enter Measurement
  • Measurement is critical to all process and
    quality models (CMMI, ISO 9000, MBNQA, etc.)
  • Need to define basic concepts of measurement so
    we can speak the same language

28
Engineering in a Nutshell
29
Engineering in a Nutshell
  • So in order to create any Product, we need
    Resources to use Tools in accordance with some
    Processes
  • Each of those major areas (Product, Resources,
    Tools, and Processes) can be a focus of
    measurement

30
Measurement Needs
  • Statistical meaning - need long set of
    measurements for one project, and/or many
    projects
  • Could use measurement to test specific
    hypotheses
  • Industry uses of measurement are to help make
    decisions and track progress
  • Need scales to make measurements!

31
Measurement Scales
  • The measurement scales form the French word for
    black, noir (as in film noir)
  • Nominal (least useful)
  • Ordinal
  • Interval
  • Ratio (most useful)

NOIR is just a mnemonic to remember their
sequence
32
Nominal Scale
  • A nominal (name) scale groups or classifies
    things into categories, which
  • Must be jointly exhaustive (cover everything)
  • Must be mutually exclusive (cant be in two
    categories at once)
  • Are in any sequence (none better or worse)

33
Nominal Scale
  • Common examples include
  • Gender, e.g. This room contains 19 people, of
    whom 10 are female, and 9 male
  • Portions of a system, e.g. suspension,
    drivetrain, body, etc.
  • Job titles (though you could argue theyre
    hierarchical)

34
Ordinal Scale
  • This measurement ranks things in order
  • Sequence is important, but the intervals between
    ranks is not defined numerically
  • Rank is relative, such as greater than or less
    than
  • E.g. grades, CMM Maturity levels, inspection
    effectiveness ratings

35
Interval Scale
  • An interval scale measures quantitative
    differences, not just relative
  • Addition and subtraction are allowed
  • E.g. common temperature scales (F or C), or a
    single date (Feb 15, 1962)
  • A zero point, if any, may be arbitrary (90 F is
    not six times hotter than 15 F!)

36
Ratio Scale
  • A ratio scale is an interval scale with a
    non-arbitrary zero point
  • Allows division and multiplication
  • E.g. defect rates (defects/KSLOC), test scores,
    absolute temperature (K or R)
  • The best type of scale to use, whenever
    feasible

37
Scale Hierarchy
  • Measurement scales are hierarchicalratio
    (best) / interval / ordinal / nominal
  • Lower level scales can always be derived if data
    is from a higher scale
  • E.g. defect rates (a ratio scale) could be
    converted to High, Medium, Low or Acceptable,
    Not Acceptable, which are ordinal scales

38
Why Are Scales Important?
  • The types of analysis which are possible, depend
    on the type of scale used for the measurements
  • In statistics, this is roughly broken into
    parametric tests (for interval or ratio scaled
    data) or non-parametric tests (for nominal or
    ordinal scaled data)
  • Some tests are more specific about the data
    scale(s) needed

39
Internal vs External Attributes
  • Internal - measured purely in terms of the entity
    itself by examining the entity on its own,
    separate from its behavior, e.g. code complexity
  • External - measured with respect to how entity
    relates to its environment behavior of the
    entity is important, e.g. response time

40
Internal vs External Attributes
  • Users (and managers) mostly interested in
    external attributes. External attributes
    measured late in development process
  • Can use internal attribute measurements to
    support decision-making about external attributes
  • Might select architecture based on performance
    needs

41
Basic Measures - Ratio
  • Used for two exclusive populations
  • Ratio ( of testers) ( of developers)
  • E.g. tester to developer ratio is 14

42
Proportions and Fractions
  • Used for multiple (gt 2) populations
  • Proportion (Number of this population) / (Total
    number of population)
  • Sum of all proportions equals unity
  • E.g. survey results
  • Proportions based on integer units whereas
    fractions are based on real numbered units

43
Percentage
  • A proportion or fraction multiplied by 100
    becomes a percentage
  • Only cite percentages when N (total population
    measured) is above 30 to 50 always provide N
    for completeness
  • Why? Statistical methods are meaningless for very
    small populations

44
Rate
  • Rate conveys the change in a measurement, such as
    over time, dx/dt. Rate ( observed events /
    of opportunities)constant
  • Rate requires exposure to the risk being
    measured
  • E.g. defects per KSLOC ( defects)/( of
    KSLOC)1000

45
Data Analysis
  • Raw data is collected, such as the date a
    particular problem was reported
  • Refined data is extracted from one or more raw
    data, e.g. the time it took a problem to be
    resolved
  • Refined data is analyzed to produce derived
    data, such as the average time to resolve problems

46
Models
  • Focus on select elements of the problem at hand
    and ignores irrelevant ones
  • May show how parts of the problem relate to each
    other
  • May be expressed as equations, mappings, or
    diagrams
  • May be derived before or after measurement
    (theory vs. empirical)

47
Models Examples
Simplest model of effort estimation Effort
f(SLOC) (effort is some function of
SLOC) There are many possible representations,
such as
Effort a(SLOC)b
____________________________________________
___
EffortabSLOC
bgt1
0ltblt1
Effort
Effort
Effort
SLOC
SLOC
SLOC
48
Elasticity
  • The elasticity of y with respect to x is the
    percentage change in y when x changes by 1
  • For a logarithmic model, with y as a function
    of two things, x1 and x2
  • ln(y) a bln(x1) cln(x2)
  • Then ln(y) changes b percent if ln(x1) changes
    by 1, therefore, b is the elasticity of ln(y)
    with respect to ln(x1)

49
Elasticity
  • Elasticity is also known as the slope of ln(y)
    with respect to, in this case, ln(x1)
  • Often see this concept to express a change in
    something such as if your blood alcohol level
    goes up 0.02, your accident rate goes up 32
    (I made up those numbers)

50
Exponential Notation
  • You might see output of the form 2.78E-12
  • This example means 2.78 10-12
  • A negative exponent, e.g. 12, makes it a small
    number
  • The leading number, here 2.78, controls whether
    it is a positive or negative number

51
Precision
  • Keep your final output to a consistent level of
    precision, e.g. dont report one number as 12
    and another as 11.8625125982351
  • Pick a reasonable level of precision
    (significant digits) similar to the accuracy of
    your inputs
  • Wait until the final answer to round off

52
Graphing
  • A typical graph shows Y on the vertical axis, and
    X on the horizontal axis
  • Y is the dependent variable, and X is
    independent, since you can pick any value of X
    and determine its matching value of Y

SPSS will sometimes ask for X and Y, other times
independent and dependent variables
53
What is R Squared?
  • Coefficient of determination, R2, is a measure
    of the goodness of fit
  • R2 ranges from 0 to 1.
  • R2 1 is a perfect fit (all data points fall on
    the estimated line)
  • R2 0 means that the variable(s) have no
    explanatory power
  • Having R2 closer to 1 helps choose which math
    model is best suited to a problem

54
Linear Regression
y

y
y

y a bx

y y e
x
Choose best line by minimizing the sum of the
squares of the horizontal distances between the
empirical points and the line
55
Expressing Uncertainty
  • We can show the uncertainty in our measurements
    by putting the standard error after each term
  • A line is given in the form y b0 b1x
  • Show standard errors with y (b0 /- seb0)
    (b1 /- seb1)x
  • See example on next slide

56
Y (6.2/-1.9) (1.3 /-0.42) X
  • The numbers in parentheses are the estimated
    coefficients plus or minus the standard errors
    associated with those coefficients
  • In the example the constant parameter (here the
    y-intercept) was estimated to be 6.2 with a
    standard error of 1.9
  • The parameter associated with x (here, the slope)
    was estimated to be 1.3 with a standard error of
    0.42

57
Level of Confidence
  • Generally, we can say that the actual value of a
    parameter estimate is in the range of 2
    standard deviations of its estimated value with a
    95 level of confidence
  • Thus the value of the constant parameter lies
    between 2.4 (i.e., 6.2 21.9) and 10.0 (i.e.,
    6.2 21.9) with a 95 level of confidence
  • With this level of confidence, the parameter
    estimate associated with x (slope) lies between
    .46 and 2.14

58
Level of Confidence
  • The default level of confidence for statistical
    testing is 95
  • Life-and death measurements (e.g. medical
    testing) use 99
  • Some wimpy software studies might use level of
    confidence as low as 80

59
The t Statistic
  • The t-statistic is defined ast (parameter
    estimate) / (standard error)
  • If t gt 2 then the parameter estimate is
    significantly different from zero at the 95
    level of confidence in the example on slide 56
  • t 6.2/1.9 3.26 for the constant term
  • t 1.3/0.42 3.1 for the slope
  • Hence both terms are significant.

Note For very precise work, use 1.96 instead of 2
60
What can we ignore?
  • If a parameter estimate associated with a
    variable is not significantly different from zero
    at the 95 level of confidence, then the variable
    should be omitted from the analysis
  • This is important later for seeing if a curve fit
    is useful if the coefficients pass the T-test,
    they may be used

61
The 95 Rule
  • The 95 rule helps limit how much is practically
    possible
  • Any parameter with a measured mean and non-zero
    std error could fall in any range of values
    though that range may be very unlikely
  • The 95 range lets us exclude everything outside
    of /- 2 std errors as too unlikely

62
The Normal Distribution
  • The normal or Gaussian distribution is the
    familiar bell shaped curve
  • The width of the distribution is measured by the
    standard deviation s
  • The area under the curve within /- 1s from the
    middle covers 68.26 of all events
  • Within /- 2s covers 95.44

63
The Normal Distribution
  • Within /- 3s covers 99.73
  • Within /- 4s covers 99.9937
  • Within /- 5s covers 99.999943
  • Within /- 6s covers 99.9999998
  • The Six Sigma quality objective is to have the
    number of defects under a couple of parts per
    million (ppm)
  • 3.4 ppm, for reasons covered in the text, instead
    of the value implied

64
Six Sigma
  • Six Sigma was founded by Motorola
  • Its best known for its Black and Green Belt
    certifications
  • It focuses on process improvements needed to
    consistently achieve extremely high levels of
    quality
Write a Comment
User Comments (0)
About PowerShow.com