Chapter 7 Part II Digital Image Processing - PowerPoint PPT Presentation

1 / 160
About This Presentation
Title:

Chapter 7 Part II Digital Image Processing

Description:

The classification of the feature types represented by digital images using ... of spatial relationships is often a prelude to digital image classification. ... – PowerPoint PPT presentation

Number of Views:592
Avg rating:3.0/5.0
Slides: 161
Provided by: bobh6
Category:

less

Transcript and Presenter's Notes

Title: Chapter 7 Part II Digital Image Processing


1
Chapter 7Part IIDigital Image Processing
Geography 4260Remote Sensing
GEOG 4260
2
Digital Image Processing
GEOG 4260
  • Image Classification

3
Digital Image Processing
GEOG 4260
  • The classification of the feature types
    represented by digital images using visual image
    interpretation techniques relies on the same
    elements of interpretation used for air photo
    interpretation, i.e. shape, size, pattern, tone,
    texture, shadows, site, and association.
  • Digital image interpretation relies mainly on
    color, i.e. on comparisons of digital numbers
    found in different bands in different parts of an
    image.

4
Digital Image Processing
GEOG 4260
  • The objective of digital image classification
    procedures is to categorize the pixels in an
    image into land cover classes.
  • The output is a thematic image with a limited
    number of feature classes as opposed to a
    continuous image with varying shades of gray or
    varying colors representing a continuous range of
    spectral reflectances.

5
Digital Image Processing
GEOG 4260
  • The range of digital numbers in different bands
    for particular features is known as a spectral
    pattern or spectral signature.
  • Pattern in this sense does not have a spatial
    component. A spectral pattern can be composed of
    adjacent pixels or widely separated pixels.

6
Digital Image Processing
GEOG 4260
  • Spectral pattern recognition refers the
    classification procedures that can be used to
    group similar pixels into feature classes.
  • Computerized spatial pattern recognition is much
    more complex, and is still in its infancy in
    spite of more than three decades of research.

7
Digital Image Processing
GEOG 4260
  • Temporal pattern recognition can also be
    accomplished by digital methods. This involves
    detecting and interpreting changes in spectral
    responses at different times.
  • Spectral, spatial and temporal classification
    methodologies can be applied simultaneously or
    sequentially to improve the overall success of an
    image classification process, and the visual
    interpretation of spatial relationships is often
    a prelude to digital image classification.

8
Digital Image Processing
GEOG 4260
  • Digital image classification techniques can
    generally be classified into two types
  • Unsupervised classification techniques,
  • Supervised classification techniques, and
  • Hybrid classification techniques.
  • Often, however, these types are used sequentially
    or iteratively.

9
Digital Image Processing
GEOG 4260
  • Unsupervised image classification techniques rely
    on the computer to classify spectrally-similar
    pixels into classes in such a way that the
    digital numbers in each class have values within
    all bands that are more similar to the values
    associated with other pixels in the same class
    than they are to the digital numbers of pixels in
    other classes.

10
Digital Image Processing
GEOG 4260
  • Supervised classification techniques require the
    image analyst to define the classification
    categories and identify a representative samples
    of pixels to the computer.
  • The computer then assign all of the remaining
    pixels to one of the predefined classes on the
    basis of the similarities between the digital
    number in the training pixels and the digital
    numbers in all other pixels.

11
Digital Image Processing
GEOG 4260
  • Hybrid classification techniques are designed to
    improve the results of separately applied
    unsupervised and supervised classification
    techniques.
  • All of these techniques are generally applied to
    multispectral images, but similar techniques have
    been developed for hyperspectral images.

12
Digital Image Processing
GEOG 4260
  • Supervised Classification

13
Digital Image Processing
GEOG 4260
  • Supervised classification is considered first
    because the methodology used is generally easier
    to understand. Supervised classification is
    accomplished in four steps
  • The analyst defines a classification scheme,
  • The analyst identifies pixels known to fall in
    each class for the computer,
  • The computer put each image pixel into a class
    based on the multispectral data ranges of the
    training pixels, and then
  • The computer generates a classified image.

14
Digital Image Processing
GEOG 4260
  • The purpose of the image classification project
    dictates the classes into which the computer will
    be asked to categorize image pixels, and it is
    the analysts responsibility to define these
    classes.
  • For example, a forestry application will require
    different classes than a hydrologic or
    agricultural application. However, general
    purpose classifications containing a large number
    of land cover classes are also common.

15
Digital Image Processing
GEOG 4260
  • Once the classes are defined, the analyst must
    develop training classes, i.e. sets of pixels
    within each land cover class encapsulating the
    range of variability of the digital numbers in
    each band.
  • This process will be described in more detail
    after a discussion of how the computer uses these
    data ranges.

16
Digital Image Processing
GEOG 4260
  • The Classification Stage

17
Digital Image Processing
GEOG 4260
  • The computer begins the process of image
    classification by arranging the digital numbers
    of each pixel along the range of digital numbers
    in each band, illustrated here with only two
    bands.

18
Digital Image Processing
GEOG 4260
  • The clusters of pixels are normally arranged in
    multidimensional space since three or more bands
    will usually be utilized in a classification.

19
Digital Image Processing
GEOG 4260
  • One of several possible classification
    strategies, or classifiers, will then be applied
    to assign the remaining pixels into one of the
    predefined classes. Three classifiers will
    considered here
  • The minimum-distance-to-means classifier,
  • The parallelepiped classifier, and
  • The Gaussian maximum likelihood classifier.

20
Digital Image Processing
GEOG 4260
  • The minimum-distance-to-means classifier compares
    each unclassified pixel to the mean digital
    number within each band and assigns the pixel to
    the class whose mean is the shortest distance
    away in hyperdimensional space.

21
Digital Image Processing
GEOG 4260
  • The set of mean digital numbers for each cluster
    is known as the clusters mean vector. Pixels that
    are to be classified also have a set of digital
    numbers in each of the same bands.
  • The software calculates each unclassified pixels
    distance from each mean vector and assigns the
    pixel to the class where this distance is a
    minimum. (Note The mean vector is not this
    distance it is the n-dimensional location of
    the cluster).

22
Digital Image Processing
GEOG 4260
  • This distance between an unclassified pixel and
    each mean vector is easily visualized in two or
    three dimensions, but the computer can easily
    calculate the distance in any number of
    dimensions.

23
Digital Image Processing
GEOG 4260
  • The minimum-distance-to-means classifier is
    computationally efficient because it involves
    only simple mathematical calculations, but it is
    based only on the means of each cluster and is
    not sensitive to the variances of the training
    data clusters.

24
Digital Image Processing
GEOG 4260
  • The variance of a training class is a measure of
    the within-band dispersion of the digital
    numbers. A training class with high variance has
    a large range of digital numbers within one or
    more spectral bands.

25
Digital Image Processing
GEOG 4260
  • It is not uncommon to find pixels that are closer
    to the mean of one cluster with limited variance
    which are actually members a second cluster with
    more variance whose mean is more distant.

26
Digital Image Processing
GEOG 4260
  • Because of the potential for misclassification
    when nearby classes have high variances, the
    minimum-distance-to-mean classifier is not very
    suitable in these cases but is entirely
    acceptable when the classes have low variances
    and low correlation.

27
Digital Image Processing
GEOG 4260
  • Parallelepiped Classifier

28
Digital Image Processing
GEOG 4260
  • A parallelepiped classifier is sensitive to
    within class variance because it encloses all of
    the pixels within a training class in
    parallelepipeds, n-dimensional equivalents of the
    rectangles shown here.

29
Digital Image Processing
GEOG 4260
  • Image pixels that fall outside of the
    parallelepipeds are classified as unknown.
    Pixels in overlapping parallelepipeds are
    classified as uncertain or are arbitrarily
    placed in one or both of the overlapping classes.

30
Digital Image Processing
GEOG 4260
  • Like the minimum-distance-to-mean classifier, the
    parallelepiped classifier is computationally
    efficient (and therefore fast).
  • Additionally, however, it avoids the problems
    associated with assigning pixels to adjacent
    classes when one of the classes has a high
    variance (as long as the classes dont overlap).

31
Digital Image Processing
GEOG 4260
  • Training classes can have high correlation or
    high covariance, or both.
  • The presence of training classes with high
    correlation or high covariance complicates the
    classification process therefore, an basic
    understanding of correlation and covariance is
    necessary to an understanding of the
    classification process.

32
Digital Image Processing
GEOG 4260
  • Training classes with high correlation have
    overlapping clouds of training pixels. This makes
    it impossible to assign new pixels in the overlap
    area with any degree of certainty.

33
Digital Image Processing
GEOG 4260
  • High correlation cannot occur with a single
    training class because it correlation describes
    the similarity (or lack of similarity) between
    two or more classes.

34
Digital Image Processing
GEOG 4260
  • A training class with high covariance has a cloud
    of pixels that is elongated diagonally when
    plotted on a two-axes scatter diagram.

35
Digital Image Processing
GEOG 4260
  • Covariance refers to the arrangement of digital
    numbers within two or more bands of a single
    training class and is independent of the digital
    numbers found within other training classes.

36
Digital Image Processing
GEOG 4260
  • High covariance, however, increases the volume of
    the parallelepipeds, making it more likely that
    pixels will fall within overlap regions and
    therefore have higher correlations with other
    training classes.

37
Digital Image Processing
GEOG 4260
  • Both high correlation and high covariance are
    common within spectral response categories.
  • Parallelepiped classifiers can be modified to
    include stepped boundaries to reduce the amount
    of misclassification, but they cant eliminate
    overlap areas if they exist. The Gaussian maximum
    likelihood classifier was designed to overcome
    this limitation.

38
Digital Image Processing
GEOG 4260
  • Gaussian Maximum Likelihood Classifier

39
Digital Image Processing
GEOG 4260
  • The Gaussian maximum likelihood classifier
    assigns pixels to classes after considering both
    the variance and covariance within training
    classes.
  • It does so by assuming that the distribution of
    the clouds of points representing the training
    data is normally distributed, i.e. exhibits a
    Gaussian distribution.

40
Digital Image Processing
GEOG 4260
  • In one dimension, the frequency distribution of a
    Gaussian distribution forms a bell-shaped curve.

41
Digital Image Processing
GEOG 4260
  • In two dimensions, probability ellipses define
    the likelihood that any particular point is
    associated with a particular distribution, with
    lower (but finite) probabilities beyond each
    ellipse.

42
Digital Image Processing
GEOG 4260
  • These ellipses are defined here by
    equiprobability contours, lines along which the
    probability that a point falling at that location
    belongs to a particular class.

43
Digital Image Processing
GEOG 4260
  • In three or more dimensions, probability space is
    represented by n-dimensional volumes. Visualizing
    more than three dimensional volumes is
    challenging for us, but not particularly
    difficult from a computational standpoint.

44
Digital Image Processing
GEOG 4260
  • Even though n-dimensional equiprobability
    contours overlap, the probabilities that any
    particular unclassified pixel is associated with
    each class can be calculated.
  • Pixels are then simply assigned to the class with
    the highest probability that they actually belong
    to that class. Although it is possible that
    pixels can be misclassified, probability theory
    suggests that any particular pixel is more likely
    to be correctly classified than it is to be
    misclassified.

45
Digital Image Processing
GEOG 4260
  • An extension of the maximum likelihood classifier
    is the Bayesian classifier which depends on
  • A priori knowledge of the land cover types that
    are most likely to occur, and
  • The relative costs of misclassification.

46
Digital Image Processing
GEOG 4260
  • A Bayesian classifier uses weighting factors to
    increase the likelihood that pixels will be
    assigned to more common land cover classes as
    long as the risks of misclassifying a pixel to a
    common class isnt going to cause too much of a
    problem.
  • Less common classes are given lower weights as
    are classes where misclassification of a pixel
    into that class would produce more serious
    problems related to the objectives of the
    classification process.

47
Digital Image Processing
GEOG 4260
  • A Bayesian classifier is superior to a simpler
    maximum likelihood classifier if the image
    analyst has sufficient information. But, this
    isnt usually the case.
  • As a result, most maximum likelihood classifiers
    assume equal likelihood and equal costs
    associated with misclassification.

48
Digital Image Processing
GEOG 4260
  • Maximum likelihood classifiers are much more
    computationally intensive than simpler
    classifiers, particularly with larger images
    containing more bands and with larger numbers of
    classes.
  • Several techniques can be used to increase the
    efficiency of the classification process,
    including
  • Using a lookup table to assign classes,
  • Reducing the dimensionality of the data, and
  • Using a stratified classifier or decision tree.

49
Digital Image Processing
GEOG 4260
  • Lookup tables are database tables that contain
    the values of one variable associated with all
    possible unique values of another variable.

50
Digital Image Processing
GEOG 4260
  • Without a lookup table
  • The calculations necessary to determine the
    probabilities of a pixels belonging to each of
    the several classes must be performed, and then
  • The probabilities must be sorted to determine
    the class to which that pixel should be assigned.
  • Both of these steps have to be repeated for every
    unclassified pixel.

51
Digital Image Processing
GEOG 4260
  • A lookup table containing all existing
    combinations of pixel values in each band and the
    class to which such a pixel should be assigned
    avoids repeating these calculations for every
    pixel.

52
Digital Image Processing
GEOG 4260
  • In other words, the probabilities are calculated
    and sorted once for each unique set of digital
    numbers instead of once for each unclassified
    pixel.

53
Digital Image Processing
GEOG 4260
  • A lookup table accelerates the classification
    process because the class to be assigned to
    multiple pixels with identical digital numbers
    can be found in the lookup table without having
    to recalculate and sort probabilities.

54
Digital Image Processing
GEOG 4260
  • Note that it is not necessary to calculate the
    probabilities for possible combinations of
    digital numbers because the minimum and maximum
    digital numbers in each band can be determined
    ahead of time to limit the ranges of values that
    need to be included in the lookup table.

55
Digital Image Processing
GEOG 4260
  • Increasing the number of bands included in the
    image classification process increases the number
    of calculations necessary to use a maximum
    likelihood classifier.
  • Likewise, being able to reduce the number of
    bands would reduce the number of calculations and
    improve the speed of the classification. That can
    be accomplished through the application of
    principal or canonical components analysis prior
    to the application of the image classifier.

56
Digital Image Processing
GEOG 4260
  • Landsat MSS data contains four bands, but the
    inherent dimensionality of the data is two bands.
    In other words, principal components analysis can
    be applied to the original data to produce two
    rasters that contain all of the information
    necessary to assign the data to land cover
    classes.
  • Canonical component analysis achieves similar
    results.

57
Digital Image Processing
GEOG 4260
  • Stratified classifiers (a.k.a. layered
    classifiers or decision tree classifiers) can
    also be used to reduce the number of calculations
    necessary.
  • These classifiers assign pixels to more
    easily-identified classes first, leaving only a
    subset of the pixels to be classified with a
    maximum likelihood classifier later.

58
Digital Image Processing
GEOG 4260
  • Because water has very low reflectance in
    infrared, pixels can often be easily classified
    as water or not water by looking at the
    digital numbers in an infrared band. Other land
    covers may require data from only two or three
    bands.
  • Assigning easily identified pixels to classes
    with simpler classifiers reduces the number of
    pixels that need to be assigned to classes by a
    slower maximum likelihood classifier.

59
Digital Image Processing
GEOG 4260
  • Classifiers are used to assign pixels to known
    land cover classes, but they require input data
    to produce results. The input data are used to
    create statistical measures of the digital
    numbers (in each band) that can be expected to be
    found within each class.
  • These data are assembled by the image analyst
    during the preceding training stage.

60
Digital Image Processing
GEOG 4260
  • The Training Stage

61
Digital Image Processing
GEOG 4260
  • Supervised classification assigns unclassified
    pixels to classes defined by an image analyst
    using one of the previously-described
    classifiers
  • A minimum-distance-to-means classifier,
  • A parallelepiped classifier, or
  • A Gaussian maximum likelihood classifier.
  • However, it is the analysts job to identify the
    pixels within each class that the computer will
    use to assign other pixels to the classes.

62
Digital Image Processing
GEOG 4260
  • During the training stage of image
    classification, an image analyst selects
    individual pixels that are believed to represent
    both the typical reflectance values for each land
    cover class and the range of values likely to be
    found in each class.
  • This is an iterative process in that the analyst
    has the opportunity to improve the data by
    evaluating them and adding or removing individual
    pixels to improve the utility of a training class.

63
Digital Image Processing
GEOG 4260
  • The success of the classification effort depends
    on
  • The analysts understanding of the process,
  • The analysts knowledge of the land cover types
    and their distribution in the particular
    geographic area, and
  • The availability of reference data to fill gaps
    in the analysts knowledge.

64
Digital Image Processing
GEOG 4260
  • The multidimensional clouds of data points that
    represent the spectral responses of individual
    pixels are spectral classes.

65
Digital Image Processing
GEOG 4260
  • Each spectral class includes multiple pixels to
    represent the range of spectral responses within
    a land cover type in a particular image area.
  • But, multiple spectral classes are often required
    within each land cover class to adequately
    represent a land cover class within the final
    classification scheme.

66
Digital Image Processing
GEOG 4260
  • For example, a water class might include deep
    clear lakes and shallow muddy ponds in addition
    to other water areas.
  • If the computer is expected to distinguish water
    and other land cover types, then all of the
    possible spectral classes of water and all of the
    possible spectral classes.

67
Digital Image Processing
GEOG 4260
  • Other land cover classes may also contain
    multiple spectral classes, each of which must be
    included in the training data.
  • Even a simple classification of cover types into
    broad classes can require a very large number of
    spectral classes selected from an even larger
    number of training sites.

68
Digital Image Processing
GEOG 4260
  • Training sites are geographic areas within which
    all of the pixels are believed to belong to a
    single spectral class, i.e. uniform areas of
    known land cover types.

69
Digital Image Processing
GEOG 4260
  • ERDAS IMAGINE uses areas of interest to enclose
    training sites. An area of interest is a polygon
    enclosing one or more pixels representing a land
    cover class in the proposed classification scheme.

70
Digital Image Processing
GEOG 4260
  • The more general name for an area of interest
    used in this manner is a viewing window. All of
    the pixels within a viewing window should be
    representative of a particular spectral class.

71
Digital Image Processing
GEOG 4260
  • Because all of the pixels in a viewing window
    represent a particular spectral class, viewing
    windows should avoid edge areas so that
    transition zones are not included in the class.

72
Digital Image Processing
GEOG 4260
  • The pixels within each of the viewing windows
    used to define a particular spectral class
    provide the digital numbers that will be used by
    the image classifier.
  • Depending of the type of classifier, these
    numbers will be used to calculate minimum
    distances to means, locate parallelepipeds, or
    define probability contours.

73
Digital Image Processing
GEOG 4260
  • ERDAS IMAGINE allows an image analyst to create
    areas of interest by
  • Dragging a mouse to enclose rectangular or oval
    areas,
  • Clicking a series of points to define the
    boundaries of irregular polygons,
  • Clicking a series of points to define line
    running through a series of adjacent pixels, or
  • Selecting a single seed pixel whose digital
    numbers are used by the software to find
    statistically similar pixels.

74
Digital Image Processing
GEOG 4260
  • Only a very small sample of pixels is needed to
    calculate the statistics for a statistics-based
    classifier such as a maximum likelihood
    classifier.
  • However, supervised classifications are normally
    performed with tens to hundreds of pixels in each
    of several windows representing each training
    class

75
Digital Image Processing
GEOG 4260
  • Increasing the number of representative pixels
    increases that statistical estimates of the mean
    vector and covariance which are needed to
    calculate the probabilities that unclassified
    pixels are members of particular classes.
  • In general, using more pixels to define a
    spectral class improves the quality of the
    statistics for that class.

76
Digital Image Processing
GEOG 4260
  • Selecting widely-dispersed multiple training
    sites for each spectral class also improves the
    statistical description of the variability within
    that class.
  • In other words, selecting fewer pixels from
    multiple training sites dispersed throughout the
    image generally works better than selecting a
    large number of pixels from either a single
    training site or a small number of nearby
    training sites.

77
Digital Image Processing
GEOG 4260
  • Using multiple training sites without limit,
    however, increases the time needed to
    characterize the class.
  • The objective is to adequately capture the
    spectral variability within a training class
    without including too many redundant pixels and
    without creating redundant training classes.

78
Digital Image Processing
GEOG 4260
  • ERDAS IMAGINE and other image processing software
    packages can produce and display graphs,
    statistics and images that allow an image analyst
    to recognize gaps and redundancies in the
    training data. An analyst can then use this
    information to refine the data before they are
    used for classification.
  • Training set refinement refers to the process of
    using these information resources to improve the
    training data.

79
Digital Image Processing
GEOG 4260
  • Training set refinement tools and processes
    include
  • Histograms and other graphical views of the
    data,
  • Quantitative measures of category separability,
  • Self-classification of the training set data,
  • Interactive preliminary classification of pixels
    that were not included in the training classes,
    and
  • Classification of a representative image
    subscene.

80
Digital Image Processing
GEOG 4260
  • Graphical Representations of the Training Data

81
Digital Image Processing
GEOG 4260
  • Graphical representations of the training data
    include
  • Histograms,
  • Coincident spectral plots, and
  • Two-dimensional scatter diagrams.

82
Digital Image Processing
GEOG 4260
  • Histograms

83
Digital Image Processing
GEOG 4260
  • A histogram is a graph of the number of
    observations for each pixel value within a
    spectral band (or within a training class).
  • Pixel values are on the horizontal axis and pixel
    counts are on the vertical axis.

84
Digital Image Processing
GEOG 4260
  • In ERDAS IMAGINE, histograms include the median
    pixel value (as a digital number and graphically
    as a red line) and the minimum and maximum pixel
    values.

85
Digital Image Processing
GEOG 4260
  • Histograms of training sites are useful because
    they provide a visual indication of the normality
    of the distribution.
  • This is especially important when a Gaussian
    maximum likelihood classifier is to be employed
    because this classifier assumes that the training
    data are normally distributed.

86
Digital Image Processing
GEOG 4260
  • This histogram illustrates a bimodal
    distribution.
  • If these data represented a training class, they
    would be unacceptable for use with a maximum
    likelihood classifier.

87
Digital Image Processing
GEOG 4260
  • A bimodal histogram indicates that a class
    contains two different spectral subclasses and
    suggests that two different training classes
    should be developed to differentiate them.

88
Digital Image Processing
GEOG 4260
  • This is true even if the subclasses will be
    recombined in the final land cover classification
    because it makes it less likely that other land
    use types will be assigned to the resultant
    unimodal spectral subclasses.

89
Digital Image Processing
GEOG 4260
  • Histograms illustrate the distribution of digital
    numbers within a band, but they dont make it
    easy to compare the distributions of digital
    numbers for different cover types in multiple
    bands.
  • Coincident spectral plots are designed to
    overcome this shortcoming of histograms.

90
Digital Image Processing
GEOG 4260
  • Coincident Spectral Plots

91
Digital Image Processing
GEOG 4260
  • Coincident spectral plots display the mean and
    variance of the digital numbers associated with
    each spectral class in each band. (Standard
    deviation is one measure of the variance of a
    distribution).

92
Digital Image Processing
GEOG 4260
  • The coincident spectral plots illustrated here
    suggest that classification of the features
    represented by these training classes can be
    accomplished with some degree of confidence in
    spite of significant overlap.

93
Digital Image Processing
GEOG 4260
  • Two-Dimensional Scatter Plots

94
Digital Image Processing
GEOG 4260
  • Two-dimensional scatter plots are used to compare
    the spectral response patterns of one or more
    training classes in two spectral bands and are
    useful for providing a visual indication of the
    separability of training classes utilizing only
    those two bands.

95
Digital Image Processing
GEOG 4260
  • Scatter plots can also be used to visualize the
    degree of correlation between two bands
  • Bands that are highly correlated (either
    negatively or positively) are not very useful for
    separating features from one another, but
  • Bands that show poor correlation usually
    contains data that will allow the separation of
    different feature types from one another.

96
Digital Image Processing
GEOG 4260
  • Histograms, coincident spectral plots, and
    scatter diagrams provide qualitative information
    that an image analyst can use to subjectively
    evaluate spectral classes, but they dont provide
    quantitative measures that can be used to assess
    signatures.

97
Digital Image Processing
GEOG 4260
  • Quantitative Measures of
  • Category Separability

98
Digital Image Processing
GEOG 4260
  • Quantitative measures of category separability
    attempt to objectively summarize the ability of
    training classes to distinguish between the
    feature types they represent.
  • ERDAS IMAGINE provides four such quantitative
    measures, including the Euclidean distance
    between training class means in n-dimensional
    space.

99
Digital Image Processing
GEOG 4260
  • The results of all of these measures are normally
    presented in matrix format. The zeros along the
    diagonal within this matrix of Euclidean
    distances indicate that classes cannot be
    separated from themselves, while higher values
    indicate a higher level of separability.

100
Digital Image Processing
GEOG 4260
  • Note that the numbers on opposite sides of the
    diagonal are mirror images of themselves.
  • This is simply a result of the fact that the
    distances between spectral class means can be
    measured in either direction. Matrices of other
    quantitative measures of separability are also
    symmetrical.

101
Digital Image Processing
GEOG 4260
  • Euclidean distances doesnt include a measure of
    the variance within training classes because it
    only considers the mean of each spectral class.
  • The second quantitative measure provided by ERDAS
    IMAGINE, divergence, does include the variance in
    the calculation.

102
Digital Image Processing
GEOG 4260
  • The formula used to calculate divergence is
    complex, but it is normally used by computer
    software rather than image analysts making it
    simple to apply.

103
Digital Image Processing
GEOG 4260
  • Higher divergence numbers, like higher Euclidean
    distances, indicate higher separability while
    zeros are again found along the diagonals.
  • Divergence values have no fixed range. Therefore,
    they are used to make relative comparisons of
    separability.

104
Digital Image Processing
GEOG 4260
  • Transformed divergence, a third quantitative
    measure of separability, weights the covariance
    of the pairs by their distances in such a way
    that the values range from 0 to 2000.

105
Digital Image Processing
GEOG 4260
  • The following general rules apply
  • Spectral classes can be separated using only two
    bands if their transformed divergence of the two
    spectral classes between for those two bands is
    greater than 1900,
  • Separability is fairly good if the transformed
    divergence is between 1700 and 1900, and
  • Separability is poor if the transformed
    divergence is below 1700.

106
Digital Image Processing
GEOG 4260
  • The same training classes were used to generate
    these transformed divergence values as were used
    in the earlier example.
  • An obvious advantage of transformed divergence is
    that it normalizes the data for easy comparisons.

107
Digital Image Processing
GEOG 4260
  • The maximized values for most of the pairs of
    spectral classes indicate that these signatures
    are suitable for separating these classes.
    Complete separability is made possible by the
    high quality of Thematic Mapper data and the fact
    that all seven bands were included in the
    signatures.

108
Digital Image Processing
GEOG 4260
  • However, such high quality signatures cannot be
    produced without very careful selection of pixels
    within each training class regardless of the
    quality of the image data.

109
Digital Image Processing
GEOG 4260
  • The Jefferies-Matusita (JM) distance, a final
    quantitative measure of spectral separability, is
    produced in a manner that is similar to the
    transformed divergence and is interpreted
    similarly, but it has a maximum value of 1414.

110
Digital Image Processing
GEOG 4260
  • Self-Classification of Training Data Sets

111
Digital Image Processing
GEOG 4260
  • Self-classification uses the training data to
    classify the pixels that were used to generate
    the training data.
  • Although it might seem that all of the pixels
    will necessarily be correctly classified, that is
    not always the case. Some pixels may end up being
    incorrectly classified if there are overlaps in
    the ranges of digital numbers used to create the
    classes.

112
Digital Image Processing
GEOG 4260
  • The percentage of misclassified pixels in each
    training class is normally presented in the form
    of a error matrix.

113
Digital Image Processing
GEOG 4260
  • Training class signatures that arent able to
    accurately classify the pixels used to create
    them arent likely to accurately classify the
    remaining pixels in an image.
  • However, the converse is not necessarily true
    Training classes that successfully classify their
    component pixels arent necessarily able to
    accurately classify the remaining pixels in an
    image because those remaining pixels may not be
    represented by any of the available signatures.

114
Digital Image Processing
GEOG 4260
  • Training classes that exclude signatures
    representing one or more land cover types can
    only put those land covers into an existing class
    or into an unknown class.
  • Therefore, self-classification is useful for
    determining if signatures are poorly developed,
    but it cant be used to show that the training
    classes are complete or that they will do a good
    job of classifying all of the pixels omitted from
    their development.

115
Digital Image Processing
GEOG 4260
  • Interactive Preliminary Classification

116
Digital Image Processing
GEOG 4260
  • As the name implies, interactive preliminary
    classification allows an analyst to test the
    accuracy and completeness of training classes
    during their development.
  • Usually, the preliminary classification uses a
    computationally efficient classifier such as a
    parallelepiped classifier even when a more
    sophisticated classifier will be used to perform
    the final classification.

117
Digital Image Processing
GEOG 4260
  • During this process, the analyst is able to add
    or remove pixels from the preliminary training
    classes to immediately see the results of the
    modified classification.
  • This interactive process easily identifies
    individual pixels that either improve or degrade
    the quality of the classification, resulting in
    final spectral classes that produce good results.

118
Digital Image Processing
GEOG 4260
  • Representative Subscene Classification

119
Digital Image Processing
GEOG 4260
  • Representative subscene classification is similar
    to interactive preliminary classification except
    that the classification is performed with the
    type of classifier that the analyst plans to use
    in the final classification step and only part of
    the entire image is classified.
  • Using part of the full image allows the analyst
    to concentrate on areas where the cover types are
    already well known and makes the process less
    time consuming.

120
Digital Image Processing
GEOG 4260
  • Final Comments on the Training Stage

121
Digital Image Processing
GEOG 4260
  • Image classifiers are designed for efficiency
    (speed) and accuracy.
  • The training stage, however, must be conducted in
    a manner that produces maximum accuracy.
    Introducing shortcuts at the training stage is
    likely to produce poor results in the final
    classification.

122
Digital Image Processing
GEOG 4260
  • The most difficult part of the training process
    is not the development of spectral classes for
    distinctly different land cover types such as
    water, forest and agriculture.
  • Instead, the problems arise in developing
    spectral classes that will assign pixels in
    transition zones and areas of mixed cover types
    to the appropriate land cover classes.

123
Digital Image Processing
GEOG 4260
  • By definition, transition zones are areas
    containing elements of the cover types that are
    more homogenous on either side of the transition
    zone.
  • As a result, spectral signatures often have to be
    developed separately for transition zones and the
    analyst needs to develop a consistent
    classification scheme that will either assign
    these areas to one or another primary cover type
    or to a transitional cover type.

124
Digital Image Processing
GEOG 4260
  • Refining spectral classes is often a
    trial-and-error process where the analyst adds or
    removes pixels from a developing training class
    to test the effect that the modification has on
    the ability of the classifier to produce
    acceptable results.
  • Sometimes it is necessary to remove
    rarely-occurring land cover type from the
    classification scheme in order to avoid
    misclassifying pixels that belong to more common
    land cover types.

125
Digital Image Processing
GEOG 4260
  • The training stage may also make it apparent that
    some of the originally-proposed land cover
    classes will need to be combined into more
    general classes because the spectral responses of
    the proposed classes are indistinguishable.
  • For example, the data may not make it possible to
    distinguish individual tree species even though
    they are adequate to classify trees into more
    general categories such as evergreen, deciduous
    and mixed forests.

126
Digital Image Processing
GEOG 4260
  • Sometimes, the inherent spectral responses of
    similar cover types may make it impossible to
    separate them using a single multiband image.
  • In these cases, it may be necessary to acquire
    additional data in the form of field data or
    images acquired on other dates or with other
    sensors, or to use other classification
    techniques such as visual interpretation of the
    digital image or of other images including higher
    resolution digital images or aerial photography.

127
Digital Image Processing
GEOG 4260
  • Unsupervised Classification

128
Digital Image Processing
GEOG 4260
  • Unsupervised classification differs from
    supervised classification in that
  • The image analyst does not design the
    classification scheme nor develop training
    classes, and
  • The computer uses algorithms that aggregate
    similar pixels into classes based on their
    similarity with each other and their
    dissimilarity to the remaining pixels rather than
    their likely land cover types.

129
Digital Image Processing
GEOG 4260
  • With unsupervised classification, the land cover
    types associated with each class are initially
    unknown and the computer produces no information
    to aid in their identity.
  • It is the image analysts job to associate the
    classes defined by the computer with the land
    cover types in the image that these classes
    represent.

130
Digital Image Processing
GEOG 4260
  • The procedural steps are reversed in these two
    classification methodologies
  • In supervised classification, the analyst
    defines land cover types and then develops
    spectral classes that can be used by the computer
    to identify those pixels that are members of each
    class.
  • In unsupervised classification, the computer
    develops spectral classes and then the analyst
    associates the spectral classes with land cover
    types.

131
Digital Image Processing
GEOG 4260
  • Unsupervised classification has two significant
    advantages over supervised classification
  • The computer can assign pixels to
    spectrally-distinct classes which an analyst
    might not recognize as existing, and
  • The computer can identify a much larger number
    of spectrally-distinct classes than an analyst
    might consider to exist.

132
Digital Image Processing
GEOG 4260
  • Even if an analyst recognizes that distinct
    subclasses exist, unsupervised classification
    techniques allow the analyst to avoid developing
    spectral classes for each unique class and
    subclass.
  • Instead, the computer creates a large number of
    distinct classes and then the analyst can combine
    them into final classes as deemed appropriate.

133
Digital Image Processing
GEOG 4260
  • The methods used in both supervised and
    unsupervised classification require the
    assignment of individual pixels into a finite
    number of spectral classes. Each of these
    spectral classes is presumed to represent a
    unique land cover type.

134
Digital Image Processing
GEOG 4260
  • Unsupervised classification uses one of many
    available clustering algorithms to determine
    natural groups of spectrally-similar pixels. Most
    of these fall into two general types
  • K-means algorithms have the computer
    more-or-less arbitrarily select a number of
    pixels as the starting points for defining
    clusters, and
  • Moving window algorithms that use a moving
    window to identify small groups of
    spectrally-similar pixels to use as starting
    points for defining clusters.

135
Digital Image Processing
GEOG 4260
  • K-Means Algorithms

136
Digital Image Processing
GEOG 4260
  • Standard k-means algorithms require the analyst
    to set the number of clusters to be defined.
  • The computer then selects one pixel to initially
    represent each class. These seed pixels are
    arbitrarily scattered throughout the
    multidimensional image space defined by the
    digital numbers in each available band for each
    pixel. In other words, the seed pixels have large
    differences in most of their digital numbers.

137
Digital Image Processing
GEOG 4260
  • After selecting seed pixels, the computer uses
    the mean vector of each seed pixel to assign the
    remaining pixels to clusters around the nearest
    mean vector.
  • New mean vectors are then calculated using all of
    the pixels in each of these new clusters. All of
    the pixels in the image are then reassigned to
    the nearest of these new mean vectors.

138
Digital Image Processing
GEOG 4260
  • The process of calculating new mean vectors and
    reassigning pixels to the nearest mean vector is
    repeated until only a limited number of pixels
    need to be shifted to other classes because there
    is little movement of the mean vectors.
  • The threshold percentage of pixels reassigned is
    another variable controlled by the image analyst,
    i.e. the inputs to the computer include a
    percentage of reassigned pixels below which the
    process stops.

139
Digital Image Processing
GEOG 4260
  • A final input provided by the analyst is the
    number of iterations allowed.
  • The computer stops calculating new mean vectors
    and reassigning pixels when
  • Fewer pixels than the input threshold are being
    reassigned, or
  • The maximum number of iterations has been
    completed.

140
Digital Image Processing
GEOG 4260
  • ERDAS IMAGINE provides a variant of the k-means
    approach known as Iterative Self-Organizing Data
    Analysis or ISODATA.
  • The ISODATA algorithm is similar to other k-means
    algorithms, but a significant difference is that
    the ISODATA algorithm allows the computer to
    determine the final number of clusters while this
    value is set by the image analyst in other
    k-means applications.

141
Digital Image Processing
GEOG 4260
  • The number of classes can change because the
    computer is allowed to merge spectrally-similar
    preliminary clusters (i.e. clusters whose mean
    vectors are nearby) and to split clusters whose
    standard deviation within any single band is
    larger than a predefined threshold.

142
Digital Image Processing
GEOG 4260
  • If splitting a cluster with a large standard
    deviation produces clusters that are smaller than
    an analyst-specified threshold, the new clusters
    are simply eliminated and their constituent
    pixels are reassigned to the remaining cluster
    whose mean vector is nearest.
  • As stated earlier, the process then repeats until
    either few pixels are being reassigned or a
    maximum number of iterations has been completed.

143
Digital Image Processing
GEOG 4260
  • Moving Window Algorithms

144
Digital Image Processing
GEOG 4260
  • Unsupervised classification algorithms that use a
    moving window to identify initial clusters are
    predicated on the idea that the initial sets of
    pixels used to define preliminary classes should
    be spectrally similar because the final clusters
    will contain spectrally-similar pixels.

145
Digital Image Processing
GEOG 4260
  • No such algorithms are provided by ERDAS IMAGINE,
    but they are widely used and are the basis of
    many unsupervised classifications discussed in
    the remote sensing literature.
  • Therefore, a basic understanding of the
    methodology used by these algorithms is important
    to the correct interpretation of research
    conclusions.

146
Digital Image Processing
GEOG 4260
  • The algorithms take into consideration the
    texture or roughness of the pixels within a
    moving window passed through the image.
  • Texture is defined by the multi-dimensional
    variance of the digital numbers within a moving
    window passed through the image.

147
Digital Image Processing
GEOG 4260
  • The analyst inputs a threshold variance below
    which the pixels within the window are considered
    smooth, and the computer then moves a window
    through the image until it encounters a smooth
    window.
  • The mean of these pixels then becomes the center
    of the first preliminary cluster.

148
Digital Image Processing
GEOG 4260
  • The computer then continues searching for other
    smooth windows and defines the mean of each
    newly-found smooth window as the next cluster
    center.
  • The analyst is required to specify the maximum
    number of initial clusters. This number is
    usually relatively large (e.g. 50) because the
    computer will eventually combine many of these
    into a smaller number of spectrally-similar
    clusters.

149
Digital Image Processing
GEOG 4260
  • When the analyst-defined number of preliminary
    cluster centers has been reached, the computer
    calculates the distances between each pair of
    preliminary clusters and merges the two clusters
    that are separated by the smallest distance.
  • New statistics are then calculated and the two
    then-nearest clusters are merged. This process is
    repeated until all of the remaining clusters are
    more spectrally-distinct than an analyst-defined
    threshold.

150
Digital Image Processing
GEOG 4260
  • The clusters that werent eliminated by merging
    are used to classify all of the remaining pixels
    using one of the classifiers that are used for
    supervised classifications (e.g. a
    minimum-distance-to-means classifier or a maximum
    likelihood classifier).

151
Digital Image Processing
GEOG 4260
  • In a sense, this process is similar to a
    supervised classification except that the
    computer selects the training classes by
    identifying the spectrally-smoothest areas of the
    input image.
  • The final classification uses the computers
    training classes instead of an analysts training
    classes, but the final classification uses the
    same types of classifiers.

152
Digital Image Processing
GEOG 4260
  • Disadvantages of this approach result from the
    facts that some important land cover classes may
    not be included in the classification because
  • They are inherently rough at certain scales
    (e.g. rivers or roads), or
  • They exist in areas that werent processed
    before maximum number of smooth clusters was
    found.

153
Digital Image Processing
GEOG 4260
  • A final variation of this procedure overcomes
    these limitations by involving the analyst in the
    selection of some of the initial clusters.
  • This involves elements of both supervised and
    unsupervised classification, and many other
    hybrid classification methodologies are commonly
    used in image classification.

154
Digital Image Processing
GEOG 4260
  • Final Comments on Unsupervised Classification

155
Digital Image Processing
GEOG 4260
  • Unsupervised classification procedures are
    designed to identify spectrally-similar classes
    of pixels.
  • They are incapable, however, of associating these
    classes with landcover classes. This associative
    process can be as difficult as the process of
    developing and refining the spectral classes that
    are used in supervised classifications.

156
Digital Image Processing
GEOG 4260
  • There are four possible relationships between
    spectral classes and land cover classes
  • A one-to-one relationship,
  • A many-to-one relationship, and
  • A many-to-one relationship.
  • Many-to-one and one-to-many relationships are
    more common and are relatively easy to deal with.
    One-to-many relationships, though, are especially
    problematic.

157
Digital Image Processing
GEOG 4260
  • A one-to-one relationship exists when each of the
    spectral classes represents a distinct landcover
    class.
  • If this type of relationship exists, the analyst
    only needs to recognize the relationships and
    assign appropriate class names.

158
Digital Image Processing
GEOG 4260
  • In a many-to-one relationship, two or more
    spectral classes are logically grouped to define
    a single landcover class.
  • For example, an unsupervised classification might
    produce distinct spectral classes that the
    analyst recognizes as deep clear water, slightly
    turbid lakes, and shallow muddy ponds. These can
    conveniently be assigned to a water landcover
    class unless the analyst is especially interested
    in the differences between these water features.

159
Digital Image Processing
GEOG 4260
  • The analysts job is more difficult, however, if
    one to many relationships exist.
  • For example, the analyst may wish to produce a
    classification that separates deciduous and
    evergreen forest types in a forestry application.
    If the computer generates three spectral classes
    which the analyst recognizes as deciduous,
    evergreen and mixed forest, these spectral
    classes dont provide any method to achieve the
    analysts objective.

160
GEOG 4260
  • Next Chapter 8
  • Microwave and Lidar Remote Sensing Systems
Write a Comment
User Comments (0)
About PowerShow.com