CSG -- Signature Generator
Creates signatures for a particular window or a region under a
bitmap mask using data from a set of image channels. Each signature
consists of channel means, deviations, thresholds and relative 'a
priori' probabilities. The user can select the channels, threshold,
and 'a priori' probability for the class.
Signatures are most often generated as input to the classification
program MLC but also have useful applications in Principal Component
Analysis.
See Also: CSR, MLC, MINDIS, PCA
PARAMETERS
CSG is controlled by the following global parameters:
Name Prompt Count Type
FILE Database File Name 1-64 Char
DBIC Database Input Channel List 1-16 Int
MASK Area Mask (Window or Bitmap) 0-4 Int
VALU Grey-level Value List 1 Int
THRS Gaussian Threshold 0-1 Real
BIAS Class Bias 0-1 Real
REPORT Report mode: TERM/OFF/filename 1-64 Char
The following parameter receives output:
LASC Last Segment Created 1 Int
FILE
Specifies the name of the PCIDSK image file used to derive signature
segments.
EASI>FILE="filespec"
DBIC
Specifies the image channels to be sampled for signature creation.
EASI>DBIC=i,j,...,p
- Ranges of channels can be specified with negative values. For
example: {1,-4,10} is internally expanded to {1,2,3,4,10}.
- Up to 16 channels can be specified.
- Duplicate channels are NOT allowed.
MASK
Specifies the window or bitmap used to define the area to sample,
within the input channels, in order to generate the signature.
EASI>MASK=i | use area under bitmap
EASI>MASK=i,j,k,l | use area under defined window
EASI>MASK= | use entire image
VALU
Specifies an integer value (1-254) that will be assigned during
image classification, to the pixels considered as belonging to this
spectral signature.
EASI>VALU=n
Values of 0 and 255 are invalid, as they will be assigned to the
null class and multiply class, respectively.
THRS
Specifies the Gaussian feature space threshold. The greater this
value, the larger the hyperellipsoid or parallelepiped in feature
space. THRS is a REAL number in units of standard deviation.
EASI>THRS=n
EASI>THRS= | defaults to 3.0
BIAS
Specifies the BIAS or relative `a priori' probabilities for the
signature. If all signatures have equal bias, their `a priori'
probabilities are equal. BIAS `biases' the maximum likelihood
classifier's assignment of a pixel when it lies within overlapping
portions of class signatures in feature space. BIAS is a REAL
number.
EASI>BIAS=nn.nn
EASI>BIAS= | defaults to 1.0
REPORT
Specifies the file to which the generated report is appended.
EASI>REPORT="filename"
EASI>REPORT= | defaults to terminal output
Note: The following names have special meaning:
EASI>REPORT="TERM" | generates reports on your terminal
EASI>REPORT="DISK" | generates reports on file "IMPRPT.LST"
EASI>REPORT="OFF" | switch off report generation
LASC
The segment number of the created signature is stored in this
parameter.
DETAILS
Spectral signature data is derived from image data on selected
database channels (DBIC) as sampled under a selected window or
bitmap (MASK). This data is principally used to define partitions in
the feature space of the image, which will subsequently be used to
classify the data. Signatures can also be used in the PCA program to
define the new measurement axes.
CSG creates a new signature segment, and stores the following:
- CORRELATION MATRIX for all selected channels;
- COVARIANCE MATRIX, DETERMINANT OF COVARIANCE MATRIX, INVERSE
COVARIANCE MATRIX, and TRIANGULAR INVERSE COVARIANCE MATRIX for
all selected channels;
- MEAN and STANDARD DEVIATION for each selected channel;
- Classification grey-level coding VALU for the classified output
Theme Map;
- Gaussian THRESHOLD value (in standard deviation units) for radius
of hyperellipsoid from class mean;
- LOWER and UPPER distances (in standard deviation units) of
parallelepiped boundaries from channel means;
- Relative 'a priori' probability weighting (BIAS).
The Gaussian threshold (THRS) and relative 'a priori' probability
(BIAS) of the signature segment may be specified by the user prior
to signature generation. THRS defaults to 3.0 standard deviation
units, and BIAS defaults to 1.0. Other values, such as channel
means, standard deviations, and lower and/or upper parallelepiped
limits may be modified using CSE (Note - initially, the lower and
upper parallelepiped limits (LOLIM and UPLIM) are defaulted to the
value of THRS).
The `segment-number' of this data segment will be stored in the
parameter LASC. Many programs that will use/modify this data will
expect the `segment-number' to be stored in the parameter DSIG. The
EASI command DSIG=LASC, executed IMMEDIATELY after the execution of
CSG, will move this value into DSIG.
Use CSR to generate a soft- or hard-copy signature report on the
report device of your choice.
Use MLC for multi-class classification of this signature in
conjunction with other class signatures.
Use PCA to optimize the enhancement for a particular image feature
or to transform the image data according to the statistics of a
different image dataset.
REPORT
CSG only reports alterations that it forces on the class CORRELATION
matrix to ensure that the COVARIANCE matrix is well conditioned. If
a report from CSG does NOT appear, even though you asked for one,
this means that no `conditioning' of the CORRELATION (and
concomitantly, the COVARIANCE) matrix has occurred.
CSG's matrix alteration report will appear on your terminal even if
you have turned reporting off.
To get a full printout of the statistical vectors and matrices to be
used in subsequent classification, see the CSR program
documentation.
MESSAGES and WARNINGS
The mathematics used in creating a signature segment is based on
statistics and involves matrix manipulations. A number of situations
can occur which will result in statistical or mathematical problems.
These include the following:
- variance within a channel is poor;
- variance between channels is poor;
- training area is too small to collect `good' statistics.
In general, CSG will attempt to reduce these problems by introducing
minor modifications of values. The user is informed of these actions
and re-conditionings with one of the following messages:
Error 81: Sample size too small.
For mathematical reasons, a training site must have at LEAST one
pixel set for each channel used in DBIC. For example, if DBIC
specifies 4 channels then the training bitmap specified by MASK must
have at least 4 bits (i.e., pixels) set.
Sample size = xxx (for n channels, yyy is recommended).
This is a warning only and can be ignored. For statistical reasons
(involving confidence in estimates), training sites should be at
least 5*(n*n+n) pixels large. Larger training sites usually yield
better statistics and thus better classification results.
Standard deviation xx was forced from xxxx to 0.0001.
This is a warning only and can be ignored. Usually, this message
indicates that all (or almost all) the pixels in a channel under the
training site were of the same grey level. To prevent mathematical
problems, a small variance (0.0001) was introduced.
Correlation matrix cell (n,m) was forced from xxx.xx to yyy.yy.
This is a warning only and can be ignored. Usually, this message
indicates that two or more channels in the training site were highly
correlated (very similar). A small change is introduced to prevent
mathematical problems.
EXAMPLE
Example 1: This example creates a spectral signature (based upon TM
bands 5, 6, 7, and 8) for the residential land use of the IRVINE.PIX
database. The contents of this signature can later be listed (using
CSR) or edited (using CSE) before it is used as input to the maximum
likelihood classifier (MLC).
EASI>FILE="irvine.pix" | database file
EASI>DBIC=5,6,7,8 | image channels used to define sig.
EASI>MASK=12 | bitmap of residential training areas
EASI>VALU=64 | theme value assigned during classification
EASI>THRS= | default threshold
EASI>BIAS= | default bias
EASI>RUN CSG
Example 2: This example generates a signature (based upon TM
channels 5, 6, 7, and 8) that will serve as the basis for generating
enhanced images (using principal component analysis - PCA). Once
generated, this signature can be used repeatedly in PCA to apply the
same transformation to datasets representing adjacent scenes (the
same channels) or the same scene from different dates. The purpose
of this is to maintain a consistent colour representation among
several images of a particular scene.
EASI>FILE="irvine.pix" | database file
EASI>DBIC=5,6,7,8 | image channels used to define sig.
EASI>MASK=12 | bitmap of residential training areas
EASI>VALU=64 | value irrelevant in this context
EASI>THRS= | value irrelevant in this context
EASI>BIAS= | value irrelevant in this context
EASI>RUN CSG
About PCI Help Gateway