Simulation Matching and Model Estimation Using Statistical Learning Techniques - PowerPoint PPT Presentation

1 / 60
About This Presentation
Title:

Simulation Matching and Model Estimation Using Statistical Learning Techniques

Description:

Simulation Matching and Model Estimation Using Statistical Learning Techniques ... or Quantile plot whether the. errors are not falling in any pattern. Yes. Exit ... – PowerPoint PPT presentation

Number of Views:145
Avg rating:3.0/5.0
Slides: 61
Provided by: nicholas66
Category:

less

Transcript and Presenter's Notes

Title: Simulation Matching and Model Estimation Using Statistical Learning Techniques


1
Simulation Matching and Model Estimation Using
Statistical Learning Techniques
Principal investigator Prof. Nicholas
Zabaras Other participants Jingbo Wang and V.A.
Badri Narayanan Materials Process Design and
Control Laboratory Sibley School of Mechanical
and Aerospace Engineering188 Frank H. T. Rhodes
Hall Cornell University Ithaca, NY
14853-3801 Email zabaras_at_cornell.edu Phone
(607) 255-9104 URL http//www.mae.cornell.edu/zab
aras/
Materials Process Design and Control Laboratory
2
What is the problem?
  • For a given complex system, we have
  • A collection of data from experiments
    (tests) conducted in the past
  • Can this information help us to
  • Estimate the system parameters with specified
    confidence intervals
  • Predict the outcome of the experiment (test)
    given a new input
  • Identify adjustment to the system parameters to
    meet customer performance requirements
  • or even further
  • Identify modifications in the experimentation
    simulation methods to improve matching their
    results

The answer is yes and it lies in statistical
machine learning methodologies
Materials Process Design and Control Laboratory
3
Proposed approach --- Statistical Machine Learning
Input data
System model (parameters)
Output prediction
Data from previous experiments
  • In statistical machine learning
  • Models (Learners) built using previous data
    (Training sets) enable us to estimate the system
    parameters and predict outcome of new unseen
    inputs
  • Obtaining this information can help us in
    applications of design and control
  • These methods have strong roots in statistics
    and employ highly effective database techniques

Materials Process Design and Control Laboratory
4
Statistical learning An introduction
  • Learning from data

Inputs
Quantitative
Training set
Data (sample set)
Outputs
Qualitative
Testing set
  • Model Learner Predictive model

Supervised learning (Presence of output in model)
Regression (Quantitative output)
  • Learning Paradigms

Classification (Qualitative output)
Unsupervised learning (No output in model)
Materials Process Design and Control Laboratory
5
Statistical decision framework
  • Loss function --- error between prediction and
    measurement

X ,q
))
(
,
(

f
Y
L
where,

ÃŽ
vector (measurement)
Random output

Rk
Y
X ,q
Functional relation between X and Y
)
(
f
  • In a general framework, we try to minimize the
    expected loss from prediction
  • Depending on the loss function used and the type
    of model (with known or unknown functional
    relations), we have different learning methods

Materials Process Design and Control Laboratory
6
Popular statistical learning methods
Nature of output data
Categorical (e.g. On, Off)
Quantitative
  • Pattern recognition and associative methods
  • Nearest neighbor methods
  • Functional approximation techniques
  • Maximum a posteriori probability method
  • Neural networks
  • Regression analysis
  • Partial/Ridge/Lasso/ PCA and other subset
    techniques
  • Bayesian estimates
  • Discriminant analysis
  • Regression on indicator sets
  • ANOVA
  • Maximum likelihood estimates

Regression
Common methods
Classification
These methods are few of the most popular
statistical learning methods available
Materials Process Design and Control Laboratory
7
Functional input-output relation not known
  • This is a pure data mining problem. In this
    case,
  • the solution sequence is as follows
  • - study the distribution of data helps to
    visualize the spread of data, detect the presence
    of outliers, etc.
  • - choose the basis functions to fit the data in
    case the input/output relationship is perceived
    to be nonlinear
  • - use LS or other optimization methods to
    evaluate the weight associated with each basis
    function
  • - Build the model this model is called as
    learner and is used for output prediction and
    parameter estimation

Materials Process Design and Control Laboratory
8
Functional input-output relation not known
  • A predictive model can be designed using linear
    regression

Let the inputs be (X1,X2) and the output Y.
First assume that the output is a function of
(X1, X2, X1X2, ). Then a subset
selection may indicate that X1 X1X2 best
explain the data
2
2
X
X
,
1
2
No
Yes
Exit
Materials Process Design and Control Laboratory
9
Known functional input-output relation
  • With known functional relationship between input
    and output, the problem is reduced to that of
    estimation of the function (system) parameters
    that best explain the data
  • Here it is assumed that once the optimal values
    of system parameters are known, the output can be
    estimated from the input using the given
    functional relationship
  • System parameter estimation is performed in
    steps
  • Input data reduction and smoothing for large
    data set (Kernel
  • methods, spline bases, etc.)
  • Selection of type of approximation (Linear or
    nonlinear)
  • Choice of model (Robust regression, Maximum
    likelihood, Bayesian
  • and other PDF based methods, Neural
    networks, etc.)

Materials Process Design and Control Laboratory
10
Formulation of the problem by statistical learning
System parameters (?1,?2,..., ?s)
Input data (X1,X2,...,Xp)
Predicted output (Y1,Y2,...,Yk)
  • For a complex system as shown above
  • Xi, ?i, Yi --- all random variables
  • The functional form of input output relation is
    known
  • A sample data set is available
  • and the following information
  • Known statistical nature of input and output
  • Known uncertainty in the system parameters
  • can we
  • Estimate optimal values for the system parameters
    that can allow accurate prediction of future test
    cases?
  • Provide confidence intervals for the optimal
    system parameters?
  • Predict future outcomes of the model with desired
    accuracy?

Materials Process Design and Control Laboratory
11
An example
Simple beam simulation matching problem
Materials Process Design and Control Laboratory
12
Example problem (schematic)
Consider the beam shown
L
P
a
x
h
E Elastic Modulus
y
b
Well known equations for the maximum deflection
and maximum stress in terms of the system
parameters
2


6
P
a



-

2
P
a
a
3
L
(
)

s
max

y
max
2
3

b
h


E
b
h
Materials Process Design and Control Laboratory
13
Schematic of the testing cycle
Uncertainty added due to manufacturing variations
Modified beam parameter values are uncertain
Compliance design
Prototype
To consider any design modifications for better
performance in customer operating conditions
Manufacturing process
Compliance testing
Pre-compliance design beam
Actual production beam parameters may vary from
those of compliance test
Currently produced beams
  • Testing facility has to be recorrelated for
    every new beam

Test conditions variations, measurement errors
and other undefined uncertainties
Materials Process Design and Control Laboratory
14
Problem statement
  • Simulation of the beam cycle matching problem
  • A design specification (L, b, h, E) for the beam
    is introduced. These pre-compliance parameters
    serve as basis for production of the prototype
    beam.
  • Since the operating conditions used for testing
    may not be the same as the customer operating
    conditions, the inputs are adjusted to give the
    same maximum operating stress.
  • Prototype of the beam is produced based on
    pre-compliance specifications and compliance
    tests are carried out to consider any required
    design modifications to the beam that can improve
    its performance under customer operating
    conditions.
  • Depending on the compliance tests the beam
    parameter values are updated. These values are
    used in subsequent production of beams.
  • Beams are now produced. Production acceptance
    tests are carried out on each of the manufactured
    beams. These tests are carried out in the
    presence of the customer under test operating
    conditions that best represent the customer
    operating conditions.

Materials Process Design and Control Laboratory
15
Problem statement
  • Uncertainties are introduced in the above
    framework due to following reasons
  • Changes in manufacturing characteristics (L, b,
    h, E) after compliance test
  • Manufacturing variations Variations introduced
    during production due to tooling errors,
    machining allowances, etc.
  • Test variations Variations in test site
    surrounding conditions for e.g. temperature,
    humidity, presence of any new structure, error in
    measurement of test quantities, etc.
  • Other measurement errors Errors introduced in
    final reported values of beam parameters due to
    measurement uncertainty
  • The beams in production can show different
    results if subjected to compliance tests

Materials Process Design and Control Laboratory
16
Objectives and questions to address
Data collected from previous production
acceptance tests and compliance tests
Can we estimate the beam parameters along with
their intervals of confidence?
Can we update the model as and when new test data
arrive instead of building the model over again?
Materials Process Design and Control Laboratory
17
Problem statement ...
  • Information provided
  • Pre-compliance design values for the beam
    parameters
  • A model for the beam that provides functional
    input-output relationship. Inputs here are the
    load (P) and the point of application (a). The
    measured outputs are the maximum stress (smax)
    and the maximum deflection (Ymax)
  • A brief description of compliance and acceptance
    test conditions
  • Estimates of the manufacturing and measurement
    variations (actual values are not known). These
    estimates would be of use in probabilistic
    methods like Bayesian methods
  • Data collected based on 20 acceptance tests and 2
    compliance tests. This data comprises of the test
    inputs and the measured outputs

Materials Process Design and Control Laboratory
18
Objectives
  • Based on the provided information
  • Obtain optimal estimates for the beam parameters
    (L, b, h, E) such that future use cases can be
    predicted with best possible accuracy
  • Compute the intervals of confidence for these
    estimates
  • Predict the test outcomes for new unseen inputs,
    perform accuracy studies for the above obtained
    beam parameter values test error,
    cross-validation, etc.
  • Predict and/or explain what would be the outcome
    if compliance tests were performed on the beams
    in production.
  • Investigate the effect of manufacturing and
    measurement variations on the estimated
    parameters This can be viewed as a sensitivity
    problem

Several methods can be used to achieve these
objectives
Materials Process Design and Control Laboratory
19
Three individual methods were examined
Regression approach
  • Mean squared loss
  • Weighted mean square loss
  • Linear robust loss

Materials Process Design and Control Laboratory
20
Software environment (S-Plus)
A typical S-Plus working environment
objects
analysis plots
data
Materials Process Design and Control Laboratory
21
General regression models
  • In general regression theory, the output Y is
    modeled as a function of the input (X) and the
    system parameters (?) as follows
  • A loss function that is a function of the output
    (Y), input (X) and the system parameters (?) is
    associated with the regression model. This
    regression problem is in general a hard problem
  • In special cases where the output may be written
    as follows

(?1,...,?s),

(X1,...,XP)
f
Y
ß
m
m
we can use highly efficient variants of linear
regression (squared loss, weighted square loss
and robust regression) to find the optimal
regression parameters ßs and then use them for
prediction.
Materials Process Design and Control Laboratory
22
Forms of loss functions
  • The type of regression methods vary with the
    form of loss functions used. Three of the most
    popular methods use the following loss functions

2
?
-

))
,
(
(
X
f
Y
E
loss
Squared
?
-
)
,
(
X
f
Y
r
)
(

E
loss
Robust
ˆ
s
?
-
S
?
-
-
1
T
)
,
(
(
))
,
(
(
X
f
Y
X
f
Y
E
loss
Weighted
f(X,?) is the known functional relation, S is the
covariance matrix of input random vector, ? is a
bounded loss function and s is a robust scale
parameter
  • For the beam problem, we can express ymax and
    smax as follows

2
3


b
b
Pa
Pa
y
2
1
max

b
Pa
s
3
max
-
6
2
6L



b
b
b
,
,
3
2
1
2
3
3
bh
Ebh
Ebh
This type of input/output functional
relationships enable usage of linear regression
techniques for prediction purposes
Materials Process Design and Control Laboratory
23
System parameter estimation
  • For sufficiently smooth loss functions, we can
    obtain the system parameters (?) by minimizing
    directly L with respect to the system parameters
    (using e.g. gradient optimization techniques).
    This is the case with squared loss and weighted
    squared loss regression.
  • For a loss function with discontinuous
    gradients, we have to use the regression
    coefficients b to obtain estimates of the system
    parameters. This is the case for robust
    regression.

Materials Process Design and Control Laboratory
24
A gradient-based optimization approach
Direct approach (squared and weighted loss only)
  • We minimize the loss function L directly with
    respect to the system parameters ? (L, b, h,
    E), thus obtaining optimal estimates. System
    parameter evaluation here does not require
    knowledge of the regression parameters b.
  • The loss function is expanded in a Taylor series
    about each design point as follows
  • For positive definite Hessian of L, we obtain a
    unique step size Dqk

Materials Process Design and Control Laboratory
25
A gradient based optimization approach ...
  • Mean squared loss function

loss
Squared
-
-
)2
ˆ
(
Deflection
)2,
ˆ
(
Stress
ymax
ymax
E
smax
smax
E
ˆ
smax and ymax are the predictions of the
maximum stress and deflection Correlation
between outputs is ignored leading to different
parameter estimates for stress and deflection
  • The two outputs can be correlated by using the
    weighted loss function with Mahalonobis distance

ˆ
ˆ
-
S
-
-
T
1
Y
Y
Y
E
)
(
(
loss
Weighted
Y
)
ˆ


ymax
Y

and

where
T
Y
T
,
smax
,
ˆ
smax
ˆ
ymax
Materials Process Design and Control Laboratory
26
Estimation of parameters for robust regression
  • The loss function of robust regression is given
    as
  • ?(.) is symmetric bounded loss function and
  • s is a robust scale estimate for residuals

Note S-plus allows for various functional forms
of ?(.) and s.
  • For linear robust regression f(X,?) ßTX and the
    robust loss function is minimized with respect to
    the regression parameters b.
  • The best estimate of ß is based on nearness to
    an appropriate linear regression estimate ßo
    which is defined by
  • This is called a robust MM-estimator

Ref Yohai, V. J. and Zamar, R. H. (1998),
Optimal Locally Robust M-estimates of
Regression. Journal of Statistical Planning and
Inference, 64, 309-323.
Materials Process Design and Control Laboratory
As post-processing, after obtaining the
regression parameters, the system parameter
?L,b, h, E can be computed by solving the
following optimization problem
Above robust loss function has discontinuous
gradients, we use an indirect approach
27
Estimation of parameters for robust regression ...
  • Since the robust loss function has discontinuous
    gradients, we use an indirect approach
  • The regression parameters b are estimated using
    inbuilt programs in S-Plus. These parameters can
    be used for prediction.
  • As post-processing and after obtaining b, the
    system parameters ?L, b, h, E can be computed
    by solving the following optimization problem
  • This kind of prediction of system parameters
    does not always give good results due to biasing
    of regression coefficients b and the nature of
    the spread of data (which was the case for the
    beam problem)
  • Prediction using robust regression is not
    affected by system parameter evaluation!

Materials Process Design and Control Laboratory
28
Advantages and disadvantages of robust regression
  • Advantages
  • Low error rates
  • No assumptions are made about the statistical
    nature of error
  • Prediction does not require information about the
    system parameters (less function evaluations)
  • Outliers (e.g. erroneous or redundant data) can
    be identified and avoided for predictions
  • What are Outliers, What problems do they cause ?
  • Outliers are data from unusual events, for
    example measurement of deformation with an
    improperly mounted strain-gauge

Outliers do not follow the general trend in data
Outliers
Materials Process Design and Control Laboratory
29
Outliers ...
  • Outliers are data that are too noisy due to bad
    data handling
  • Outliers are redundant data i.e. data that
    convey nearly the same information
  • Problems ...
  • Parameter estimates depend on outlier data
  • Probability density function of outliers are
    nearly the same as those of correct data near the
    center but tail probabilities differ considerably
  • Increase Bias of the estimates or predictions
  • Lead to inflated residual sum of squares

Inflated tail probabilities indicating presence
of outliers
PDF plot for smax
Disadvantage Robust regression may lead to
erroneous estimates for beam parameters if the
amount of data is small, however it can be used
to provide reliable predictions.
Materials Process Design and Control Laboratory
30
Results of the three regression methods
  • The optimal system parameters and their
    Cross-Validation (CV) errors in prediction for
    the training set (provided by GE-AE) are
    tabulated below.
  • Initial guess used was L, b, h, E 22.15in,
    2.08in, 1.092in, 0.995e07 -- these are the
    sample means for the production beams. The
    algorithm however showed high sensitivity to
    initial guess, thus sample mean was chosen as the
    best guess.
  • The compliance test results were incorporated
    into the regression process with an importance
    factor Wimp between 0,1 (results shown here
    for Wimp0.1).

A question --- What is Cross-Validation?
Materials Process Design and Control Laboratory
31
Cross-Validation (CV)
  • In general

Training set ( build model)
Sample data set
Testing set ( check performance of model)
  • In the beam simulation problem, only training
    data is available (small sample data set). Then
    CV is used to test the general prediction error
    of the model.
  • Ideas of Cross-Validation
  • Total K data pairs (Xi, Yi), i 1, 2, , K
  • Let (Xi, Yi) be the ith record

y
x
Materials Process Design and Control Laboratory
32
Cross-Validation (CV)
  • In general

Training set ( build model)
sample data set
Testing set ( check performance of model)
  • In the beam simulation problem, only training
    data is available (small sample data set). Then
    CV is used to test the general prediction error
    of the model
  • Ideas of Cross-Validation
  • Total K data pairs (Xi, Yi), i 1, 2, , K
  • Let (Xi, Yi) be the ith record
  • Temporarily remove (Xi, Yi) from the dataset

y
Materials Process Design and Control Laboratory
33
Cross-Validation (CV)
  • In general

Training set ( build model)
sample data set
Testing set ( check performance of model)
  • In the beam simulation problem, only training
    data is available (small sample data set). Then
    CV is used to test the general prediction error
    of the model
  • Ideas of Cross-Validation
  • Total K data pairs (Xi, Yi), i 1, 2, , K
  • Let (Xi, Yi) be the ith record
  • Temporarily remove (Xi, Yi) from the dataset
  • Train on the remaining K-1 data points

y
x
Materials Process Design and Control Laboratory
34
Cross-Validation (CV)
  • In general

Training set ( build model)
sample data set
Testing set ( check performance of model)
  • In the beam simulation problem, only training
    data is available (small sample data set). Then
    CV is used to test the general prediction error
    of the model
  • Ideas of Cross-Validation
  • Total K data pairs (Xi, Yi), i 1, 2, , K
  • Let (Xi, Yi) be the ith record
  • Temporarily remove (Xi, Yi) from the dataset
  • Train on the remaining K-1 data points
  • Note your prediction error at (Xi, Yi)
  • When youve done all points, report the
  • mean error.

In the beam problem, we reported the relative CV
errors on the training data set
Materials Process Design and Control Laboratory
35
Measured maximum stress v.s. the prediction by
linear robust regression
4600
Estimate smax
4500
4400
4300
4200
4300
4400
4500
4600
4700
4800
smax
Materials Process Design and Control Laboratory
36
Measured maximum deflection v.s. the prediction
by linear robust regression
-0.038
-0.039
Estimate Ymax
-0.040
-0.041
-0.050
-0.045
-0.040
-0.035
-0.030
Ymax
Materials Process Design and Control Laboratory
37
Discussion of results from regression methods
  • The linear robust regression has the minimum CV
    error but estimates of parameters are highly
    biased. It requires more combined parameters ß in
    order to get better estimates
  • Square loss and weighted square loss gave almost
    the same estimates and CV errors (i.e. the two
    output quantities are highly independent on each
    other, which is also verified by the figure in
    the next page)
  • Compliance test data were
  • incorporated in the training set
  • with different Wimps. CV error
  • goes up with increasing Wimp
  • Linear robust regression is the
  • best in prediction.

Materials Process Design and Control Laboratory
38
Plot of dependencies in data
4.57
4.58
4.59
4.60
4.61
4.62
4100
4300
4500
4700
4900
22.00
22.05
22.10
22.15
22.20
22.25
1.000
1.025
1.050
1.075
1.100
1.125
20
15
10
description
5
0
22.25
22.20
L
22.15
22.10
22.05
2.10
22.00
2.08
2.06
b
2.04
2.02
2.00
1.125
1.100
h
1.075
1.050
1.025
420
1.000
410
P
400
390
380
4.62
4.61
a
4.60
4.59
4.58
4.57
-0.031
-0.036
ymax
-0.041
-0.046
-0.051
4900
4700
smax
4500
4300
4100
380
390
400
410
420
-0.051
-0.046
-0.041
-0.036
-0.031
0
5
10
15
20
2.00
2.02
2.04
2.06
2.08
2.10
The measured data does not seem to indicate any
possible dependencies
Materials Process Design and Control Laboratory
39
Bayesian approach
Maximum A Posteriori Probability Method
Materials Process Design and Control Laboratory
40
An introduction to a Bayesian approach (MAP)
  • Whats new?
  • - Know the uncertainty information (pdf) of
    parameters to be estimated
  • - Start from a prior (known) pdf of parameters
    ( ) , derive a posterior
  • pdf of parameters ( )
    conditioned on the given experiment data
  • - Maximize the posterior pdf with respect to the
    parameters, i.e. find the
  • parameters having the highest probability
    given the experimental data
  • This is the so called Maximum A
    Posteriori Probability (MAP) Method

Bayesian approach - Makes use of Bayes
theorem
Materials Process Design and Control Laboratory
41
MAP Estimator of the example beam problem
Methodology Maximize
with respect to ? and XK, i.e. what are the most
possible ? and XK given YK
where, ? L, b, h, E ----- the system
parameter vector XK P(1),
a(1), P(2), a(2), , P(20),a(20) ----- input
vector YK ymax(1), smax(1),
ymax(2), smax(2), , ymax(20), smax(20) -----
output vector Loss function L -log(
P(?,XK YK) ) Minimize L with respect to ? and
XK. The denominator of P(?,XK YK) is taken as
constant in the optimization process. How to
obtain P(?), P(XK) and P( YK ?, XK )?
q
,
(
P
)

YK
XK
)
(
)
(
)
,

(
)
,
(
)
,

(
XK
P
P
XK
YK
P
XK
P
XK
YK
P
q
q
q
q


)

,
(
YK
XK
P
q
)
(
)
(
YK
P
YK
P
Materials Process Design and Control Laboratory
42
MAP Estimator of the example beam problem
Assumptions All manufacturing and testing
variations are independent Gaussian random noises
with zero mean. ?
X Functional relations (for each
single test) where, W
and We can obtain Y?,X
)
,
(
P
N
q
q
6
X
X
-
)
3
(
2
?
X
X
X

,
1
2
f
1

1
1
2
f
2
2
?
?
1
3
?
?
?
3
2
3
2
4
ù
é
f
1
)
,
(
P
N
ú
ê
W
f
û
ë
2
Materials Process Design and Control Laboratory
43
MAP Estimator of the example beam problem
Formulation of loss function where C is a
constant representing the denominator term of
P(?, XKYK ) Optimization of the loss function
(Gauss-Newton Method)
1
1

Materials Process Design and Control Laboratory
44
Evaluation of MAP estimator
  • Biased and unbiased estimator
  • - Let be the estimate of a unknown parameter,
    if
  • E ( ) ?true
  • Then, the estimator is unbiased. Otherwise, it
    is biased.
  • - Unbiased estimator is desired because it
    reflects the true system parameter
  • - A practical approach to check unbiasedness
    Monte-Carlo simulation
  • Standard deviation of the estimates should be
    close to the
  • true std values if known (beam problem)

Materials Process Design and Control Laboratory
45
Evaluation of MAP estimator by Monte-Carlo
Simulation
This function uses the arbitrary picked ?true to
generate outputs at a set of input points by
using the truth model of the system, namely the
f1 and f2 functions. Then it adds randomly
obtained noise to these outputs to generate
simulated measurements for the MAP estimator
Set iteration number N of M-C simulation and
?true
Truth model function of the beam
It returns an estimated parameter vector . If
the estimator is unbiased, expectation of
should be ?true
This averaged error should converge to zero for
large N
MAP estimator
Compute the error of estimation
Average the estimation error over N
Yes
No
Nth iteration?
A 1000 run of M-C simulation showed E( ) is a
good approximation of ?true
Materials Process Design and Control Laboratory
46
Results of MAP
  • Data used in the MAP estimator
  • The manufacturing variations were used in P?
  • Measurement variations were used in PX and PW
  • Mean of input vector took the value of the
    measurements
  • All 20 acceptance testing data were used as
    training data set
  • Two different means were used for ? the sample
    mean of 20 products and the pre-compliance
    specifications
  • Computed results

Materials Process Design and Control Laboratory
47
Results of MAP
These errors were obtained by comparing prediction
s at compliance test condition with previous
compliance test results. It will be much more
valuable to compare with the compliance
test results done to production beams since the
design specifications have been changed
Materials Process Design and Control Laboratory
48
Discussion of results from MAP
  • The standard deviations (confidence intervals) of
    parameter estimates computed by using product
    sample mean are much less than that by using
    pre-compliance specifications. The standard
    deviations of the former are very close to the
    true values.
  • By using product sample mean, we also have less
    CV errors.
  • Since the MAP estimator is unbiased and the
    estimates computed by using product sample mean
    have very accurate standard deviations, it is
    able to say it provides a precise estimation of
    the parameters.
  • The MAP estimator is sensitive to the assumptions
    of manufacturing and measurement variations.
  • Since the design parameters have been changed
    after compliance test, it makes no sense to use
    compliance test data in estimation. To improve
    the accuracy of estimation and prediction at
    compliance test condition, the modified design
    parameters (after compliance test) should be
    given as the mean of prior pdf of ?.

Materials Process Design and Control Laboratory
49
An extension of the MAP method
  • A question ----- the manufacturing variation may
    not be
  • available in a real situation
  • Possible solution (pdf filtering)
  • - Assume arbitrary variations for the parameters
    (but still Gaussian)
  • - Use the product data (sample data) to compute
    a sample mean and sample variation
  • - Start the estimation by using computed sample
    pdf
  • - Update the prior pdf mean with new estimates
    and compute the new sample variations
  • - Repeat the process until it converges
  • With large set of data from product acceptance
    tests distributed over the
  • full range of true pdf, the above algorithm is
    able to compute the best
  • parameter without knowing the manufacturing
    variation

Materials Process Design and Control Laboratory
50
Illustration of Pdf filtering process
Probability
Value of ?
Materials Process Design and Control Laboratory
51
Comparison of the methods
  • In general, the regression methods provide lower
    cross-validation errors
  • Linear robust regression has lowest CV error in
    prediction of maximum
  • deflection but the estimates are biased due to
    the nature of this method
  • The Bayesian method has higher CV errors but it
    provides better estimates
  • of the parameters (unbiased and small standard
    deviation)

Materials Process Design and Control Laboratory
52
Summary
  • One can use a combination of linear robust
    regression and Bayesian
  • method to perform the beam simulation and
    matching problem. Linear
  • robust regression is used to predict the maximum
    deflection and stress at
  • given new testing points and Bayesian method is
    used to estimate the
  • system parameters and their confidence intervals.
  • It will be helpful if
  • Modified design parameters are known after
    compliance test
  • The acceptance tests are performed at various
    points (different P and a)

Materials Process Design and Control Laboratory
53
Discussion of simulation matching process
  • Key issues in simulation matching problem

Dominant output of system at compliance test
condition (smax in beam problem)
Nominal value of system output (requested by user)
Materials Process Design and Control Laboratory
54
Discussion of simulation matching process
Key issues in simulation matching problem
Dominant output of system at compliance test
condition (smax in beam problem)
Nominal value of system output (requested by user)
match
match
Prediction of system output at compliance test
(based upon best estimation of system
parameters)
Materials Process Design and Control Laboratory
55
Discussion of simulation matching process
Key issues in simulation matching problem
Dominant output of system at compliance test
condition (smax in beam problem)
Nominal value of system output (requested by user)
First stage
match
match
Prediction of system output at compliance test
(based upon best estimation of system
parameters)
- Compliance test is done for
prototypes of product - Modify the design
system parameter to meet nominal output -
Manufacturing products using modified
parameter - Repeat until nominal
value obtained
Materials Process Design and Control Laboratory
56
Discussion of simulation matching process
Key issues in simulation matching problem
Dominant output of system at compliance test
condition (smax in beam problem)
Nominal value of system output (requested by user)
First stage
Second stage
match
match
Prediction of system output at compliance test
(based upon best estimation of system
parameters)
- Compliance test is done for
prototypes of product - Modify the design
system parameter to meet nominal output -
Manufacturing products using modified
parameter - Repeat until nominal
value obtained
  • - Estimation is based
  • on data from product acceptance test
  • More data, better accuracy
  • The estimation process should be recursive (for
    each new data set,
  • only need to update the
  • previous estimate)

Materials Process Design and Control Laboratory
57
Discussion of simulation matching process
Key issues in simulation matching problem
Dominant output of system at compliance test
condition (smax in beam problem)
Nominal value of system output (requested by user)
First stage
Second stage
match
match
Prediction of system output at compliance test
(based upon best estimation of system
parameters)
- Compliance test is done for
prototypes of product - Modify the design
system parameter to meet nominal output -
Manufacturing products using modified
parameter - Repeat until nominal
value obtained
  • - Estimation is based
  • on data from product acceptance test
  • More data, better accuracy
  • The estimation process should be recursive (for
    each new data set,
  • only need to update the
  • previous estimate)

Materials Process Design and Control Laboratory
58
Need and objective of cycle matching
Objective To make accurate predictions on new
data, thereby avoiding expensive testing
Very expensive testing
Back to back testing
Nozzle testing
Bird hit testing
. . .
Information about all previously conducted tests
and simulations
Stress testing
Updating machine learning method
Statistical model learnt from a huge collection
of previous experiments
testing/simulation data for a new engine /engine
for maintenance
Update system parameters based on new test and
simulation results
Fluid flow interactions
Thermodynamics
. . .
Combustion
Highly complicated simulations
Materials Process Design and Control Laboratory
59
Recursive machine learning system for cycle
matching
A fully converged learnt system
Updated values for system parameters
Info
Full fledged system model based on data
Materials Process Design and Control Laboratory
60
Acknowledgements
  • This preliminary work on statistical machine
    learning
  • techniques and the beam cycle problem to test
    simulation matching techniques were supported by
    GE-GRC, Advanced Mechanical Technologies Program,
    Dr. R. Irani (program manager).
  • Special thanks to Dr. R. Sampath (GE-GRC) for
    several clarifications on the problems examined.

Materials Process Design and Control Laboratory
Write a Comment
User Comments (0)
About PowerShow.com