Directed Exploration of Complex Systems - PowerPoint PPT Presentation

1 / 23
About This Presentation
Title:

Directed Exploration of Complex Systems

Description:

Numerical simulations widely used within NASA and other agencies to investigate ... Serendipitous discovery of Ida-Dactyl pair by Galileo Spacecraft. ... – PowerPoint PPT presentation

Number of Views:31
Avg rating:3.0/5.0
Slides: 24
Provided by: nta3
Category:

less

Transcript and Presenter's Notes

Title: Directed Exploration of Complex Systems


1
Directed Exploration of Complex Systems
  • AISR PI Meeting
  • 2008/05/06

Michael C. Burl, Brian Enke, William J.
Merline
Jet Propulsion Laboratory
Southwest Research Institute
2
Motivation
  • Numerical simulations widely used within NASA and
    other agencies to investigate complex phenomena
    that could not be studied otherwise.
  • Long run times limit amount of knowledge that can
    be extracted.
  • Some parallels with autonomous exploration agent
    that is able to perform experiments in the
    world.
  • Serendipitous discovery of Ida-Dactyl pair by
    Galileo Spacecraft.

3
Simulations to Study Formation
4
Courtesy D. Durda, SwRI
5
Dynamical Systems Viewpoint
M. C. Burl V-0615-b
  • Simulator as discrete-time dynamical system
  • Restrict attention to non-chaotic simulations
    with known initial state x0, no control input (uk
    0) and no process or measurement noise (wk 0,
    vk 0).
  • State at time T is completely determined by q
    q, x0

INPUT SPACE Q set of possible parameter vectors
6
Landscape Characterization
M. C. Burl V-0616
Simulator xT, zT
1
q
q(q)
-1
Grading Script
Which input values q result in q(q) 1?
q1
Inverse Mapping
q2
Solution Region
1
q3
7
Example Grading Scripts
  • Do satellites exist in bound orbits about the
    Largest Remnant (LR) at T4 days (SMATs)?
  • Did the collision produce any EEBs?
  • Was the collision catastrophic? (threshold on
    mass of LR)
  • Is the shape of the LR elongated?
  • Does the resulting size-frequency distribution
    match that of a particular (observed) asteroid
    family with sufficient fidelity?
  • Does the collision produce a binary pair that is
    likely to be observable (e.g., to a ground
    telescope)?

Within the scope of one simulator, we can address
many different science questions by swapping in
different grading scripts.
8
Modification of Concept Learning
M. C. Burl V-0617
  • Q is typically continuous-valued
  • cannot just enumerate all points in solution
    region
  • Really want a function

-1, 1
q Q
such that q agrees with q over most of domain
  • Like concept learning, but some key differences

CL
Simulators
  • Fixed set of training
  • examples drawn iid
  • Simulator serves as an oracle
  • Choice of examples conditional
  • on previous choices
  • Very expensive to get labels
  • Potential for introspection

9
Represent q with SVM
M. C. Burl V-0618
  • q(q) S bi K(q, si)

i
i-th support vector
scalar weight
kernel function
  • Learned SVM depends on hyperparameters (l)

Kernel type, kernel-specific variables, C
(n)
ql (q) SVM learned after n-th iteration
using hyperparameters l.
10
Active Learning
M. C. Burl V-0619
  • Determine which new point(s) in input space, if
    labeled, would be most informative for refining
    the boundaries of the solution region
  • Imagine discretized input space
  • L(n) set of labeled instances at end of trial
    n (red or green)
  • U(n) set of unlabeled instances at end of
    trial n (black)
  • Choose q(n1) from U(n) such that the SVM learned
    from
  • L(n1) L(n) q(n1)
  • will bring q closer to q.

11
Active Learning (cont)
M. C. Burl V-0620
  • Simple TongKoller, 01 choose unlabeled point
    closest to current SVM decision boundary
  • q(n1) arg min ql(n) (q)
  • q in U(n)
  • Other methods (MaxMin Margin, Diverse) have not
    worked particularly well.
  • Probabilistic Simple choose higher-ranked
    queries with higher probability (dont always
    pick best)
  • Trade-off between exploration and exploitation.
  • Information-theoretic selection (recent)

12
Exploration vs Exploitation
M. C. Burl V-0626
13
Simulator Details
M. C. Burl V-0630
  • Asteroid Collisions
  • SPH and N-body code
  • q(q) formation of gravitationally bound system
  • Q impact velocity, angle, mass ratio
  • 5 X 5 X 6 grid (150pts)
  • 25 positives on grid
  • 10 days on 1GHz CPU!
  • Magnetosphere
  • 10-parameter Roelof Model to determine plasma
    distribution and predict ENA image
  • q(q) smoothness and match to spacecraft-observed
    ENA image.
  • Q 3 key Roelof params
  • 9 X 9 X 9 grid (729pts)
  • 4 hours on 1GHz CPU

14
Evaluation Methodology
M. C. Burl V-0621
  • Underlying q() function is not known
  • Evaluate q() on as fine a grid as practical
  • Consider problem solved if q_hat() and q() agree
    on sufficient fraction of grid points
  • Look at learning curves (agreement vs n)
  • Subgrid accuracy?

15
Decision Surface
M. C. Burl V-0627
16
Magnetospheric Inversion
M. C. Burl V-0623
17
Reduction in Simulation Trials (catastrophic
collision subproblem)
Percentages of total number of grid runs required
to achieve a desired level of agreement between
the grid and active learning. Low percentages
are good (for example, 20 a 5x speed-up by
using active learning).
18
Leveraging the Time-Savings
  • Same final result can be obtained with fewer
    simulation trials.
  • More simulation trials can be conducted in a
    given amount of time.
  • Higher-fidelity (e.g., finer spatio-temporal
    resolution) simulation trials can be used.
  • More trials can be concentrated at the region
    between interesting and non-interesting regions.

19
Hyperparameters Accuracy
M. C. Burl V-0629
  • Choose l l0 upfront
  • Consider finite pool L
  • Try to get accuracy estimate
  • Use best or combine wishes of multiple SVMs
  • Instantaneous x-val
  • Bayesian approach
  • Maintain posterior distribution on accuracy
  • Choose next point based on weighted utility
  • Provides smoothing of accuracy estimates

20
Hyperparameter Experiments
L. Scharenbroich, D. Mazzoni
21
Battleship Game
M. C. Burl V-0631
  • Spatial coherence/correlation
  • Can the simulator be compressed?
  • Kolmogorov complexity
  • Can an SVM efficiently represent q(q)?

22
Upcoming Work
  • Maturing
  • Software and deployment issues
  • Optimization of resources
  • Explicit balancing of exploration vs exploitation
  • Completion
  • Broadening
  • New simulations
  • High-dimensional input spaces
  • Information-theoretic active learning
  • Gaussian Processes to describe spatial
    correlation
  • Markov Chain Monte Carlo Sampling
  • Continuous-valued landscapes

23
Acknowledgments
  • Dennis DeCoste, Dominic Mazzoni, Lucas
    Scharenbroich, Kiri Wagstaff, Alex Holub, Pietro
    Perona
  • Dan Durda, Bill Bottke, Jorg Micha-Jahn
  • Erik Asphaug, Derek Richardson, Zoe Leinhardt
  • NASA Intelligent Systems Program (early
    prototype)
  • NASA Applied Information Systems Program

Publications
  • SIAM Data Mining
  • Icarus
  • European Conf. on Machine Learning (ECML)
  • CVPR Workshop on Online Learning
  • NASA Tech Briefs
  • JPL New Technology Report
Write a Comment
User Comments (0)
About PowerShow.com