Adaptive Agents - PowerPoint PPT Presentation

1 / 27
About This Presentation
Title:

Adaptive Agents

Description:

Adaptive Agents. Diana Gordon. Navy Center for Applied Research in AI. Naval Research Laboratory ... Diana Gordon and William Spears. Insup Lee and Oleg ... – PowerPoint PPT presentation

Number of Views:39
Avg rating:3.0/5.0
Slides: 28
Provided by: theoryS
Category:
Tags: adaptive | agents | diana

less

Transcript and Presenter's Notes

Title: Adaptive Agents


1
Adaptive Agents
  • Diana Gordon
  • Navy Center for Applied Research in AI
  • Naval Research Laboratory
  • May 9, 2000

2
Outline
  • Agents that are adaptive, predictable and timely.
  • Adaptive supervisory control of multi-agent
    systems.
  • Epidemiological model of computer virus spread.
  • Potential transitions.

3
APT Agents Agents That Are Adaptive,
Predictable, and Timely
Diana Gordon Naval Research Laboratory
4
Agents Embedded in an Environment
automaton strategy
multi-agent automaton strategy
(simulated) environment
5
Re-Verification of APT Agents Strategies
O F F L I N E
strategy
Verification
O N L I N E
strategy
Adaptation
NEW SITUATION
Rapid Re-verification
revised strategy
6
Adaptation
7
(Co-)Evolving Agent Strategies
select parents from population
population of strategies
parents
perturbation (learning) operators applied
return offspring to population
offspring
fitness
evaluation
environment
8
Learning (Perturbation) Operators
  • Not addressed at this time
  • Abstraction/concretion of the Boolean algebra for
    transition conditions (see ICML98 paper)
  • Add/delete state
  • Edge operators are currently being addressed
  • Add/delete edge
  • Generalize/specialize transition condition
  • Move transition condition
  • Preserves determinism and
  • completeness (relevant for agents)

F-deliver I-receive
F-collect I-deliver
9
Predictability
10
The Need for Formal Verification
  • Testing alone has the following drawbacks
  • The process of evolution with testing is slow.
  • It doesnt provide strict behavioral guarantees.
  • Properties that are more critical require formal
    verification.
  • Model checking is applied after each learning
    occurrence.
  • (Current approach Verify global Invariance
    and Response properties with a model checker that
    does a depth-first search.)
  • If model checking outputs failure, either a new
    learning operator is selected or the strategy
    repaired.
  • To model check global multi-agent properties, the
    product automaton (multi-agent strategy) is
    formed.

11
Timeliness
12
Objective Rapid Re-verification
  • Re-verification from scratch.
  • Time-inefficient. If m actions for each of n
    agents, time complexity is O(m ).
  • Restrict learning using a priori results.
  • Safe machine learning operator (SMLO)
  • S P gt o(S) P
  • Safety guarantee with no run-time re-verification
    cost!
  • Incremental re-verification.
  • Useful when general a priori results are negative
    or difficult to obtain.
  • Time efficiency gained by localizing.

n
-(
-)
-)
13
A Priori Results
  • New results for Invariance and Response
    properties.
  • For operators with negative a priori results, we
    need incremental.

14
Three Types ofIncremental Algorithms
15
Streamline Formation of Product Automaton Knowing
That Learning Occurred
Re-form product transitions affected by learning
product state
product state
Agent 1 state3
Agent 3 state1
Agent 2 state5
Re-use product transitions not affected by
learning
product state
Agent 1 state3
Agent 3 state7
Agent 2 state5
16
Incremental Re-verification That Capitalizes on
Knowing That Learning Occurred
Learning applied here
Only need to re-verify downstream from the
learning site
Advantage Has been demonstrated to be more time
efficient than total
re-verification on some practical
problems. Limitation Worst-case time complexity
is same as total re-verif.
17
Incremental Re-verification Algorithms Specific
for Generalization and Particular Property
Classes (Invariance, Response)
Generalization applied here
Works because generalization has a local
effect on accessibility
Only need to re-verify locally
Advantage Typically theyre extremely
time-efficient. Limitations Overly cautious --
may find false errors.
18
Evaluation of Incremental Re-verification
Algorithms
  • Theoretical worst-case time complexity analysis,
    e.g.,
  • The second type of incremental algorithm saves no
    time over
  • total re-verification from scratch in the
    worst case.
  • The incremental algorithms for generalization are
    unaffected by the number of automaton states.
  • Empirical cpu time comparisons in a natural
    setting
  • All incremental algorithms were faster than total
    re-verification, though improvement for second
    type not statistically significant.
  • The incremental algorithm for generalization and
    Response properties showed a 1/2-billion-fold
    speedup (on average) over total re-verification
    on 274,000-state product automata.

19
Applications
  • Coordinating planetary rovers.
  • Competition for resources simulation.

20
Adaptive Supervisory Control of Multi-Agent
Systems
  • Diana Gordon, Naval Research Laboratory
  • Kiriakos Kiriakidis, U.S. Naval Academy

21
The Challenge
  • Repairing errors detected by verification in a
    multi-agent system is a highly challenging credit
    assignment problem, e.g.,
  • What is the source of the inter-agent conflict?
  • What is the best way to resolve this conflict?
  • Our solution
  • Recast the multi-agent paradigm as discrete
    supervisory control (see Ramadge Wonham).

22
Our Solution Adaptive Supervisory Control
learning
(re-)verify
AGENT 1 STRATEGY
DESIRED MULTI-AGENT BEHAVIOR
AGENT 2 STRATEGY
SUPERVISOR
AGENT 5 STRATEGY
repair if (re-)verification fails
23
Last MURI Board Meeting Evolving Better
Strategies for Resource Competition
  • Naval Research Laboratory

Diana Gordon and William Spears
Insup Lee and Oleg Sokolsky
University of Pennsylvania
24
Last MURI Board Meeting Future Directions
  • Integrate virus simulation with MaCS
  • Co-evolving virus and anti-virus
  • More realistic communication topologies
  • Epidemiological model of virus/anti-virus spread
    through a network

25
Current Progress on Virus Modeling
  • Kephart White (1991 1993)
  • Diff eqs describing the time evolution of the
    fraction of infected nodes in a network.
  • di/dt ??i(1- i) - ? i
  • Advantage Closed-form solutions.
  • Limitation They had difficulty solving the diff
    eqs for complex topologies and realistic
    assumptions.
  • Spears Gordon
  • Extending a (computational) Markov model approach
    used to analyze evolutionary algorithms to apply
    to modeling virus
  • and anti-virus spread through a network.
  • Advantage and limitation is opposite Kephart
    White.

26
Questions Well Be Able to Answer with the Markov
Model
  • What is the
  • expected number of infected/inoculated nodes at
    some future time?
  • probability of virus extinction at a certain
    time?
  • expected waiting time until extinction?

27
Potential Transitions
Global monitoring UAV
Using Artificial Physics, MAVs form a hexagonal
lattice sensing grid
- Ongoing discussions with Jill Dahlburg and Rick
Foch in NRL TEW about transitioning the
Artificial Physics with global monitoring
(MaC) to real MAVs. Other NRL vehicles also
being considered. Need to extend AP
from 2D to 3D - Kiriakos Kiriakidis has a student
who wants to transition our work to mini-robots.
Write a Comment
User Comments (0)
About PowerShow.com