Automating Network Diagnostics to Help EndUsers - PowerPoint PPT Presentation

1 / 11
About This Presentation
Title:

Automating Network Diagnostics to Help EndUsers

Description:

... to get email about an outage ... Freedom of information: Outage info should be available to all ... Privacy: Outage info should be available only to those ... – PowerPoint PPT presentation

Number of Views:29
Avg rating:3.0/5.0
Slides: 12
Provided by: dave213
Category:

less

Transcript and Presenter's Notes

Title: Automating Network Diagnostics to Help EndUsers


1
Automating Network Diagnostics to Help End-Users
  • Dave Thaler
  • dthaler_at_microsoft.com

2
Motivation
  • User needs are becoming more dependent on the
    Internet
  • Problems can occur anywhere in the Internet
  • Users have no control over these problems, and
    often neither do their direct providers!
  • Increased support calls are NOT the answer

3
Poor error messages today
  • User doesnt understand the problem, only the
    symptom
  • Error message pop-ups often arent helpful to
    users
  • Event logs arent really any better
  • Technical information is only useful to technical
    experts who dont have access to end-users
    machines

4
Goals
  • Reduce number of support calls
  • Help the user/app help itself where possible
  • Locate the correct party to contact if not
  • Reduce the time spent on support calls that do
    occur

5
Focus on what the user wants!
  • User doesnt want to have to call anyone
  • User doesnt want to get email about an outage
  • User doesnt want to go to some web site to find
    out
  • User just wants the application to work
  • If it doesnt work, user often wants to know when
    it will work
  • If application is non-interactive, application
    may want to retry as soon as it is fixed (e.g.
    search engine)
  • User also wants to know that someone is working
    on fixing the problem
  • If user policy decision needed, user wants
    choices

6
Multiple Adminstrative Entities
  • Policy Principles
  • Freedom of information Outage info should be
    available to all those affected by it.
  • Privacy Outage info should be available only to
    those affected by it.
  • Freedom of speech Any entity should be able to
    report a problem, whether or not it is trusted.
  • Conservation of effort Perform the minimum work
    needed to troubleshoot the problem.

7
Solution Framework
  • Self-diagnosing, self-healing
  • Naming identification of a problem instance
  • Message routing getting problem instance to
    capable agent
  • Methodology structured process for confirming,
    diagnosing, repairing, etc.
  • Domain-specific classes/agents
  • Self-improving
  • Learning what possible causes are the most likely
  • Learning what is normal/abnormal within a class
  • Reporting on agent behavior for improvements

8
Architecture
Application or Monitoring Tool
Client API
Engine
Protocol
Engine
Helper Class
Helper Class
Helper Class
Helper Class
Component
Component
Component
Component
9
Separation of Roles
  • Engine
  • Maintains cause-effect tree
  • Handles message routing
  • Implements core methodology
  • Helper Classes
  • Implement simple API
  • Embed knowledge about one component type
  • Generate causal hypotheses that are treated just
    like client reports

10
Example TCPIP-related Helper Classes
11
Summary
  • Its not just about one network being
    self-managing
  • We need to improve the end-user experience
  • Handling multiple administrative domains is a
    core issue
  • Structured methodology aids in problem analysis
  • Today too many things are ad hoc
  • High-level methodology should be independent of
    component-specific knowledge
  • Provides extensibility
  • Facilitates appropriate learning at both levels
Write a Comment
User Comments (0)
About PowerShow.com