Performance Management Framework - PowerPoint PPT Presentation

About This Presentation
Title:

Performance Management Framework

Description:

Denial of Service Attack (Day 2) 8/28/09. USBancorp. 16. What are Alerts and why do we ... polling. Event driven. Applications and Infrastructure Devices. CIC ... – PowerPoint PPT presentation

Number of Views:396
Avg rating:3.0/5.0
Slides: 31
Provided by: gregory113
Category:

less

Transcript and Presenter's Notes

Title: Performance Management Framework


1
Performance Management Framework
  • ROC Retreat
  • 6/17/2004

2
Topics
  • Overview What is it, Why have it Where has it
    been, Where does it fit?
  • Fundamentals What are the underlying principles
    and metrics?
  • Architecture Data collection (Observation,
    Analysis, Action)
  • Instrumentation How are performance
    measurements collected?
  • Analysis and Reporting How are events analyzed
    What are the reporting patterns?
  • Alerts, Leveraging ROC Concepts and Next Steps -

3
Overview - What is it?,
  • Performance Engineering and Management is
    the ability to ensure that applications will be
    designed to meet their response time and
    throughput requirements, and when in production
    continue to do so.

4
Fundamentals
Concepts and Metrics
Infrastructure
Net. Facilities
Resource
Application
Load/Usage
Users
Performance
DB
OS
Methods
Quality
Response Time
Stability
Net.Components
5
Instrumentation
  • Measurements
  • No Load (SoftProbes)
  • Intrusive vs. Non Intrusive
  • Passive vs. Active
  • Data Mining
  • Application Logs
  • System Logs

6
Architecture
  • Throughout the development lifecycle, performance
    statistics are asynchronously collected, analyzed
    and used to influence design and implementation
    decisions.

Training (Alert Rules)
7
Instrumentation
Intrusive Application Instrumentation
(Performance Framework)
  • GUI
  • Business context, user, workstation, method,
    class
  • Application rules
  • Specific method invocations
  • 3rd party/calls to external modules and/or
    systems
  • Data base access
  • Selects, inserts, update, deletes, stored
    procedures, DDL commands

8
Instrumentation
Performance Framework Class Structure
(Highlighting Alert Pattern Classes)
9
Performance Repository
  • Data Model

Event attributes Begin timestamp End
timestamp Latency (ms) Session_id
10
Performance Framework Benchmark Stats
Response time Rt_1 felt by application Rt_2
internal posting
11
Analysis and Reports
12
Analysis and Reports
Transaction Summary (10 bucket report)
13
Analysis and Reports
If you cannot see the problemyou cannot fix it!
Arrival Rates and Response Times
Arrival Rate and Concurrency
14
Analysis and Reports
Denial of Service Attack (Day 1)
15
Analysis and Reports
Denial of Service Attack (Day 2)
16
What are Alerts and why do we need them?
  • The ability for an application to assess when it
    cant perform its functions correctly or to meet
    service levels, and thenreport the failures to
    someone who cares.
  • If an application is sick and cant perform some
    or all of its functions, what is a better way
    (fast and precise) to be notified than having the
    application tell you exactly whats wrong.

You need to walk before you can run
17
Todays Application Alert Architecture (outside
looking in)
  • Device Monitors for Servers via OpenView
  • Application http/https Monitors via ISM

Device Monitoring OV
Event driven
Applications and Infrastructure Devices
CIC
Application Monitoring (ISM)
polling
18
Tomorrows Alert Architecture (inside looking
out)
  • Application Problem Determination as Presented in
    2002/2003

Device Monitoring OV
Event driven
Application Alerts via OV using SNMP traps
Applications and Infrastructure Devices
CIC
Application Monitoring (ISM)
polling
19
Tomorrows Alert Architecture (inside looking
out)
  • Application Problem Determination as suggested
    using whats in place today!

Device Monitoring OV
Event driven
Root Cause Analysis
Application Alerts via OV
Applications and Infrastructure Devices
CIC
Corrective action
Application Monitoring (ISM)
polling
20
Architecture
  • Throughout the development lifecycle, performance
    statistics are asynchronously collected, analyzed
    and used to influence design and implementation
    decisions.

Training (alert Rules)
21
Performance Frameworks Alert Functionality
  • Performance Framework has been integrated into
    the banks Application-WebSphere Framework so
    that all applications that use it are being
    monitored for response time, throughput, quality
    and stability. The Performance Framework is
    currently being re-written for the banks
    Application-.NET Framework.
  • The main purpose of the performance framework is
    to instrument applications so that performance
    related statistics can be measured and
    subsequently analyzed. As a by-product of data
    collection, real time analysis software was added
    in 2002 to identify when the target application
    is not functioning as designed or within
    performance tolerances.
  • We chose not to implement it in 2002/2003 because
    the banks implementation of the problem
    management software was not sophisticated enough
    to assist in root cause analysis prior to
    involving an operator. Raw alerts would have
    overwhelmed the problem management process at the
    CIC.

22
Performance Frameworks Functionality
  • Alert Types
  • Response Time Are transactions (method
    invocations) meeting there expected latency?
  • Stability (throughput) Is the Application
    processing transactions at the expected volume
    and throughput?
  • Quality Are the transactions (method
    invocations) error free, if not what are the
    errors?

23
Performance Frameworks Functionality
  • Alert Rule (Attributes)
  • FORMAT
  • Alarm_Name (String any unique label)
  • Alarm_Type (String Latency, error, stability)
  • Alarm_Layer (integer layer identifier)
  • Alarm_Method_Name (String Optional)
  • Alarm_Days (N1-N2 - where Sunday 1 and
    Saturday 7)
  • Alarm_Times (t1-t2 - range 1-24 hrs)
  • Sample_Size (gt1, lt15)
  • Alarm_Threshold (integer - this will be used as
    min Arrivals for Stability Alarms)
  • Alarm_Forgiveness (integer)
  • Alarm_Message (Text, white space allowed)
  • The Format will be in order (top to bottom)
    delimited with a comma
  • stab_1,stability,1,null,1-7,1-24,3,101,0.0,This
    is a stability alarm
  • err_1,error,1,null,2-6,7-20,2,0,5.0,Errors
    Greater than 5 pct

24
Performance Frameworks Alert Functionality
  • Alert Types

Architecture Layers
25
Components
  • Likely Black Box Mapping

26
Analysis Needs to Collaborate Between Components
27
What is needed to use it?
  • Real-time Alert Analysis
  • Collaboration between Rule-Types
  • Is a stability alert real, or is it the
    by-product of a latency problem?
  • Is a response time alert real, or is it the
    by-product of an error alert?
  • Are any latency or stability alerts real, or are
    they pointing in the direction of the root cause?

28
What is needed to use it?
  • Real-time Alert Analysis (continued)
  • Collaboration between components
  • Multi column applications need to isolate
    underlying infrastructure failures. When a
    subset of app columns deliver slow response time
    is the mutual failure in the network or the
    mainframe?
  • When a group of applications deliver slow
    response time while others are OK, is the mutual
    failure in the network or the mainframe, or ?

29
What is needed to use it?
  • USBank Application Framework services that
    supports Alert Analysis and Communication

Root Cause Analysis
30
Next Steps
  • Determine whether or not the Performance
    Frameworks approach is directional for problem
    determination.
  • Determine the requirements for a robust Alert
    Analysis and Recovery process.
  • Determine the role that the banks application
    framework should provide support services.
  • Determine whether to buy or build a strategic
    Analysis solution.
Write a Comment
User Comments (0)
About PowerShow.com