Extensible Scalable Monitoring for Clusters of Computers - PowerPoint PPT Presentation

1 / 15

About This Presentation

Title:

Extensible Scalable Monitoring for Clusters of Computers

Description:

Snapshot. Experience. Conclusion & Future Work. 5. Problem: ... Implementation Snapshot. 13. Experience. Configuration information should be in database ... – PowerPoint PPT presentation

Number of Views:28

Avg rating:3.0/5.0

Slides: 16

Provided by: erica180

Category:

Tags: clusters | computers | extensible | monitoring | scalable | snapshot

Transcript and Presenter's Notes

Title: Extensible Scalable Monitoring for Clusters of Computers

1
Extensible Scalable Monitoring for Clusters of
Computers

Eric Anderson
U.C. Berkeley
Summer 1997 NOW Retreat

2
Overall Problem

Monitoring a cluster of cooperating computers
Different from client-server where only servers
matter
Requires substantial information from all
machines
100s-1000s of nodes
Client-server becomes subset of this problem

3
Problems Solutions

Cluster software and hardware is constantly
evolving
Monitoring software must be extensible and
flexible
Use relational tables
Failures will occur in the cluster
Monitoring software must detect and recover from
failures
Use timestamps for weak synchronization
Scalability needed to hundreds of nodes
Need to efficiently transfer data from sources to
sinks
Use hierarchy hybrid push-pull protocol
Need to display statistics and information from
all nodes
Use statistical aggregation color,shade to
minimize info. loss

4
Overview

Details of solutions
Handling evolving software
Detecting and recovering from failures
Scaling data management
Scaling visualization
Implementation
Architecture
Programs
Snapshot
Experience
Conclusion Future Work

5
Problem Clusters Evolve

Solution Relational tables
Increases flexibility by decoupling data users
from data providers
Increases extensibility by structuring data into
independent tables
Increases extensibility by allowing additional
columns in tables without breaking old programs
Retains performance through transparent use of
indicies
Improvement over tree structures in previous
systems

6
Problem Failures Occur

Solution Use timestamps
Loss of periodic updates to timestamps allow
remote nodes to detect failures
Timestamps allow weak synchronization between
databases
Better availability during failures, simpler
recovery
Timestamps allow stale data to be eliminated
Only requires purges run every so often rather
than relying on programs to clean up after
themselves
Reasons 2 3 are useful even in normal operation

7
Problem Scalable Data Access

Solution Hierarchy efficient protocol
Hierarchy allows
Batching of data from different nodes (all data
from routers)
Specialization to particular data (all data on
processes)
Efficient protocol (Hybrid of push/pull)
Sink sends (SQL select command, interval, count )
to source
Changed data is extracted via SQL every interval
seconds and forwarded to the sink count times
Sink can cancel requests at any time
Achieves the best of pull and push protocols in
terms of wasted data transfers, freshness, and
network bandwidth

8
Problem Scalable Visualization

Solution Statistical aggregation use of shade
color to minimize information loss
Aggregate across similar variables (average load
of 10 machines) show dispersion (std. dev.) as
shade
Aggregate across variables from one node
(utilization maxdisk,network,cpu)
Both forms of aggregation at the same time
hierarchical aggregation
Use color to draw attention to special things
(nodes down) to limit visual overload

9
Implementation Architecture
10
Implementation Details

Databases are MiniSQL
Freely available with source code
Implements subset of SQL
Forwarder implements source part of hybrid
protocol
Using polling to get data from database
Joinpush implements merging part of hierarchy
Control of merge sources external to the program
Both forwarder joinpush implemented in threaded
C
Simpler implementation for blocking operations
Could be merged in with the database

11
Implementation Details, cont.

Gather implemented in perl
Simpler to add new data sources, but would like
threading
Somewhat inefficient, might re-implement in C
Javaserver implemented in perl
Easier to extend with additional aggregation
forms
Application level proxy because Java cant access
network
Javaclient implemented in Java
Allows clients to run in browser anywhere in the
world
Weak feedback to javaserver to control
information displayed

12
Implementation Snapshot
13
Experience

Configuration information should be in database
Had them in random files database collects it
together
Reset-world operation very important
Puts system in known state
Useful for default destination of statistics of
remote database
Minimizes load on monitored nodes
Potentially reduces fault tolerance
Browser user interface very useful
Limitations of Java very obnoxious

14
Conclusion

Four problems solutions important for any
cluster monitoring system
Evolution inherent in uses of clusters
Independent failures occur in all clusters
Scalability of data management needed for large
clusters
Scalability of visualization also needed for
large clusters
Implementation works, and initially useful,
further deployment needed
Experience identified problems, places for
improvements.

15
Future Work

Automatic identification of statistics relevant
to problems
Expect to be able to use Boolean disjunction
learning algorithms
Tracking of long term trends and statistical
measures
Self tuning of specialized databases based on
usage
Addition of notification, repair components
Gathering of more statistics (via SNMP for
example)
Distribution of system to external sites

Write a Comment

User Comments (0)

About PowerShow.com

Recommended Relevance Latest Highest Rated Most Viewed

Sort by:

Related More from user

CrystalGraphics Presentations

Introducing-PowerShowcom PowerPoint PPT Presentation

Introducing-PowerShowcom - Introducing-PowerShowcom (Without Music)

CrystalGraphics 3D Character Slides for PowerPoint PowerPoint PPT Presentation

CrystalGraphics 3D Character Slides for PowerPoint - CrystalGraphics 3D Character Slides for PowerPoint

Chart and Diagram Slides for PowerPoint PowerPoint PPT Presentation

Chart and Diagram Slides for PowerPoint - Beautifully designed chart and diagram s for PowerPoint with visually stunning graphics and animation effects. Our new CrystalGraphics Chart and Diagram Slides for PowerPoint is a collection of over 1000 impressively designed data-driven chart and editable diagram s guaranteed to impress any audience. They are all artistically enhanced with visually stunning color, shadow and lighting effects. Many of them are also animated. And they’re ready for you to use in your PowerPoint presentations the moment you need them. – PowerPoint PPT presentation

Related Presentations

Data Stream Mining with Extensible Markov Model PowerPoint PPT Presentation

Data Stream Mining with Extensible Markov Model - Is the process of automatically searching large volumes of ... Online web purchase log records (JcPenny data 2003) Sensor network data (Ouse, Serwent 2002) ... | PowerPoint PPT presentation | free to view

The Anatomy of the Grid Enabling Scalable Virtual Organizations PowerPoint PPT Presentation

The Anatomy of the Grid Enabling Scalable Virtual Organizations - The Anatomy of the Grid. Enabling Scalable Virtual Organizations. Ian Foster ... Civil engineers collaborate to design, execute, & analyze shake table experiments ... | PowerPoint PPT presentation | free to view

Introduction to Clusters PowerPoint PPT Presentation

Introduction to Clusters - Follow-on lectures talk more in detail about various aspects of clustering ... (SHRIMP) Scalable High-performance Really Inexpensive Multi-Processor (Princeton) ... | PowerPoint PPT presentation | free to view

Grid Monitoring and Information Services: Globus Toolkit MDS4 PowerPoint PPT Presentation

Grid Monitoring and Information Services: Globus Toolkit MDS4 - Registry or directory service. A construct (database? ... Have upgrades taken place in a timely fashion? Nov 2, 2004. 22. Inca Producers: Reporters ... | PowerPoint PPT presentation | free to view

Scalable Coordination Algorithms for Deeply Distributed Systems PowerPoint PPT Presentation

Scalable Coordination Algorithms for Deeply Distributed Systems - PC104s running diffusion interface with mote clusters using TinyDiffusion. Motes enable dense sensor deployment but can support limited in-network processing ... | PowerPoint PPT presentation | free to view

PARMON A Comprehensive Cluster Monitoring System PARMON Team Centre for Development of Advanced Computing, Bangalore, India http://www.cdacindia.com Project Leader: Rajkumar Buyya (buyya@computer.org) PowerPoint PPT Presentation

PARMON A Comprehensive Cluster Monitoring System PARMON Team Centre for Development of Advanced Computing, Bangalore, India http://www.cdacindia.com Project Leader: Rajkumar Buyya (buyya@computer.org) - PARMON Installation and its Usage. Monitoring with PARMON. PARMON Integration with other products ... Disk/Network Usage Monitoring. 21. Message Viewer (System ... | PowerPoint PPT presentation | free to view

Using OpenVMS Clusters for Disaster Tolerance PowerPoint PPT Presentation

Using OpenVMS Clusters for Disaster Tolerance - ... high probability in the face of common (mostly hardware) failures ... Very-common method. ... reports, via OPCOM messages, LAN component failures and repairs ... | PowerPoint PPT presentation | free to view

Information Modeling and Monitoring in Grid Systems PowerPoint PPT Presentation

Information Modeling and Monitoring in Grid Systems - Share resources across administrative domains (e.g., computing power, ... Each domain has a Theodolite Service that gather network service related metrics ... | PowerPoint PPT presentation | free to view

Introduction to the NPACI Rocks Clustering Toolkit: Building Manageable COTS Clusters PowerPoint PPT Presentation

Introduction to the NPACI Rocks Clustering Toolkit: Building Manageable COTS Clusters - Builders of clusters which drive very large commercial databases ... Bootable CD floppy which contains all the packages and site configuration info ... | PowerPoint PPT presentation | free to view

Scalable, Fault-tolerant Management of Grid Services: Application to Messaging Middleware PowerPoint PPT Presentation

Scalable, Fault-tolerant Management of Grid Services: Application to Messaging Middleware - Applications distributed and composed of ... Components widely dispersed and disparate in nature and access ... Under differing network / security policies ... | PowerPoint PPT presentation | free to view

Design and Implementation of a Single System Image Operating System for High Performance Computing on Clusters PowerPoint PPT Presentation

Design and Implementation of a Single System Image Operating System for High Performance Computing on Clusters - Clusters as an alternative to multiprocessor machines for high performance computing ... Configurable modular global scheduler ... | PowerPoint PPT presentation | free to view

Engineering a Scalable Placement Heuristic for DNA Probe Arrays PowerPoint PPT Presentation

Engineering a Scalable Placement Heuristic for DNA Probe Arrays - CLIP-like method: move the cluster that v belongs to */ Reset the gains of all nodes to zero ... CLIP Algorithm. v. CLIP. v. Reminiscent of CLIP (Deng et al. ... | PowerPoint PPT presentation | free to view

GridMonitor: Integration of Large Scale Facility Monitoring With MDS PowerPoint PPT Presentation

GridMonitor: Integration of Large Scale Facility Monitoring With MDS - Information Provider Provides Cache for the Newest Value From the Mysql Database ... A Sub-cluster Contains the Host With the Same Configuration ... | PowerPoint PPT presentation | free to view

Scalable%20Molecular%20Dynamics%20for%20Large%20Biomolecular%20Systems PowerPoint PPT Presentation

Scalable%20Molecular%20Dynamics%20for%20Large%20Biomolecular%20Systems - Was getting time for performance tuning runs on parallel machines. 50 ... Both include downloadable software. 53. Parallel Programming Laboratory. Funding: ... | PowerPoint PPT presentation | free to view

Embedded Networked Sensing for Environmental Monitoring: Applications and Challenges PowerPoint PPT Presentation

Embedded Networked Sensing for Environmental Monitoring: Applications and Challenges - Embedded Networked Sensing for Environmental Monitoring: Applications and Challenges | PowerPoint PPT presentation | free to view

CEPH: A SCALABLE, HIGH-PERFORMANCE DISTRIBUTED FILE SYSTEM PowerPoint PPT Presentation

CEPH: A SCALABLE, HIGH-PERFORMANCE DISTRIBUTED FILE SYSTEM - Paper highlights Yet another distributed file system using object storage devices Designed for ... distributed object storage System Architecture ... | PowerPoint PPT presentation | free to view

ChaMPIon/ProTM: A High Performance Multithreaded Portable MPI-2 Implementation for ASCI Terascale Platforms and Linux Clusters PowerPoint PPT Presentation

ChaMPIon/ProTM: A High Performance Multithreaded Portable MPI-2 Implementation for ASCI Terascale Platforms and Linux Clusters - ChaMPIon/ProTM: A High Performance Multithreaded Portable MPI-2 Implementation for ASCI Terascale Platforms and Linux Clusters Rossen Dimitrov, Anthony Skjellum ... | PowerPoint PPT presentation | free to view

Scalable Process Management and Interfaces for Clusters PowerPoint PPT Presentation

Scalable Process Management and Interfaces for Clusters - Scalable Process Management and Interfaces for Clusters Rusty Lusk representing also David Ashton, Anthony Chan, Bill Gropp, Debbie Swider, Rob Ross, Rajeev Thakur | PowerPoint PPT presentation | free to view

HaLoop: Efficient Iterative Data Processing On Large Scale Clusters PowerPoint PPT Presentation

HaLoop: Efficient Iterative Data Processing On Large Scale Clusters - HaLoop: Efficient Iterative Data Processing On Large Scale Clusters Horizon Yingyi Bu, UC Irvine Bill Howe, UW Magda Balazinska, UW Michael Ernst, UW | PowerPoint PPT presentation | free to view

GridMonitor: Integration of Large Scale Facility Monitoring With MDS PowerPoint PPT Presentation

GridMonitor: Integration of Large Scale Facility Monitoring With MDS - GridMonitor: Integration of Large Scale Facility Monitoring With MDS Richard Baker, Antonio Chan Jason Smith, Dantong Yu USATLAS/RHIC Computing Facility | PowerPoint PPT presentation | free to view

Towards a Scalable, Adaptive and Network-aware Content Distribution Network PowerPoint PPT Presentation

Towards a Scalable, Adaptive and Network-aware Content Distribution Network - ... and servers capacity constraints Self-organize replica into a scalable application-level multicast for disseminating ... 3084 paths w/ 5% improvment: ... | PowerPoint PPT presentation | free to view

An Algebraic Approach to Practical and Scalable Overlay Network Monitoring PowerPoint PPT Presentation

An Algebraic Approach to Practical and Scalable Overlay Network Monitoring - Title: Distributed Performance Measurement Infrastructure Author: yanchen Last modified by: Yan Chen Created Date: 11/29/2001 1:42:56 AM Document presentation format | PowerPoint PPT presentation | free to view

Computers for the PostPC Era PowerPoint PPT Presentation

Computers for the PostPC Era - www.cs.berkeley.edu | PowerPoint PPT presentation | free to view

CEPH: A SCALABLE, HIGH-PERFORMANCE DISTRIBUTED FILE SYSTEM PowerPoint PPT Presentation

CEPH: A SCALABLE, HIGH-PERFORMANCE DISTRIBUTED FILE SYSTEM - CEPH: A SCALABLE, HIGH-PERFORMANCE DISTRIBUTED FILE SYSTEM S. A. Weil, S. A. Brandt, E. L. Miller D. D. E. Long, C. Maltzahn U. C. Santa Cruz | PowerPoint PPT presentation | free to view

CoBase: Scalable and Extensible Cooperative Information System PowerPoint PPT Presentation

CoBase: Scalable and Extensible Cooperative Information System - CoBase: Scalable and Extensible Cooperative Information System Wesley W. Chu Computer Science Department University of California, Los Angeles http://www.cobase.cs ... | PowerPoint PPT presentation | free to view

Clustera: A data-centric approach to scalable cluster management PowerPoint PPT Presentation

Clustera: A data-centric approach to scalable cluster management - Clustera: A data-centric approach to scalable cluster management David J. DeWitt Jeff Naughton Eric Robinson Andrew Krioukov Srinath Shankar Joshua Royalty | PowerPoint PPT presentation | free to view

Directed Diffusion: A Scalable and Robust Communication Paradigm for Sensor Networks PowerPoint PPT Presentation

Directed Diffusion: A Scalable and Robust Communication Paradigm for Sensor Networks - Directed Diffusion: A Scalable and Robust Communication Paradigm for Sensor Networks | PowerPoint PPT presentation | free to view