BEST PRACTICES FOR RELIABLE CARRIER GRADE TELEPHONY - PowerPoint PPT Presentation

About This Presentation
Title:

BEST PRACTICES FOR RELIABLE CARRIER GRADE TELEPHONY

Description:

BEST PRACTICES FOR RELIABLE CARRIER GRADE TELEPHONY Alistair Cunningham, Integrics Ltd. Reliability Think people and culture, not technology. Complexity is the enemy. – PowerPoint PPT presentation

Number of Views:94
Avg rating:3.0/5.0
Slides: 9
Provided by: Alist98
Category:

less

Transcript and Presenter's Notes

Title: BEST PRACTICES FOR RELIABLE CARRIER GRADE TELEPHONY


1

BEST PRACTICES FOR RELIABLE CARRIER GRADE
TELEPHONY Alistair Cunningham, Integrics Ltd.
2
Reliability
  • Think people and culture, not technology.
  • Complexity is the enemy.
  • Discipline is the answer.
  • Management must be willing to sacrifice features.
  • Reliability for all customers is more important
    than winning one new customer.

3
Staff Responibilities
  • Assign a senior engineer as system manager.
  • System manager has ultimate responsibility for
    whole system.
  • Can delegate tasks to others.

4
Cluster Architecture
  • Duplicate all important functions. Use heartbeat,
    DRBD/GFS, application level load balancing.
    Remember utilities.
  • Consistency between machines is vital.
  • Virtual machines have more outages.
  • Monitor all machines, services, and resources.
  • Daily and monthly backups.

5
Upgrades and Changes
  • Risk is unpredicable and cumulative.
  • Many small changes are riskier than a few large
    changes.
  • Test all changes on a staging machine first.
  • Keep records of changes.
  • Consider change management system.
  • Keep customizations to a minimum.

6
Dealing with Vendors
  • Vendors can never substitute for system manager.
  • Give vendors access to staging machines but not
    production.
  • Your staff must have debugging skills.
  • Subscribe to security mailing lists.

7
Causes of Outages
  • Most outages are caused by one of
  • Untested changes use staging.
  • Hard disks filling up use monitoring.
  • Power and network outages redundancy or split
    cluster.
  • Avoiding these three is usually sufficient to
    achieve good reliability.

8
Questions?
Write a Comment
User Comments (0)
About PowerShow.com