RON: Resilient Overlay Networks - PowerPoint PPT Presentation

About This Presentation
Title:

RON: Resilient Overlay Networks

Description:

... aggressive aggregation and information hiding. Commercial reality via peering & transit ... Inability to detect badly performing paths. Inability to ... – PowerPoint PPT presentation

Number of Views:34
Avg rating:3.0/5.0
Slides: 15
Provided by: haribala8
Learn more at: http://nms.csail.mit.edu
Category:

less

Transcript and Presenter's Notes

Title: RON: Resilient Overlay Networks


1
RON Resilient Overlay Networks
  • David Andersen, Hari Balakrishnan,
  • Frans Kaashoek, Robert Morris
  • MIT Laboratory for Computer Science
  • http//nms.lcs.mit.edu/ron/

2
Fault-tolerant Networking
B
A
C
D
Any-to-any communication, routing around failures
3
The Internet
Mom-and-popISP
Really-big ISP everyones afraid of
Big ISP
Autonomous System (AS)
Peering
BGP4
Scalability via aggressive aggregation and
information hiding Commercial reality via peering
transit relationships
4
How Robust is Internet Routing?
Paxson 95-97 3.3 of all routes had serious problems
Labovitz 97-00 10 of routes available lt 95 of the time 65 of routes available lt 99.9 of the time 3-min minimum detectionrecovery time often 15 mins 40 of outages took 30 mins to repair
Chandra 01 5 of faults last more than 2.75 hours
  1. Slow outage detection and recovery
  2. Inability to detect badly performing paths
  3. Inability to efficiently leverage redundant paths
  4. Inability to perform application-specific routing
  5. Inability to express sophisticated routing policy

5
Our Goal
  • To improve communication availability for small
    groups by at least a factor or 10
  • Many applications
  • Collaboration and conferencing
  • Virtual Private Networks (VPNs) across public
    Internet
  • Overlay Internet Service

6
RON Routing Using Overlays
  • Cooperating end-systems in different routing
    domains can conspire to do better than scalable
    wide-area protocols

Scalable BGP-based IP routing substrate
  • Types of failures
  • Outages Configuration/operational errors,
    backhoes, etc.
  • Performance failures Severe congestion,
    denial-of-service attacks, etc.

7
RON Design
Nodes in different routing domains (ASes)
RON library
Performance Database
Application-specific routing tables Policy
routing module
8
Many Research Questions
  • Does the RON approach work at all?
  • Each RON is small in size, no more than 50 or 100
    nodes
  • How fast can failure detection recovery happen?
  • Policy routing
  • Doesnt RON violate AUPs and other policies?
  • Routing behavior
  • Can stable routing be achieved?
  • Implementing efficient multi-criteria routing
  • Is it safe to deploy a large number of (small)
    interacting RONs on the Internet?

9
RON Deployment (19 sites)
To vu.nl lulea.se ucl.uk
To kaist.kr, .ve
.com (ca), .com (ca), dsl (or), cci (ut), aros
(ut), utah.edu, .com (tx) cmu (pa), dsl (nc), nyu
, cornell, cable (ma), cisco (ma), mit, vu.nl,
lulea.se, ucl.uk, kaist.kr, univ-in-venezuela
10
RON Experiments
  • Measure loss, latency, and throughput with and
    without RON
  • 13 hosts in the US and Europe
  • 3 days of measurements from data collected in
    March 2001
  • 30-minute average loss rates
  • A 30 minute outage is very serious!
  • Note Experiments done with No-Internet2-for-comm
    ercial-use policy

11
RON greatly improves loss-rate
30-min average loss rate on Internet
RON loss rate never more than 30
13,000 samples
30-min average loss rate with RON
12
An order-of-magnitude fewer failures
30-minute average loss rates
Loss Rate RON Better No Change RON Worse
10 479 57 47
20 127 4 15
30 32 0 0
50 20 0 0
80 14 0 0
100 10 0 0
6,825 path hours represented here 12 path
hours of essentially complete outage 76 path
hours of TCP outage RON routed around all of
these! One indirection hop provides almost all
the benefit!
13
Resilience Against DoS Attacks
14
Conclusion
  • Improved availability of Internet communication
    paths using small overlays
  • Layered above scalable IP substrate
  • RON provides a set of libraries and programs to
    facilitate this application-specific routing
  • Experimental data suggest that this approach
    works
  • Over 10X availability
  • Outage detection and recovery in about 15 seconds
  • Able to route around certain denial-of-service
    attacks
  • Many interesting questions remain

http//nms.lcs.mit.edu/ron/
15
Policy Routing
  • Today, wide-area policy expression is a
    sledgehammer
  • Policy control is important
  • From talking to some providers
  • E.g., rate control policy Internet2, etc.
  • True, RONs could violate AUPs
  • But, the RON approach enables more flexible
    policies
  • More complex routing decisions rate-based too
  • Multiple routing tables
  • Deeper packet inspection, etc.

16
Example
17
Throughput Improvement
Write a Comment
User Comments (0)
About PowerShow.com