PANDA - PowerPoint PPT Presentation

1 / 45
About This Presentation
Title:

PANDA

Description:

Panda. Design. Performance. Introduction. Want to design a fast, ... Panda creates a persistent, efficient, scalable p2p communication space. Hypothesis ... – PowerPoint PPT presentation

Number of Views:247
Avg rating:3.0/5.0
Slides: 46
Provided by: christop89
Category:
Tags: panda | panda

less

Transcript and Presenter's Notes

Title: PANDA


1
PANDA
  • A generative communication system using a
    structured p2p overlay

2
Agenda
3
Introduction
  • Want to design a fast, scalable distributed
    system
  • Need an interface for communication
  • RPC?
  • Easy to understand, integrate
  • Distributed computing hidden in a local procedure
    calls
  • Remote calls too slow
  • Linda?
  • No hidden communication

4
Linda communicates using tuples
  • Tuple an ordered list of typed parameters
  • Tuple-space



Node
out
in
5
Implementing Linda for scalability
  • Tuples stored on a single server
  • Not scalable
  • Tuples stored on multiple servers
  • Scalable
  • How can multiple servers communicate effectively?
  • Distributed Hash Tables (DHT)

6
Issues associated with DHTs
  • Tuple storage
  • Keys based on a data hash
  • Exact searches straight forward
  • Partial searches very difficult
  • Keys predetermined by shared naming scheme
  • Loss of anonymity
  • Security
  • Decoupling requires trust among participants

7
Pastry, PAST, and Scribe combine to build
tuple-space
  • Pastry
  • Bounded message routing
  • Load-balancing
  • Good locality in a decentralized fashion
  • PAST
  • A DHT service built on top of Pastry
  • Provides content hashing and replication
    functionality
  • Scribe
  • A multicast infrastructure
  • Provides communication between nodes storing and
    requesting tuples

8
Panda liNDA on PAstry
  • Panda
  • Provides Lindas functionality as a Java library
  • Built on top of Pastry p2p overlay
  • Supports multiple non-overlapping tuple-spaces
  • Panda creates a persistent, efficient, scalable
    p2p communication space

9
Hypothesis
  • Panda will gain efficiency as the system grows

10
Agenda
11
Characteristics of the Panda tuple-space
  • Panda based on LighTS interface
  • A lightweight Linda implementation in Java
  • Tuple operations
  • Default addition and block removal
  • out(Tuple t)
  • rd(Tuple t)
  • in(Tuple t)
  • Group addition and block removal
  • outg(Group g)
  • rdg(Group g)
  • ing(Group g)

12
Tuples A formal definition
  • A tuple is an ordered sequence of typed fields
  • Can be any java implementation of the
    Serializable interface
  • A field can
  • Contain a value (actual field)
  • Be characterized by type (formal field)
  • Tuples with formals typically used as a template
    to query tuple space through pattern matching

13
Active tuples
  • Active tuple a tuple which can be executed on
    the accepting node
  • Nodes in tuple space trust each other not to be
    malicious
  • Cannot guarantee competency
  • Active tuples not implemented due to tome
    constraints

14
Agenda
15
Design objectives
  • Focus on providing the benefits of a distributed
    application
  • Maintain the generative communication
    functionality
  • Definitions
  • Tuple signature - data structure of the contents
    of the tuple
  • Tuple signature group - set of all tuples in a
    tuple space that share a tuple signature

16
Two possible naïve insertion/retrieval policies
  • Store by cID
  • Hash of the tuples content
  • Provides load balancing
  • Infeasible search overhead
  • Store by rID
  • Hash of the tuples signature
  • Not sufficient for load balancing
  • Efficient query resolution
  • ID collisions among tuples with the same signature



17
Both IDs are used
  • The rID is used to route tuple to a specific node
  • Tuple is stored in a hashtable by cID
  • All nodes of a given signature are stored on the
    same node

18
Tuple retrieval
  • For each existing node signature, there is a
    Request scribe group and an Announcement group
  • At tuple-insertion, the storing node joins (and
    listens on) the Request group, but publishes the
    arrival on the Announcement group
  • A requesting node joins (and listens on) the
    Announcement group but publishes the request on
    the Request group

19
The primary storer is the first to subscribe to
the two groups
  • Primary storer
  • Receives first tuple of given signature
  • Subscribes to request group for that signature
  • Replicates tuple
  • All secondary storers subscribe to the request
    group

Request group as primary

20
Both groups have a unique message type
  • Groups receive rIDs concatenated with a suffix
    for each of the two groups
  • A query message contains
  • The tuple pattern to be matched
  • The node ID of the query originator
  • A notice of insertion contains
  • A newly inserted tuple
  • The node ID of its primary host

Query Message Node 1000
Notice of Insertion Node 1005
21
A consumer joins the announce group before
sending a request
  • Consumer joins an announce group
  • Consumer sends out a request
  • Request multicasted
  • Nodes return a tuple to announce group
  • Announce group multicasts tuple

Announcement Group
rID.q Node 1000
Request group as primary
22
The primary storer announces the arrival of a new
tuple
  • Primary receives a tuple
  • Notifies announcement group of inserted tuples
  • Subscribers compare the new tuple and select the
    first matching tuple
  • NOTE multicasting in Scribe is only best effort

Notice of Insertion Node 1005

P
Announcement Group
23
Fault tolerance
  • Use PAST for replication
  • Keep-alive messages verify member liveness
  • Unresponsive nodes assumed dead
  • Failed nodes replaced with closest outside node
  • Internally, PAST shares the hash tables stored
    under each rID
  • Changes in node membership will affect only the
    groups sharing a given signature

P
X
24
When a primary node fails
  • If a majority of replica nodes notice a primary
    failed
  • Node with closest ID to primary
  • Assumes primary status
  • Joins the appropriate scribe groups and
    re-announces all tuples

X
P
P
25
Load balancing
  • Storage space
  • Tuples are diverted when storage is close to
    maximum capacity
  • Query processing allocation
  • Query load is partitioned among nodes able to
    serve a group of queries

26
Storage load balancing
  • Tuples are diverted when a node is close to
    capacity
  • The space of the tuple content hash is divided
  • Deterministic or dependent partitioning
  • Alternate nodes are randomly selected in a policy
    similar to PAST
  • Each partial storer joins the appropriate Scribe
    groups
  • Opted not to implement storage balancing

Hashtable(rID) FULL!!
rID
rID
rID
rID
27
Query load balancing
  • Goal divide a set of queries
  • Recall queries are routed to all nodes storing a
    matching tuple signature
  • 1st option partition the query resolution among
    all replica storers
  • Synchronization issues
  • 2nd option generate distinct groups of queries
    to be resolved by different nodes
  • Very complex
  • Opted not to implement query load balancing

P
rID.q Node 1006
Request group as primary
rID.q Node 1005
rID.q Node 1004
rID.q Node 1001
rID.q Node 1003
28
Deletions
  • PAST semantics allow for relaxed file removal
  • Avoid issues with PAST by removing the tuple
    entry in the hash table, not the hashtable itself

In
add,2,2,4
29
Tuple persistence
  • Linda guarantees tuples will persist indefinitely
  • Storage may eventually fill up
  • Tuples will be removed in LRU order
  • System must come to a consensus as a whole

out
out
out
out
out
out
30
Blocking request persistence
  • Two issues
  • Announcement message never delivered to a client
  • Refresh messages sent periodically for blocking
    queries
  • Request message failed
  • Periodic transmission of refresh message reduces
    the probability of this happening
  • Could also have timeouts on blocking reads
    (violates Linda semantics)

Notice of Insertion
X
Query Message
X
31
Security
  • Participants in a tuple space must trust each
    other to maintain decoupling between sender and
    data
  • However, they may not trust participants in other
    tuple spaces
  • Solution Allow each tuple space to provide a
    shared secret to participants for encryption and
    integrity (not implemented)

32
Agenda
33
(No Transcript)
34
Backup
35
What is Panda?
  • Panda is Linda on Pastry
  • Linda a generative communication blackboard
  • Pastry a peer to peer distributed hash table
    overlay
  • Panda creates a persistent, efficient, scalable
    p2p communication space

36
Pastry and DHT
  • Pastry is a robust, self-organizing network
    overlay
  • Nodes assigned locations in a circular namespace
    using SHA-1 hash of the nodes public key
  • Each128-bit nodeID is unique
  • Routing uses nodeIDs as keys to a Distributed
    Hash Table (DHT)

37
Pastry What a Node Knows
  • Each Pastry node has a routing table
  • log2b N rows of 2b - 1 entries
  • Each entry on a row i, 0 i match of length i with current node
  • Routing cost based on a proximity metric can be
    calculated for each entry

38
Pastry A Routing Example
0
2128 - 1
Route (d46a1c)
39
Pastry Fault Tolerance
  • Each Pastry node maintains a leaf set
  • Largest and smallest adjacent nodeIDs
  • Improved fault tolerance when data is replicated
    amongst leaf set members

a927dd
a92e1c
a941f1
a9523c
a9a231
40
The Linda Model
  • Linda uses generative communication for parallel
    programming
  • Persistent objects rather than transient messages
  • Processes dont need to know how to contact each
    other
  • Programming language independent

41
Linda Tuples
  • A Tuple is an ordered list of typed parameters
  • Tuples may contain formals (wild cards)

42
Linda Tuple Space
  • Tuples are shared via tuple space
  • out(Tuple t) places a tuple t in tuple space
  • rd(Tuple t) reads a tuple that matches t
  • in(Tuple t) reads and removes a tuple
    matching t
  • eval(Tuple t) executes a code snippet in t,
    leaving behind a result tuple

43
Panda P2P Linda
  • Panda features
  • Java API based on LighTS, an existing Java-Linda
    API
  • Distributed tuple spaces using Pastry DHT
  • Multiple non-intersecting tuple spaces
  • Shared group keys and tuple MACs

44
Panda Tuple Spaces DHT
  • Multiple, distributed tuple spaces
  • Hashing of tuples based on text keys for
    efficient inserts and deletes
  • Best effort persistence deletion policy based on
    timestamps and timeout values

45
Panda Bibliography
  • G. P. Picco. http//lights.sourceforge.net.
  • M.Castro, P. Druschel, Y. C. Hu and A Rowstron.
    Topology-aware routing in structured peer-to-peer
    overlay networks, 2002.
  • A Rowstron and P. Druschel. Pastry Scalable,
    decentralized object location and routing for
    large-scale peer-to-peer systems. In Proc.
    IFIP/ACM Middleware 2001, Heidelberg, Germany,
    Nov. 2001.
  • A. Rowstron and P. Druschel. Storage management
    and caching in PAST, a large-scale, persistent
    peer-to-peer storage utility. In Proc. ACM
    SOSP01, Banfff, Canada, Oct. 2001.
  • N. Carriero and D. Gelernter. Linda in Context.
    In Communications of the ACM, April 1989.
Write a Comment
User Comments (0)
About PowerShow.com