SuperPeerBased Routing and Clustering Strategies for RDFBased PeerToPeer Networks

1 / 14

About This Presentation

Title:

SuperPeerBased Routing and Clustering Strategies for RDFBased PeerToPeer Networks

Description:

... Wolpers, Wolf Siberski, Christoph Schmitz, Mario Schlosser, Ingo Brunkhorst ... Based on this information, super-peers maintain indexes of schemas/schema ... –

Number of Views:48

Avg rating:3.0/5.0

Slides: 15

Provided by: wol71

Category:

more less

Transcript and Presenter's Notes

Title: SuperPeerBased Routing and Clustering Strategies for RDFBased PeerToPeer Networks

1
Super-Peer-Based Routing and Clustering
Strategies forRDF-Based Peer-To-Peer Networks
Alexander Löser Technische Universität Berlin,
Germany
Wolfgang Nejdl, Martin Wolpers, Wolf Siberski,
Christoph Schmitz, Mario Schlosser, Ingo
Brunkhorst Learning Lab Lower Saxony,Hannover/Kar
lsruhe, Germany
2
Overview

Introduction to Edutella
Schema-based P2P systems
Super-Peer networks
The HyperCuP topology
Indexing
Routing
Clustering
Schema Mapping
Further Work

3
PADLR Personalized Access to Digital Learning
Resources

Heterogeneous
Applications
Repositories
Platforms

4
Edutella Introduction

Main Goal Achieve interoperability between
heterogeneous metadata-driven (e-learning)
systems
Provides metadata only, not the resources
resources are fetched via http
Foundations
Semantic Web
Peer-to-Peer
Federated Databases
Open source project (http//edutella.jxta.org)
Uses other OSS JXTA Platform, Jena, JUnit, Ant
Uses Xerces, Jetty, ICU4J, XIndice, ...

5
Query Service

provides standardized query/retrieval of RDF
metadata stored in distributed RDF repositories
Query Exchange Language
Based on Datalog (allows expression of rules)
RDF syntax
For exchange only
Adapters to enable QEL query processing on
several backends
File, RDBMS, Rule Database, ...

6
Schema-Based Peer-to-Peer Networks

User-definable schemas
Structured schemas
Query language

No central control
Node autonomy
Self organization

(system list not complete)
7
Problem and Approach

Broadcasting all queries to all information
sources obviously doesnt scale
Problem How to distribute queries in a scalable
fashion?
Optimal solution distribute a query only to
peers which have results for it
Approach
Use Super-Peer network
Introduce Query Routing Indices

8
Super-Peer Networks

Observation Peers vary significantly in
availability, bandwith, processing power, etc.
Create network backbone from highly available and
powerful peers to distribute load better.

?
9
Super-Peer Topology

Super-peers are arranged as HyperCuP
Broadcast needs n-1 messages, log2(n) hops
High connectivity, resilient against node failures

10
Routing Indices

On joining the network, each peer provides
self-description
Based on this information, super-peers maintain
indexes of schemas/schema elements used at each
peer
Super-peer/peer indices
Super-peer/super-peer indices
Index Granularity
Schema
Property
Property value range
Property individual values

11
Index Sample
12
Query Routing Sample
Find any resource with dcsubjectccssw-eng and
lomcontextundergrad
13
Clustering

If peers are randomly assigned to super-peers, we
often still have to broadcast queries within the
super-peer network
Two approaches
Static super-peer administrators define
constraints which peers have to fulfill to be
accepted
Dynamic based on query statistics, peers are
continually reassigned to optimize query
distribution
Work in progress

14
Schema Mapping