On%20the%20Evaluation%20of%20Semantic%20Web%20Service%20Matchmaking%20Systems - PowerPoint PPT Presentation

About This Presentation

Title:

On%20the%20Evaluation%20of%20Semantic%20Web%20Service%20Matchmaking%20Systems

Description:

Matching service requests and advertisements, based on ... Matchmaking engine: OWLS-MX Matcher. Used only logic-based matching algorithms. Threshold = FAIL ... – PowerPoint PPT presentation

Number of Views:39

Avg rating:3.0/5.0

Slides: 21

Provided by: vassileio

Category:

more less

Transcript and Presenter's Notes

Title: On%20the%20Evaluation%20of%20Semantic%20Web%20Service%20Matchmaking%20Systems

1
On the Evaluation of Semantic Web Service
Matchmaking Systems

Vassileios Tsetsos, Christos Anagnostopoulos and
Stathes Hadjiefthymiades
Pervasive Computing Research Group
Communication Networks Laboratory
Department of Informatics and Telecommunications
University of Athens Greece
ECOWS 06 _at_ Zurich

2
Outline

Introduction
Problem Statement
A Generalized Fuzzy Evaluation Scheme for Service
Retrieval
Experimental Results
A Pragmatic View
Conclusions

3
SWS Matchmaking

Matching service requests and advertisements,
based on their semantic annotations (expressed
through ontologies)
Numerous matchmaking approaches
Logic-, similarity-, structure-based (graph
matching)
Various matched entities
functional service parameters (e.g., IOPE
attributes)
Non-functional parameters (e.g., QoS attributes)
Ultimate goal More effective service discovery,
based on semantics and not just on syntax of
service descriptions

4
Degree of Match

A value that expresses how similar two entities
are, with respect to some similarity metric(s)
Important feature of almost all SWS matchmaking
approaches
Allows for ranking of discovered services
Example DoM set exact, plugin, subsumes,
subsumed-by, fail

5
Evaluation Basics

Most works evaluate the performance of SWS
Discovery (i.e., response times, scalability)
Limited contributions to the evaluation of
retrieval effectiveness (i.e., the ability to
discover relevant services)

Q possible service requests S advertisements of
published services e QxS?W (DoM, analogous to
Retrieval Status Value in IR) r QxS?W (expert
mappings) Evaluation is the determination of how
closely vector e approximates vector r
6
Evaluation Schemes

W is the set of values denoting DoM (for e) or
degree of relevance (for r)
W defines different evaluation schemes (EVS)

Evaluation Scheme RSVs e(R,Si) Expert Mappings r(R,Si)
EVS1 Boolean Boolean
EVS2 Multi-valued Multi-valued
7
Boolean Evaluation (EVS1)

W0,1
Information Retrieval (IR) measures can be used
Precision (PB) and Recall (RB)

RT set of retrieved advertisements RL set of
relevant advertisements
8
Problem Statement (1/2)

Since, SWS matchmaking systems have multi-valued
vectors e, application of Boolean evaluation
implies the introduction of a relevance threshold

Problem 1 This Booleanization process filters
out any service semantics captured through DoM
Problem 2 An optimal threshold value is hard to
find

9
Problem Statement (2/2)

Problem 3 Boolean expert mappings are too
coarse-grained and do not always reflect the
intention of the domain expert.
Experiment
Manually defined multi-valued mappings between 6
requests and 135 advertisements of TC2 with W0,
0.25, 0.5, 0.75, 1
Calculation of deviation from existing Boolean
mappings

Only 33 of the Boolean mappings agree with the
multi-valued ones
40 of the Boolean mappings are not even close
to the multi-valued ones (deviation gt 0.25)

10
A Generalized Fuzzy Evaluation Scheme

Such scheme (EVS2) can provide solutions to the
aforementioned problems
Main design decisions
Expert mappings are fuzzy linguistic terms
DoM are fuzzy sets
Boolean measures are substituted by generalized
ones
Why fuzzy modeling?
Relevance is an amorphic concept (L. Zadeh).
I.e., its complexity prevents its mathematical
definition
Numeric values have vague semantics
Fuzzy linguistic variables assume values from a
linguistic term set, with each term being a fuzzy
variable set
Warning Fuzziness does not refer to the
matchmaking process per se

11
Fuzzification of e and r
fr QxS?0,1
fe QxS?0,1
If there is not one-to-one correspondence between
the number of fuzzy variables in each set, fuzzy
modifiers could be used (e.g., dilutions,
concentrators)
12
Generalized Evaluation Measures

Based on Buell and Kraft, Performance
measurement in a fuzzy retrieval system, 1981
the following measures are defined

The cardinalities of the sets RT and RL are
transformed to fuzzy set cardinalities, since the
above sets are fuzzy.
Note the evaluation measures take into account
all services Si

13
Experimental Results (1/3)

Manual assessment of fuzzy relevance in the
Education subset of TC v2
Matchmaking engine OWLS-MX Matcher
Used only logic-based matching algorithms
Threshold FAIL

EVS1 EVS1 EVS2 EVS2
Query ID RB PB RG PG
Q15 77 77 77 77
Q16 60 92 87 96
Q17 57 92 77 89
Q18 73 92 90 88
Q19 100 65 100 71
Q20 80 71 95 72
Difference between RG and RB is due to
considerable deviation between Boolean and fuzzy
expert mappings
14
Experimental Results (2/3)

Sensitivity of the proposed scheme

Actual case Hypothetical case
S1 somewhat relevant/FAIL (RG87) S1 very relevant/FAIL (RG84, all other unchanged)
S2 irrelevant/SUBSUMES (PG96) S2 irrelevant/EXACT (PG93, all other unchanged)

Only the generalized measures, are affected by
stronger false negatives/positives

15
Experimental Results (3/3)

Similar overall behavior but better
accuracy/sensitivity as already shown

16
A Pragmatic View

A reasonable assumption
experts are not willing to provide more than
Boolean mappings
Automatic fuzzification of Boolean expert
mappings would be valuable

17
A First Approach

Services are represented as concepts and form a
service profile ontology
Then an inference matrix is used for adjusting
the Boolean r values

Logic relation Eq DSup DSub Sib No
Boolean Value 1 1 1 1 1
Inferred Fuzzy Value V R R R SW

Logic relation Eq DSup DSub Sib No
Boolean Value 0 0 0 0 0
Inferred Fuzzy Value SW S S I I
18
Experimental Results