Rank Aggregation Methods for the Web - PowerPoint PPT Presentation

About This Presentation

Title:

Rank Aggregation Methods for the Web

Description:

Rank Aggregation Methods for the Web CS728 Lecture 11 Web Page Ranking Methods Reviewed PageRank global link analysis Indegree local link analysis HITS- topic ... – PowerPoint PPT presentation

Number of Views:143

Avg rating:3.0/5.0

Slides: 27

Provided by: 6649922

Learn more at: https://eecs.ceas.uc.edu

Category:

more less

Transcript and Presenter's Notes

Title: Rank Aggregation Methods for the Web

1
Rank Aggregation Methods for the Web

CS728
Lecture 11

2
Web Page Ranking Methods Reviewed

PageRank global link analysis
Indegree local link analysis
HITS- topic-based link analysis
Voting NNN and Correlation
Graph distance from seed
URL length and depth
Text-based methods (e.g., tfidf)

3
Rank Aggregation
B D C A F E
Consensus ranking of all
B D C A
A B D C FE
B C D A F E
4
Notations for Ranking

Given a universe U, and ordered list t of a
subset of S of U
tx1 x2 xd , xi in S
t(i) position of rank of i
t number of elements
full list t which contains all the elements in
U
partial list rank only some of elements in U
top d list all d ranked elements are above all
unranked elements
Question when are two orderings similar? Can you
give a distance measure?

5
Measuring Distance Between Orderings

Spearmans Footrule Distance
s , t two full list.
s( i ) rank of candidate i
Kendall tau distance
Count the number of pairwise disagreements
between the two lists

6
Example of Ordered-List Distance

Example
S A,B,C,D,E
s , t two full list
Spearmans Footrule Distance
F(s , t ) 1 2 1 0 2 6
Kendall tau distance
K(s , t ) (A,C), (B.D), (B,E), (D,E) 4

7
Optimal ranking aggregation

Optimality depends on the distance measure we
use.
Optimizing with Kendall tau distance, we obtain
Kemeny optimal aggregation
Can show satisfies neutrality and consistency
important properties of rank aggregation
functions.
Useful but computationally hard. Kemeny optimal
aggregation is NP-hard.
Will show that footrule-optimal is in P.

8
Two properties relate K and F

For any full lists s,t
K(s,t) F(s,t) 2 K(s,t)
So we get a 2-approximation to Kemeny-optimality
Since, if s is the Kemeny optimal aggregation of
full lists t1 ,, tk and s optimizes the
footrule aggregation then,
K(s, t1 ,, tk ) 2 K(s, t1 ,, tk )

9

Condorcet Criteria and SPAM Filters

Condorcet Criterion
An element of S which wins every other in
pairwise simple majority voting should be ranked
first.
Extended Condorcet Criterion (XCC)
If most voters prefer candidate a to candidate b
(i.e., of i s.t. ?i(a) lt ?i(b) is at least
n/2), then also ? should prefer a to b (i.e.,
?(a) lt ?(b)).
XCC is effective in spam-fighting and thus good
to use in meta-search.

10
XCC Not always realizable
c b a
a a b
b c c
c b a
a c b
b a c
?(a) lt ?(b) lt ?(c)
Not realizable
11
Voting Theory Desired Properties

Given set of candidates and voter preferences
seek an algorithm that ranks candidates which
satisfies a set of desired properties
Which combination of properties are realizable?
1) Independence from Irrelevant Alternatives
Relative order of a and b in ? should depend
only on relative order of a and b in ?1,,?n.
Ex if ?i (a b c) changes to (a c b), relative
order of a,b in ? should not change.

12
Desired Properties

2) Neutrality
No candidate should be favored to others.
If two candidates switch positions in ?1,,?n,
they should switch positions also in ?.
3) Anonymity
No voter should be favored to others.
If two voters switch their orderings, ? should
remain the same.

13
Desired Properties

4) Monotonicity
If the ranking of a candidate is improved by a
voter, its ranking in ? can only improve.
5) Consistency
If voters are split into two disjoint sets, S
and T, and both the aggregation of voters in S
and the aggregation of voters in T prefer a to b,
then also the aggregation of all voters should
prefer a to b.

14
Desired Properties

6) No Dictatorship f(?1,,?n) ! ?I
7) Unanimity (a.k.a. Pareto optimality)
If all voters prefer candidate a to candidate b
(i.e., ?i(a) lt ?i(b) for all i), then also ?
should prefer a to b (i.e., ?(a) lt ?(b)).

15
Desired Properties

8) Democracy satisfies extended Condorcet
Criterion XCC.
Always works for m 2.
Not always realizable for m 3.
Theorem May, 1952 For m 2, Democracy is the
only rank aggregation function which is monotone,
neutral, and anonymous.

16
Arrows Impossibility Theorem Arrow, 1951

Theorem If m 3, then the only rank aggregation
function that is unanimous and independent from
irrelevant alternatives is dictatorship.
Won Nobel prize (1972)

17
Bordas method

Easy and intuitive - Several score-basedvariants
1781
Violates independence from irrelevant
alternatives

B(c)?iBi(c) Sorted in decreasing order
Bi(C8) 1 2 0
13
Bi(c)the number of candidates ranked below c in
? i
18
Partial lists

Handle partial lists by giving all the excess
scores equally among all unranked candidates,

Example Candidates number 100
Ranked candidates number 70 (score
31100) gtAssign score 31/30 to each 30 unranked
candidates
19
Footrule optimal aggregation

Footrule optimal aggregation can be computed in
polynomial time. is a good approximation of
Kemeny optimal aggregation.
Proof Via minimum cost perfect matching

20
Markov Chain method for rank aggregation.

Statescandidates
Transitions depend on the preference orders given
by voters
Basic idea probabilistically switch to a
better candidate
Rank candidates based on stationary
probabilities!

21
Markov chain advantages

Handling partial list and top d list by using
available comparisons to infer new ones
Handling uneven comparison and list length
Computation efficiency
O(NK) preprocessing,O(K) per step for
about O(N) steps

22
Four ways to build transition Matrix

Current state is candidate a.
MC1 Choose uniformly from multiset of all
candidates that were ranked at least as high as a
by some voter.
Probability to stay at a average rank
of a.
MC2 Choose a voter i uniformly at random and
pick uniformly at random from among the
candidates that the i-th voter ranked at least as
high as a.
MC3 Choose a voter i uniformly at random and
pick uniformly at random a candidate b. If i-th
voter ranked b higher than a, go to b. Otherwise,
stay in a.
MC4 Choose a candidate b uniformly at random If
most voters ranked b higher than a, go to b.
Otherwise, stay in a.
Rank of a of pairwise contests a
wins.

23
A locally Kemeny optimal aggregation is a
relaxation of Kemeny Optimality

A locally Kemeny optimal aggregation satisfies
the extended Condorcet property and can be
computed in kO(nlogn) worst case, O(n2)
Many of existing aggregation methods do not
satisfy ECC.
gtGiven t1 , ,tk use your favorite
aggregation
method to obtain a full list µ. And Apply local
kemenization to µ with respect to t1 , ,tk .

24
Local Kemenization is a procedure to get locally
Kemeny optimal aggregation.

A local Kemenization of a full list with
respect to Compute a
locally Kemeny optimal aggregation of
that is maximally consistent
with
This approach
(1) preserves the strengths of the initial
aggregation .
(2) ranks non-spam above spam.
(3) gives a result that disagrees with on
any pair ( i, j ) only if a majority of the ts
endorse this disagreement.
(4) for every d, 1 d µ , the restriction
of the output is a local Kemenization of the top
d elements of µ

25
How do we perform local kemenization?

Local Kemenization Example!

A B F E C D
B C A E F D
A C F D E B
B F D C A E
C A B F E D
B A DC E F
B
B A
A B
A B D
A B DC
A B CD
A B CF E D
disagree
AgtB 3 AltB 2
BgtD 4 BltD 1
26
Experiments meta-search
K Kendall distance
SF scaled footrule distance IF induced
footrule distance LK Local
Kemenization

Write a Comment

User Comments (0)