Title: Fine-tuning Ranking Models
1. Fine-tuning Ranking Models
- a two-step optimization approach
Vitor, Jan 29, 2008, Text Learning Meeting - CMU
With invaluable ideas from .
2. Motivation
- Rank, Rank, Rank
- Web retrieval, movie recommendation, NFL draft, etc.
- Einat's contextual search
- Richard's set expansion (SEAL)
- Andy's context-sensitive spelling correction algorithm
- Selecting seeds in Frank's political blog classification algorithm
- Ramnath's Thunderbird extension for:
- Email leak prediction
- Email recipient suggestion
3. Help your brothers!
- Try Cut Once!, our Thunderbird extension
- Works well with Gmail accounts
- It's working reasonably well
- We need feedback.
4. Thunderbird plug-in
- Leak warnings: hit "x" to remove a recipient
- Suggestions: hit to add
- Pause or cancel sending of a message
- Email recipient recommendation
- Timer: message is sent after 10 sec by default
- Classifiers/rankers written in JavaScript
5. Email Recipient Recommendation
36 Enron users
6. Email Recipient Recommendation
Threaded
Carvalho & Cohen, ECIR-08
7. Aggregating Rankings
(Aslam & Montague, 2001; Ogilvie & Callan, 2003; Macdonald & Ounis, 2006)
- Many data fusion methods
- Two types:
- Normalized scores: CombSUM, CombMNZ, etc.
- Unnormalized scores: BordaCount, Reciprocal Rank Sum, etc.
- Reciprocal Rank: the sum of the inverse of the rank of the document in each ranking.
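The Reciprocal Rank Sum idea above can be sketched in a few lines (a minimal illustration, not the plug-in's actual JavaScript implementation; the item names are made up):

```python
def reciprocal_rank_sum(rankings):
    """Aggregate ranked lists by summing 1/rank of each item.

    rankings: a list of ranked lists, best item first. Items absent
    from a ranking contribute nothing. Returns items sorted by the
    aggregated score, best first.
    """
    scores = {}
    for ranking in rankings:
        for pos, item in enumerate(ranking, start=1):
            scores[item] = scores.get(item, 0.0) + 1.0 / pos
    return sorted(scores, key=scores.get, reverse=True)

# Two rankers disagree, but "b" is near the top of both:
# b scores 1/2 + 1/1 = 1.5, a scores 1/1 + 1/3 ~ 1.33, c scores ~ 0.83.
print(reciprocal_rank_sum([["a", "b", "c"], ["b", "c", "a"]]))  # ['b', 'a', 'c']
```

Because only ranks are used, no score normalization across rankers is needed, which is why this method sits in the "unnormalized scores" family.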
8. Aggregated Ranking Results
Carvalho & Cohen, ECIR-08
9. Intelligent Email Auto-completion
TOCCBCC
CCBCC
10. Carvalho & Cohen, ECIR-08
11. Can we do better?
- Not by using other features, but by better ranking methods
- Machine learning to improve ranking: learning to rank
- Many (recent) methods:
- ListNet, Perceptrons, RankSVM, RankBoost, AdaRank, Genetic Programming, Ordinal Regression, etc.
- Mostly supervised
- Generally small training sets
- Learning-to-rank workshop at SIGIR-07 (Einat was on the PC)
12. Pairwise-based Ranking
Goal: induce a ranking function f(d) such that, for query q, the ranked documents d1, d2, d3, ..., dT satisfy f(d1) > f(d2) > ... > f(dT).
We assume a linear function f(d) = w · d.
Therefore, the constraints are: w · (di - dj) > 0 for every pair with i < j.
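The pairwise constraints just described (for a linear f(d) = w · d, each preferred pair di-above-dj yields w · (di - dj) > 0) can be checked on a toy example; the feature vectors and the weight vector below are made-up assumptions:

```python
import numpy as np

# Hypothetical feature vectors for documents ranked d1 > d2 > d3 for a query.
d = [np.array([2.0, 1.0]), np.array([1.0, 1.0]), np.array([0.0, 2.0])]

# Each preferred pair (i ranked above j) yields one linear constraint
# on the weight vector: w . (d[i] - d[j]) > 0.
pairs = [(i, j) for i in range(len(d)) for j in range(i + 1, len(d))]

w = np.array([1.0, 0.5])  # an example weight vector (an assumption)
satisfied = [bool(w @ (d[i] - d[j]) > 0) for i, j in pairs]
print(satisfied)  # this w happens to satisfy all three pairwise constraints
```

Learning a ranker then amounts to finding a w that satisfies as many of these pairwise constraints as possible on the training data.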
13. Ranking with Perceptrons
- Nice convergence properties and mistake bounds
- A bound on the number of mistakes/misranks
- Fast and scalable
- Many variants (Collins, 2002; Gao et al., 2005; Elsas et al., 2008)
- Voting, averaging, committee, pocket, etc.
- General update rule: on a misranked pair (di, dj), set w = w + (di - dj)
- Here: averaged version of the perceptron
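A minimal sketch of an averaged ranking perceptron under the update rule above (illustrative only; the preference pairs and epoch count are made-up assumptions, not the talk's setup):

```python
import numpy as np

def rank_perceptron(pairs, dim, epochs=10):
    """Averaged perceptron on pairwise ranking constraints.

    pairs: list of (x_pos, x_neg) feature vectors where x_pos should
    outrank x_neg. On a misrank (w.(x_pos - x_neg) <= 0) the update
    rule is w = w + (x_pos - x_neg). The averaged weights over all
    steps are returned, which reduces variance (Collins-style).
    """
    w = np.zeros(dim)
    w_sum = np.zeros(dim)
    n = 0
    for _ in range(epochs):
        for x_pos, x_neg in pairs:
            if w @ (x_pos - x_neg) <= 0:   # misrank: apply the update rule
                w = w + (x_pos - x_neg)
            w_sum += w                     # accumulate for averaging
            n += 1
    return w_sum / n

# Two hypothetical preference pairs (preferred vector listed first).
pairs = [(np.array([2.0, 1.0]), np.array([1.0, 1.0])),
         (np.array([1.0, 1.0]), np.array([0.0, 2.0]))]
w = rank_perceptron(pairs, dim=2)
```

After training, w should score each preferred vector above its counterpart, i.e. w · (x_pos - x_neg) > 0 for every training pair.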
14. Rank SVM
(Joachims, KDD-02; Herbrich et al., 2000)
- Equivalent to a classification SVM trained on the pairwise difference vectors (di - dj)
- Equivalent to maximizing AUC
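A sketch of the RankSVM objective as subgradient descent on the pairwise hinge loss (an illustration of the idea, not Joachims' SVMlight implementation; C, the learning rate, the epoch count, and the example pairs are all assumptions):

```python
import numpy as np

def rank_svm(pairs, dim, C=1.0, lr=0.01, epochs=200):
    """Subgradient descent on the RankSVM objective:

        0.5 * ||w||^2  +  C * sum_i max(0, 1 - w.(x_pos_i - x_neg_i))

    i.e. an L2-regularized SVM over pairwise difference vectors, each
    labeled positive (x_pos should outrank x_neg by a margin of 1).
    """
    w = np.zeros(dim)
    for _ in range(epochs):
        for x_pos, x_neg in pairs:
            diff = x_pos - x_neg
            if w @ diff < 1:          # margin violated: hinge is active
                grad = w - C * diff
            else:                     # only the regularizer contributes
                grad = w
            w = w - lr * grad
    return w

# The same two hypothetical preference pairs as before.
pairs = [(np.array([2.0, 1.0]), np.array([1.0, 1.0])),
         (np.array([1.0, 1.0]), np.array([0.0, 2.0]))]
w = rank_svm(pairs, dim=2)
```

The hinge loss is a convex upper bound on the 0/1 misrank indicator, which is what makes this formulation tractable (and what the sigmoid fine-tuning step later tightens).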
15-18. Loss Functions
(figure slides comparing loss functions) The 0/1 misrank loss is not convex.
19. Fine-tuning Ranking Models
- Step 1: train a base ranking model with a Base Ranker, e.g., RankSVM, Perceptron, etc.
- Step 2: fine-tune it into the final model with Sigmoid Rank
- Non-convex: minimizes a very close approximation of the number of misranks
20. Gradient Descent
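The two-step idea can be sketched as gradient descent on a sigmoid surrogate of the misrank count (a toy illustration; the steepness s, learning rate, starting weights, and example pair are assumptions, not the talk's actual settings):

```python
import numpy as np

def sigmoid_rank_finetune(pairs, w0, s=2.0, lr=0.1, epochs=500):
    """Fine-tune a base ranker's weights w0 by gradient descent on a
    sigmoid approximation of the number of misranks:

        L(w) = sum_pairs sigmoid(-s * w.(x_pos - x_neg))

    As s grows, each term approaches the 0/1 misrank indicator, so L
    is a smooth (but non-convex) surrogate for the misrank count.
    Starting from a good base model w0 mitigates the non-convexity.
    """
    w = np.array(w0, dtype=float)
    for _ in range(epochs):
        grad = np.zeros_like(w)
        for x_pos, x_neg in pairs:
            diff = x_pos - x_neg
            sig = 1.0 / (1.0 + np.exp(s * (w @ diff)))  # sigmoid(-s * w.diff)
            # d/dw sigmoid(-s * w.diff) = -s * sig * (1 - sig) * diff
            grad += -s * sig * (1.0 - sig) * diff
        w -= lr * grad
    return w

# Hypothetical base-ranker weights that initially misrank the pair below.
w0 = [-0.1, 0.1]
pairs = [(np.array([1.0, 0.0]), np.array([0.0, 1.0]))]
w = sigmoid_rank_finetune(pairs, w0)
```

After fine-tuning, w · (x_pos - x_neg) should be positive for the training pair, i.e. the misrank made by the base weights has been corrected.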
21. Results in CC prediction
36 Enron users
22. Set Expansion (SEAL) Results
(Wang & Cohen, ICDM-2007)
(ListNet: Cao et al., ICML-07)
23. Results in LETOR
24. Results in LETOR
25. Learning Curve: TOCCBCC, Enron user lokay-m
26. Learning Curve: CCBCC, Enron user campbel-m
27. Learning Curve: CCBCC, Enron user campbel-m
28. Regularization Parameter (s2)
Results on TREC3, TREC4, and Ohsumed.
29. Some Ideas
- Instead of the number of misranks, optimize other loss functions: Mean Average Precision, MRR, etc.
- Rank Term
- Some preliminary results with Sigmoid-MAP
- Does it work for classification?
30. Thanks