Title: Subjectivity and Sentiment Analysis
1Subjectivity and Sentiment Analysis
- Jan Wiebe
- Department of Computer Science
- CERATOPS Center for the Extraction and
Summarization of Events and Opinions in Text - University of Pittsburgh
2What is Subjectivity?
- The linguistic expression of somebodys opinions,
sentiments, emotions, evaluations, beliefs,
speculations (private states) - Wow, this is my 4th Olympus camera
- Staley declared it to be one wicked song
- Most voters believe he wont raise their taxes
3One Motivation
- Automatic question answering
4Fact-Based Question Answering
- Q When is the first day of spring in 2007?
-
- Q Does the US have a tax treaty with Cuba?
5Fact-Based Question Answering
- Q When is the first day of spring in 2007?
- A March 21
- Q Does the US have a tax treaty with Cuba?
- A Thus, the U.S. has no tax treaties with
nations like Iraq and Cuba.
6Opinion Question Answering
- Q What is the international reaction to the
reelection of Robert Mugabe as President of
Zimbabwe? -
7Opinion Question Answering
- Q What is the international reaction to the
reelection of Robert Mugabe as President of
Zimbabwe? - A African observers generally approved of his
victory while Western Governments strongly
denounced it. -
-
8More Motivations
- Product review mining What features of the
ThinkPad T43 do customers like and which do they
dislike? - Review classification Is a review positive or
negative toward the movie? - Information Extraction Is killing two birds
with one stone a terrorist event? - Tracking sentiments toward topics over time Is
anger ratcheting up or cooling down? - Etcetera!
9ltppolneggtcondemnlt/ppolgt ltppolposgtgreatlt/ppol
gt ltppolneggtwickedlt/ppolgt
ltgt lt/gtltgt lt/gt
ltgt lt/gtltgt lt/gt
ltgt lt/gtltgt lt/gt
ltgt lt/gtltgt lt/gt
ltcpolposgtwicked lt/cpolgt visuals
ltcpolneggtloudly condemnedlt/cpolgt
QA Review Mining Opinion Tracking
The building has been ltsubjectivityobjgt
condemned lt/subjectivitygt
10ltppolneggtcondemnlt/ppolgt ltppolposgtgreatlt/ppol
gt ltppolneggtwickedlt/ppolgt
ltgt lt/gtltgt lt/gt
ltgt lt/gtltgt lt/gt
ltgt lt/gtltgt lt/gt
ltgt lt/gtltgt lt/gt
ltcpolposgtwicked lt/cpolgt visuals
ltcpolneggtloudly condemnedlt/cpolgt
QA Review Mining Opinion Tracking
The building has been ltsubjectivityobjgt
condemned lt/subjectivitygt
11ltppolneggtcondemnlt/ppolgt ltppolposgtgreatlt/ppol
gt ltppolneggtwickedlt/ppolgt
ltgt lt/gtltgt lt/gt
ltgt lt/gtltgt lt/gt
ltgt lt/gtltgt lt/gt
ltgt lt/gtltgt lt/gt
ltcpolposgtwicked lt/cpolgt visuals
ltcpolneggtloudly condemnedlt/cpolgt
QA Review Mining Opinion Tracking
The building has been ltsubjectivityobjgt
condemned lt/subjectivitygt
12ltppolneggtcondemnlt/ppolgt ltppolposgtgreatlt/ppol
gt ltppolneggtwickedlt/ppolgt
ltgt lt/gtltgt lt/gt
ltgt lt/gtltgt lt/gt
ltgt lt/gtltgt lt/gt
ltgt lt/gtltgt lt/gt
ltcpolposgtwicked lt/cpolgt visuals
ltcpolneggtloudly condemnedlt/cpolgt
QA Review Mining Opinion Tracking
The building has been ltsubjectivityobjgt
condemned lt/subjectivitygt
13ltppolneggtcondemnlt/ppolgt ltppolposgtgreatlt/ppol
gt ltppolneggtwickedlt/ppolgt
ltgt lt/gtltgt lt/gt
ltgt lt/gtltgt lt/gt
ltgt lt/gtltgt lt/gt
ltgt lt/gtltgt lt/gt
ltcpolposgtwicked lt/cpolgt visuals
ltcpolneggtloudly condemnedlt/cpolgt
QA Review Mining Opinion Tracking
The building has been ltsubjectivityobjgt
condemned lt/subjectivitygt
14Outline
- Subjectivity annotation scheme (MPQA)
- Learning subjective expressions from unannotated
texts - Contextual polarity of sentiment expressions
- Word sense and subjectivity
- Conclusions and pointers to related work
15ltppolneggtcondemnlt/ppolgt ltppolposgtgreatlt/ppol
gt ltppolneggtwickedlt/ppolgt
ltgt lt/gtltgt lt/gt
ltgt lt/gtltgt lt/gt
ltgt lt/gtltgt lt/gt
ltgt lt/gtltgt lt/gt
ltcpolposgtwicked lt/cpolgt visuals
ltcpolneggtloudly condemnedlt/cpolgt
QA Review Mining Opinion Tracking
The building has been ltsubjectivityobjgt
condemned lt/subjectivitygt
16Corpus Annotation
- Wiebe, Wilson, Cardie 2005
- Wilson Wiebe 2005
- Somasundaran, Wiebe, Hoffmann, Litman 2006
- Wilson 2007
17Outline for Section 1
- Overview
- Frame types
- Nested Sources
- Extensions
18Overview
- Fine-grained expression level rather than
sentence or document level - Annotate
- expressions of opinions, sentiments, emotions,
evaluations, speculations, - material attributed to a source, but presented
objectively
19Overview
- Opinions, evaluations, emotions, speculations are
private states - They are expressed in language by subjective
expressions
Private state state that is not open to
objective observation or verification Quirk,
Greenbaum, Leech, Svartvik (1985). A
Comprehensive Grammar of the English Language.
20Overview
- Focus on three ways private states are expressed
in language - Direct subjective expressions
- Expressive subjective elements
- Objective speech events
21Direct Subjective Expressions
- Direct mentions of private states
- The United States fears a spill-over from the
anti-terrorist campaign. - Private states expressed in speech events
- We foresaw electoral fraud but not daylight
robbery, Tsvangirai said.
22Expressive Subjective Elements Banfield 1982
- We foresaw electoral fraud but not daylight
robbery, Tsvangirai said - The part of the US human rights report about
China is full of absurdities and fabrications
23Objective Speech Events
- Material attributed to a source, but presented as
objective fact (without evaluation) - The government, it added, has amended the
Pakistan Citizenship Act 10 of 1951 to enable
women of Pakistani descent to claim Pakistani
nationality for their children born to foreign
husbands.
24(No Transcript)
25Nested Sources
The report is full of absurdities, Xirao-Nima
said the next day.
26Nested Sources
(Writer)
27Nested Sources
(Writer, Xirao-Nima)
28Nested Sources
(Writer Xirao-Nima)
(Writer Xirao-Nima)
29Nested Sources
(Writer)
(Writer Xirao-Nima)
(Writer Xirao-Nima)
30The report is full of absurdities, Xirao-Nima
said the next day.
Objective speech event anchor the entire
sentence source ltwritergt implicit true
Direct subjective anchor said source
ltwriter, Xirao-Nimagt intensity high
expression intensity neutral attitude type
negative target report
Expressive subjective element anchor full of
absurdities source ltwriter, Xirao-Nimagt
intensity high attitude type negative
31The report is full of absurdities, Xirao-Nima
said the next day.
Objective speech event anchor the entire
sentence source ltwritergt implicit true
Direct subjective anchor said source
ltwriter, Xirao-Nimagt intensity high
expression intensity neutral attitude type
negative target report
Expressive subjective element anchor full of
absurdities source ltwriter, Xirao-Nimagt
intensity high attitude type negative
32The report is full of absurdities, Xirao-Nima
said the next day.
Objective speech event anchor the entire
sentence source ltwritergt implicit true
Direct subjective anchor said source
ltwriter, Xirao-Nimagt intensity high
expression intensity neutral attitude type
negative target report
Expressive subjective element anchor full of
absurdities source ltwriter, Xirao-Nimagt
intensity high attitude type negative
33The report is full of absurdities, Xirao-Nima
said the next day.
Objective speech event anchor the entire
sentence source ltwritergt implicit true
Direct subjective anchor said source
ltwriter, Xirao-Nimagt intensity high
expression intensity neutral attitude type
negative target report
Expressive subjective element anchor full of
absurdities source ltwriter, Xirao-Nimagt
intensity high attitude type negative
34The report is full of absurdities, Xirao-Nima
said the next day.
Objective speech event anchor the entire
sentence source ltwritergt implicit true
Direct subjective anchor said source
ltwriter, Xirao-Nimagt intensity high
expression intensity neutral attitude type
negative target report
Expressive subjective element anchor full of
absurdities source ltwriter, Xirao-Nimagt
intensity high attitude type negative
35The report is full of absurdities, Xirao-Nima
said the next day.
Objective speech event anchor the entire
sentence source ltwritergt implicit true
Direct subjective anchor said source
ltwriter, Xirao-Nimagt intensity high
expression intensity neutral attitude type
negative target report
Expressive subjective element anchor full of
absurdities source ltwriter, Xirao-Nimagt
intensity high attitude type negative
36The US fears a spill-over, said Xirao-Nima, a
professor of foreign affairs at the Central
University for Nationalities.
37(Writer)
The US fears a spill-over, said Xirao-Nima, a
professor of foreign affairs at the Central
University for Nationalities.
38(writer, Xirao-Nima)
The US fears a spill-over, said Xirao-Nima, a
professor of foreign affairs at the Central
University for Nationalities.
39(writer, Xirao-Nima, US)
The US fears a spill-over, said Xirao-Nima, a
professor of foreign affairs at the Central
University for Nationalities.
40(Writer)
(writer, Xirao-Nima, US)
(writer, Xirao-Nima)
The US fears a spill-over, said Xirao-Nima, a
professor of foreign affairs at the Central
University for Nationalities.
41The US fears a spill-over, said Xirao-Nima, a
professor of foreign affairs at the Central
University for Nationalities.
Objective speech event anchor the entire
sentence source ltwritergt implicit true
Objective speech event anchor said source
ltwriter, Xirao-Nimagt
Direct subjective anchor fears source
ltwriter, Xirao-Nima, USgt intensity medium
expression intensity medium attitude type
negative target spill-over
42Corpus
- _at_ www.cs.pitt.edu/mpqa
- English language versions of articles from the
world press (187 news sources) - Also includes contextual polarity annotations
- Themes of the instructions
- No rules about how particular words should be
annotated. - Dont take expressions out of context and think
about what they could mean, but judge them as
they are used in that sentence.
43ExtensionsWilson 2007
44ExtensionsWilson 2007
- I think people are happy because Chavez has
fallen.
direct subjective span are happy source
ltwriter, I, Peoplegt attitude
direct subjective span think source
ltwriter, Igt attitude
inferred attitude span are happy because
Chavez has fallen type neg sentiment
intensity medium target
attitude span are happy type pos sentiment
intensity medium target
attitude span think type positive arguing
intensity medium target
target span people are happy because
Chavez has fallen
target span Chavez has fallen
target span Chavez
45Outline
- Subjectivity annotation scheme (MPQA)
- Learning subjective expressions from unannotated
texts - Contextual polarity of sentiment expressions
- Word sense and subjectivity
- Conclusions and pointers to related work
46ltppolneggtcondemnlt/ppolgt ltppolposgtgreatlt/ppol
gt ltppolneggtwickedlt/ppolgt
ltgt lt/gtltgt lt/gt
ltgt lt/gtltgt lt/gt
ltgt lt/gtltgt lt/gt
ltgt lt/gtltgt lt/gt
ltcpolposgtwicked lt/cpolgt visuals
ltcpolneggtloudly condemnedlt/cpolgt
QA Review Mining Opinion Tracking
The building has been ltsubjectivityobjgt
condemned lt/subjectivitygt
47Outline for Section 2
- Learning subjective nouns with extraction pattern
bootstrapping - Automatically generating training data with
high-precision classifiers - Learning subjective and objective expressions
(not simply words or n-grams) - Riloff, Wiebe, Wilson 2003 Riloff Wiebe 2003
Wiebe Riloff 2005 Riloff, Patwardhan, Wiebe
2006.
48Outline for Section 2
- Learning subjective nouns with extraction pattern
bootstrapping - Automatically generating training data with
high-precision classifiers - Learning subjective and objective expressions
49Information Extraction
- Information extraction (IE) systems identify
facts related to a domain of interest. - Extraction patterns are lexico-syntactic
expressions that identify the role of an object.
For example
ltsubjectgt was killed
assassinated ltdobjgt
murder of ltnpgt
50Learning Subjective Nouns
- Goal to learn subjective nouns from unannotated
text - Method applying IE-based bootstrapping
algorithms that were designed to learn semantic
categories - Hypothesis extraction patterns can identify
subjective contexts that co-occur with subjective
nouns
Example expressed ltdobjgt
concern, hope, support
51Extraction Examples
expressed ltdobjgt condolences, hope, grief,
views, worries indicative of ltnpgt compromise,
desire, thinking inject ltdobjgt vitality,
hatred reaffirmed ltdobjgt resolve, position,
commitment voiced ltdobjgt outrage, support,
skepticism, opposition, gratitude,
indignation show of ltnpgt support, strength,
goodwill, solidarity ltsubjgt was shared anxiety,
view, niceties, feeling
52Meta-Bootstrapping Riloff Jones 1999
Ex hope, grief, joy, concern, worries
Ex expressed ltDOBJgt
Best Extraction Pattern
Ex happiness, relief, condolences
Extractions (Nouns)
53Subjective Seed Words
54Subjective Noun Results
- Bootstrapping corpus 950 unannotated news
articles - We ran both bootstrapping algorithms for several
iterations - We manually reviewed the words and labeled them
as strong, weak, or not subjective - 1052 subjective nouns were learned (454 strong,
598 weak) - included in our subjectivity lexicon _at_
www.cs.pitt.edu/mpqa
55Examples of Strong Subjective Nouns
anguish exploitation pariah antagonism
evil repudiation apologist fallacies
revenge atrocities genius
rogue barbarian goodwill
sanctimonious belligerence
humiliation scum bully ill-treatment
smokescreen condemnation injustice
sympathy denunciation innuendo
tyranny devil insinuation
venom diatribe liar exaggeration
mockery
56Examples of Weak Subjective Nouns
57Outline for Section 2
- Learning subjective nouns with extraction pattern
bootstrapping - Automatically generating training data with
high-precision classifiers - Learning subjective and objective expressions
58 Training Data Creation
unlabeled texts
subjective clues
rule-based subjective sentence classifier
rule-based objective sentence classifier
subjective objective sentences
59Subjective Clues
- Subjectivity lexicon
- _at_ www.cs.pitt.edu/mpqa
- Entries from manually developed resources (e.g.,
General Inquirer, Framenet) - Entries automatically identified (E.g., nouns
just described)
60Creating High-Precision Rule-Based Classifiers
GOAL use subjectivity clues from previous
research to build a high-precision (low-recall)
rule-based classifier
- a sentence is subjective if it contains ? 2
strong subjective clues - a sentence is objective if
- it contains no strong subjective clues
- the previous and next sentence contain ? 1 strong
subjective clue - the current, previous, and next sentence together
contain ? 2 weak subjective clues
61Accuracy of Rule-Based Classifiers (measured
against MPQA annotations)
SubjRec SubjPrec SubjF Subj RBC 34.2 90.4
49.6
ObjRec ObjPrec ObjF Obj RBC 30.7 82.4 44.7
62Generated Data
- We applied the rule-based classifiers to 300,000
sentences from (unannotated) new articles - 53,000 were labeled subjective
- 48,000 were labeled objective
- training set of over 100,000 labeled sentences!
63Generated Data
- The generated data may serve as training data for
supervised learning, and initial data for self
training Wiebe Riloff 2005 - The rule-based classifiers are part of
OpinionFinder _at_ www.cs.pitt.edu/mpqa - Here, we use the data to learn new subjective
expressions
64Outline for Section 2
- Learning subjective nouns with extraction pattern
bootstrapping - Automatically generating training data with
high-precision classifiers - Learning subjective and objective expressions
65Representing Subjective Expressions with
Extraction Patterns
- EPs can represent expressions that are not fixed
word sequences
drove NP up the wall - drove him up the
wall - drove George Bush up the wall - drove
George Herbert Walker Bush up the wall
step on modifiers toes - step on her toes -
step on the mayors toes - step on the newly
elected mayors toes
gave NP a modifiers look - gave his
annoying sister a really really mean look
66The Extraction Pattern Learner
- Used AutoSlog-TS Riloff 96 to learn EPs
- AutoSlog-TS needs relevant and irrelevant texts
as input - Statistics are generated measuring each patterns
association with the relevant texts - The subjective sentences are the relevant texts,
and the objective sentences are the irrelevant
texts
67ltsubjectgt passive-vp ltsubjgt was
satisfied ltsubjectgt active-vp ltsubjgt
complained ltsubjectgt active-vp dobj ltsubjgt dealt
blow ltsubjectgt active-vp infinitive ltsubjgt
appears to be ltsubjectgt passive-vp
infinitive ltsubjgt was thought to be ltsubjectgt
auxiliary dobj ltsubjgt has position
68AutoSlog-TS(Step 1)
The World Trade Center, an icon of New York
City, was intentionally attacked very early on
September 11, 2001.
Parser
Extraction Patterns ltsubjgt was attacked icon of
ltnpgt was attacked on ltnpgt
69AutoSlog-TS (Step 2)
Extraction Patterns ltsubjgt was attacked icon of
ltnpgt was attacked on ltnpgt
70Identifying Subjective Patterns
- Subjective patterns
- Freq gt X
- Probability gt Y
- 6,400 learned on 1st iteration
- Evaluation against the MPQA corpus
- direct evaluation of performance as simple
classifiers - evaluation of classifiers using patterns as
additional features - Does the system learn new knowledge?
- Add patterns to the strong subjective set and
re-run the rule-based classifier - recall 20 while precision - 7 on 1st iteration
71Patterns with Interesting Behavior
PATTERN FREQ P(Subj Pattern) ltsubjgt asked 128
.63 ltsubjgt was asked 11 1.0
72Conclusions for Section 2
- Extraction pattern bootstrapping can learn
subjective nouns (unannotated data) - Extraction patterns can represent richer
subjective expressions - Learning methods can discover subtle distinctions
that are important for subjectivity (unannotated
data) - Ongoing work
- lexicon representation integrating different
types of entries, enabling comparisons (e.g.,
subsumption relationships) - learning and bootstrapping processes applied to
much larger unannotated corpora - processes applied to learning positive and
negative subjective expressions
73Outline
- Subjectivity annotation scheme (MPQA)
- Learning subjective expressions from unannotated
texts - Contextual polarity of sentiment expressions
(briefly) - Wilson, Wiebe, Hoffmann 2005 Wilson 2007 Wilson
Wiebe forthcoming - Word sense and subjectivity
- Conclusions and pointers to related work
74ltppolneggtcondemnlt/ppolgt ltppolposgtgreatlt/ppol
gt ltppolneggtwickedlt/ppolgt
ltgt lt/gtltgt lt/gt
ltgt lt/gtltgt lt/gt
ltgt lt/gtltgt lt/gt
ltgt lt/gtltgt lt/gt
ltcpolposgtwicked lt/cpolgt visuals
ltcpolneggtloudly condemnedlt/cpolgt
QA Review Mining Opinion Tracking
The building has been ltsubjectivityobjgt
condemned lt/subjectivitygt
75Prior Polarity versus Contextual Polarity
- Most approaches use a lexicon of positive and
negative words - Prior polarity out of context, positive or
negative - beautiful ? positive
- horrid ? negative
- A word may appear in a phrase that expresses a
different polarity in context -
- Contextual polarity
Cheers to Timothy Whitfield for the wonderfully
horrid visuals.
76Goal of This Work
- Automatically distinguish prior and contextual
polarity
77Approach
- Use machine learning and variety of features
- Achieve significant results for a large subset of
sentiment expressions
78Manual Annotations
- Subjective expressions of the MPQA corpus
annotated with contextual polarity
79Annotation Scheme
- Mark polarity of subjective expressions as
positive, negative, both, or neutral
positive
African observers generally approved of his
victory while Western governments denounced it.
negative
Besides, politicians refer to good and evil
both
Jerome says the hospital feels no different than
a hospital in the states.
neutral
80Annotation Scheme
- Judge the contextual polarity of sentiment
ultimately being conveyed - They have not succeeded, and will never succeed,
in breaking the will of this valiant people.
81Annotation Scheme
- Judge the contextual polarity of sentiment
ultimately being conveyed - They have not succeeded, and will never succeed,
in breaking the will of this valiant people.
82Annotation Scheme
- Judge the contextual polarity of sentiment
ultimately being conveyed - They have not succeeded, and will never succeed,
in breaking the will of this valiant people.
83Annotation Scheme
- Judge the contextual polarity of sentiment
ultimately being conveyed - They have not succeeded, and will never succeed,
in breaking the will of this valiant people.
84Features
- Many inspired by Polanyi Zaenen (2004)
Contextual Valence Shifters - Example little threat
- little truth
- Others capture dependency relationships between
words - Example
- wonderfully horrid
pos
mod
85- Word features
- Modification features
- Structure features
- Sentence features
- Document feature
- Word token
terrifies - Word part-of-speechVB
- Context
- that terrifies me
- Prior Polaritynegative
- Reliability
strong subjective
86- Word features
- Modification features
- Structure features
- Sentence features
- Document feature
- Binary features
- Preceded by
- adjective
- adverb (other than not)
- intensifier
- Self intensifier
- Modifies
- strong clue
- weak clue
- Modified by
- strong clue
- weak clue
Dependency Parse Tree
87- Word features
- Modification features
- Structure features
- Sentence features
- Document feature
- Binary features
- In subject
- The human rights report
- poses
- In copular
- I am confident
- In passive voice
- must be regarded
88- Word features
- Modification features
- Structure features
- Sentence features
- Document feature
- Count of strong clues in previous, current, next
sentence - Count of weak clues in previous, current, next
sentence - Counts of various parts of speech
89- Document topic (15)
- economics
- health
-
- Kyoto protocol
- presidential election in Zimbabwe
- Word features
- Modification features
- Structure features
- Sentence features
- Document feature
Example The disease can be contracted if a
person is bitten by a certain tick or if a person
comes into contact with the blood of a congo
fever sufferer.
90- Word token
- Word prior polarity
- Negated
- Negated subject
- Modifies polarity
- Modified by polarity
- Conjunction polarity
- General polarity shifter
- Negative polarity shifter
- Positive polarity shifter
- Word token
- terrifies
- Word prior polarity
- negative
91- Word token
- Word prior polarity
- Negated
- Negated subject
- Modifies polarity
- Modified by polarity
- Conjunction polarity
- General polarity shifter
- Negative polarity shifter
- Positive polarity shifter
- Binary features
- Negated
- For example
- not good
- does not look very good
- not only good but amazing
-
- Negated subject
- No politically prudent Israeli could support
either of them.
92- Word token
- Word prior polarity
- Negated
- Negated subject
- Modifies polarity
- Modified by polarity
- Conjunction polarity
- General polarity shifter
- Negative polarity shifter
- Positive polarity shifter
- Modifies polarity
- 5 values positive, negative, neutral, both, not
mod - substantial negative
- Modified by polarity
- 5 values positive, negative, neutral, both, not
mod - challenge positive
93- Word token
- Word prior polarity
- Negated
- Negated subject
- Modifies polarity
- Modified by polarity
- Conjunction polarity
- General polarity shifter
- Negative polarity shifter
- Positive polarity shifter
- Conjunction polarity
- 5 values positive, negative, neutral, both, not
mod - good negative
94- General polarity shifter
- have few risks/rewards
- Negative polarity shifter
- lack of understanding
- Positive polarity shifter
- abate the damage
- Word token
- Word prior polarity
- Negated
- Negated subject
- Modifies polarity
- Modified by polarity
- Conjunction polarity
- General polarity shifter
- Negative polarity shifter
- Positive polarity shifter
95Findings
- Statistically significant improvements can be
gained - Require combining all feature types
- Ongoing work
- richer lexicon entries
- compositional contextual processing
96S/O and Pos/Neg both Important
- Several studies have found two steps beneficial
- Yu Hatzivassiloglou 2003 Pang Lee 2004
Wilson et al. 2005 Kim Hovy 2006
97S/O and Pos/Neg both Important
- S/O can be more difficult
- Manual annotation of phrases Takamura et al. 2006
- Contextual polarity Wilson et al. 2005
- Sentiment tagging of words Andreevskaia Bergler
2006 - Sentiment tagging of word senses Esuli
Sabastiani 2006 - Later evidence that S/O more significant for
senses
98S/O and Pos/Neg both Important
- Desirable for NLP systems to find a wide range of
private states, including motivations, thoughts,
and speculations, not just positive and negative
sentiments
99S/O and Pos/Neg both Important
Sentiment Other
Subjectivity Objective
Pos Neg Both
100Outline
- Subjectivity annotation scheme (MPQA)
- Learning subjective expressions from unannotated
texts - Contextual polarity of sentiment expressions
- Word sense and subjectivity
- Wiebe Mihalcea 2006
- Conclusions and pointers to related work
101ltppolneggtcondemngtlt/ppolgt ltsubjlsubjgtcondemn
1lt/subjgt ltsubjobjgtcondemn2lt/subjgt ltsubjobjgt
condemn3lt/subjgt
ltgt lt/gtltgt lt/gt
ltgt lt/gtltgt lt/gt
ltgt lt/gtltgt lt/gt
ltgt lt/gtltgt lt/gt
ltcpolposgtwicked lt/cpolgt visuals
ltcpolneggtloudly condemnedlt/cpolgt
QA Review Mining Opinion Tracking
The building has been ltsubjectivityobjgt
condemned lt/subjectivitygt
102Introduction
- Continuing interest in word sense
- Sense annotated resources being developed for
many languages - www.globalwordnet.org
- Active participation in evaluations such as
SENSEVAL
103Word Sense and Subjectivity
- Though both are concerned with text meaning,
until recently they have been investigated
independently
104Subjectivity Labels on Senses
- Alarm, dismay, consternation (fear resulting
from the awareness of danger) - Alarm, warning device, alarm system (a
device that signals the occurrence of some
undesirable event)
105Subjectivity Labels on Senses
-
- Interest, involvement -- (a sense of concern
with and curiosity about someone or something
"an interest in music") -
-
- Interest -- (a fixed charge for borrowing
money usually a percentage of the amount
borrowed "how much interest do you pay on your
mortgage?") -
106WSD using Subjectivity Tagging
107WSD using Subjectivity Tagging
He spins a riveting plot which grabs and holds
the readers interest.
S
Sense 4 Sense 1?
Sense 4 a sense of concern with and curiosity
about someone or something S Sense 1 a fixed
charge for borrowing money O
Subjectivity Classifier
WSD System
Sense 1 Sense 4?
O
The notes do not pay interest.
108WSD using Subjectivity Tagging
He spins a riveting plot which grabs and holds
the readers interest.
S
Sense 4 Sense 1?
Sense 4 a sense of concern with and curiosity
about someone or something S Sense 1 a fixed
charge for borrowing money O
Subjectivity Classifier
WSD System
Sense 1 Sense 4?
O
The notes do not pay interest.
109Subjectivity Tagging using WSD
Subjectivity Classifier
He spins a riveting plot which grabs and holds
the readers interest.
S O?
O S?
The notes do not pay interest.
110Subjectivity Tagging using WSD
Subjectivity Classifier
He spins a riveting plot which grabs and holds
the readers interest.
S O?
Sense 4
WSD System
O S?
Sense 1
The notes do not pay interest.
111Subjectivity Tagging using WSD
Subjectivity Classifier
He spins a riveting plot which grabs and holds
the readers interest.
S O?
Sense 4
WSD System
O S?
Sense 1
The notes do not pay interest
112Refining WordNet
- Semantic Richness
- Find inconsistencies and gaps
- Verb assault attack, round, assail, last out,
snipe, assault (attack in speech or writing) The
editors of the left-leaning paper attacked the
new House Speaker - But no sense for the noun as in His verbal
assault was vicious
113Goals
- Explore interactions between word sense and
subjectivity - Can subjectivity labels be assigned to word
senses? - Manually
- Automatically
- Can subjectivity analysis improve word sense
disambiguation? - Can word sense disambiguation improve
subjectivity analysis? Current work
114Outline for Section 4
- Motivation and Goals
- Assigning Subjectivity Labels to Word Senses
- Manually
- Automatically
- Word Sense Disambiguation using Automatic
Subjectivity Analysis - Conclusions
115Annotation Scheme
- Assigning subjectivity labels to WordNet senses
- S subjective
- O objective
- B both
116Examples
- Alarm, dismay, consternation (fear
resulting form the awareness of danger) - Fear, fearfulness, fright (an emotion
experiences in anticipation of some specific pain
or danger (usually accompanied by a desire to
flee or fight)) - Alarm, warning device, alarm system (a
device that signals the occurrence of some
undesirable event) - Device (an instrumentality invented for a
particular purpose the device is small enough
to wear on your wrist a device intended to
conserve water
S
O
117Subjective Sense Definition
- When the sense is used in a text or conversation,
we expect it to express subjectivity, and we
expect the phrase/sentence containing it to be
subjective.
118Subjective Sense Examples
- His alarm grew
- Alarm, dismay, consternation (fear
resulting form the awareness of danger) - Fear, fearfulness, fright (an emotion
experiences in anticipation of some specific pain
or danger (usually accompanied by a desire to
flee or fight)) - He was boiling with anger
- Seethe, boil (be in an agitated emotional
state The customer was seething with anger) - Be (have the quality of being (copula, used
with an adjective or a predicate noun) John is
rich This is not a good answer)
119Subjective Sense Examples
- Whats the catch?
- Catch (a hidden drawback it sounds good
but whats the catch?) - Drawback (the quality of being a hindrance he
pointed out all the drawbacks to my plan) - That doctor is a quack.
- Quack (an untrained person who pretends to
be a physician and who dispenses medical advice) - Doctor, doc, physician, MD, Dr., medico
120Objective Sense Examples
- The alarm went off
- Alarm, warning device, alarm system (a
device that signals the occurrence of some
undesirable event) - Device (an instrumentality invented for a
particular purpose the device is small enough
to wear on your wrist a device intended to
conserve water - The water boiled
- Boil (come to the boiling point and change
from a liquid to vapor Water boils at 100
degrees Celsius) - Change state, turn (undergo a transformation or
a change of position or action)
121Objective Sense Examples
- He sold his catch at the market
- Catch, haul (the quantity that was caught
the catch was only 10 fish) - Indefinite quantity (an estimated quantity)
- The ducks quack was loud and brief
- Quack (the harsh sound of a duck)
- Sound (the sudden occurrence of an audible
event)
122Objective Senses Observation
- We dont necessarily expect phrases/sentences
containing objective senses to be objective - Will someone shut that darn alarm off?
- Cant you even boil water?
- Subjective, but not due to alarm and boil
123Objective Sense Definition
- When the sense is used in a text or conversation,
we dont expect it to express subjectivity and,
if the phrase/sentence containing it is
subjective, the subjectivity is due to something
else.
124Inter-Annotator Agreement Results
- Overall
- Kappa0.74
- Percent Agreement85.5
- Without the 12.3 cases when a judge is
uncertain - Kappa0.90
- Percent Agreement95.0
125S/O and Pos/Neg
- Hypothesis moving from word to word sense is
more important for S versus O than it is for
Positive versus Negative - Pilot study with the nouns of the SENSEVAL-3
English lexical sample task - ½ have both subjective and objective senses
- Only 1 has both positive and negative subjective
senses
126Outline for Section 4
- Motivation and Goals
- Assigning Subjectivity Labels to Word Senses
- Manually
- Automatically
- Word Sense Disambiguation using Automatic
Subjectivity Analysis - Conclusions
127Overview
- Main idea assess the subjectivity of a word
sense based on information about the subjectivity
of - a set of distributionally similar words
- in a corpus annotated with subjective expressions
128Preliminaries
- Suppose the goal were to assess the subjectivity
of a word w, given an annotated corpus - We could consider how often w appears in
subjective expressions - Or, we could take into account more evidence
the subjectivity of a set of distributionally
similar words
129Lins Distributional Similarity
Word R W I R1
have have R2 dog brown R3 dog
. . .
Lin 1998
130Lins Distributional Similarity
Word1
Word2
R W R W R W
R W R W R W
R W R W
R W R W
R W R W
131Subjectivity of word w
132Subjectivity of word w
Unannotated Corpus (BNC)
insts(DSW) in SE - insts(DSW)
not in SE
insts (DSW)
subj(w)
-1, 1 highly objective, highly subjective
133Subjectivity of word w
134Subjectivity of word sense wi
Rather than 1, add or subtract sim(wi,dswj)
sim(wi,dsw1)
-sim(wi,dsw1)
sim(wi,dsw2)
135Method Step 1
- Given word w
- Find distributionally similar words
- DSW dswj j 1 .. n
136Method Step 2
word w Alarm DSW1 Panic DSW2 Detector
Sense w1 fear resulting from the awareness of danger sim(w1,panic) sim(w1,detector)
Sense w2 a device that signals the occurrence of some undesirable event sim(w2,panic) sim(w2,detector)
137Method Step 2
- Find the similarity between each word sense and
each distributionally similar word
- wnss can be any concept-based similarity measure
between word senses - we use Jiang Conrath 1997
138Method Step 3
- Input word sense wi of word w
-
- DSW dswj j 1..n
-
- sim(wi,dswj)
-
- MPQA Opinion Corpus
- Output subjectivity score subj(wi)
139Method Step 3
- totalsim insts(dswj) sim(wi,dswj)
- evi_subj 0
- for each dswj in DSW
- for each instance k in insts(dswj)
- if k is in a subjective expression
- evi_subj sim(wi,dswj)
- else
- evi_subj - sim(wi,dswj)
- subj(wi) evi_subj / totalsim
140Evaluation
- Calculate subj scores for each word sense, and
sort them - While 0 is a natural candidate for division
between S and O, we perform the evaluation for
different thresholds in -1,1 - Determine the precision of the algorithm at
different points of recall
141Evaluation precision/recall curves
Number of distri-butionally similar words 160
142Outline for Section 4
- Motivation and Goals
- Assigning Subjectivity Labels to Word Senses
- Manually
- Automatically
- Word Sense Disambiguation using Automatic
Subjectivity Analysis - Conclusions
143Overview
- Augment an existing data-driven WSD system with a
feature reflecting the subjectivity of the
context of the ambiguous word - Compare the performance of original and
subjectivity-aware WSD systems - The ambiguous nouns of the SENSEVAL-3 English
Lexical Task - SENSEVAL-3 data
144Original WSD System
- Integrates local and topical features
- Local context of three words to the left and
right, their part-of-speech - Topical top five words occurring at least three
times in the context of a word sense - Ng Lee, 1996, Mihalcea, 2002
- Naïve Bayes classifier
- Lee Ng, 2003
145Subjectivity Classifier
- Rule-based automatic sentence classifier from
Wiebe Riloff 2005 - Harvests subjective and objective sentences it is
certain about from unannotated data - Part of OpinionFinder _at_ www.cs.pitt.edu/mpqa/
146Subjectivity Tagging for WSD
Sentences of the SENSEVAL-3data that contain a
target noun are tagged with O, S, or B
Subjectivity Classifier
O
S
atmosphere
Sentencek
Tags are fed as input to the Subjectivity Aware
WSD System
147WSD using Subjectivity Tagging
148WSD using Subjectivity Tagging
Hypothesis instances of subjective senses are
more likely to be in subjective sentences, so
sentence subjectivity is an informative feature
for WSD of words with both subjective and
objective senses
149Words with S and O Senses
S sense not in data
4.3 error reduction significant (p lt 0.05
paired t-test)
150Words with Only O Senses
often target
Overall 2.2 error reduction significant (p lt
0.1)
151Conclusions for Section 4
- Can subjectivity labels be assigned to word
senses? - Manually
- Good agreement Kappa0.74
- Very good when uncertain cases removed
Kappa0.90 - Automatically
- Method substantially outperforms baseline
- Showed feasibility of assigning subjectivity
labels to the fine-grained sense level of word
senses
152Conclusions for Section 4
- Can subjectivity analysis improve word sense
disambiguation? - Quality of a WSD system can be improved with
subjectivity information - Improves performance, but mainly for words with
both S and O senses - 4.3 error reduction significant (p lt 0.05)
- Performance largely remains the same or degrades
for words that dont - Once senses have been assigned subjectivity
labels, a WSD system could consult them to decide
whether to consider the subjectivity feature
153Pointers To Related Work
- Tutorial held at EUROLAN 2007 Semantics,
Opinion, and Sentiment in Text, Iasi, Romania,
August - Slides, bibliographies,
- www.cs.pitt.edu/wiebe/EUROLAN07
154Conclusions
- Subjectivity is common in language
- Recognizing it is useful in many NLP tasks
- It comes in many forms and often is
context-dependent - A wide variety of features seem to be necessary
for opinion and polarity recognition - Subjectivity may be assigned to word senses,
promising improved performance for both
subjectivity analysis and WSD - Promising as well for multi-lingual subjectivity
analysis Mihalcea, Banea, Wiebe 2007
155Acknowledgements
- CERATOPS Center for the Extraction and
Summarization of Events and Opinions in Text - Pittsburgh Paul Hoffmann, Josef Ruppenhofer,
Swapna Somasundaran, Theresa Wilson - Cornell Claire Cardie, Eric Breck, Yejin Choi,
Ves Stoyanov - Utah Ellen Riloff, Sidd Patwardhan, Bill
Phillips - UNT Rada Mihalcea, Carmen Banea
- NLP_at_Pitt Wendy Chapman, Rebecca Hwa, Diane
Litman,