Criticism Mining: - PowerPoint PPT Presentation

1 / 26
About This Presentation
Title:

Criticism Mining:

Description:

Exploring Book, Movie and Music Reviews Using Text Mining Techniques ... difference between book reviews and movie reviews, especially for items in the same genre ... – PowerPoint PPT presentation

Number of Views:41
Avg rating:3.0/5.0
Slides: 27
Provided by: jdow4
Category:

less

Transcript and Presenter's Notes

Title: Criticism Mining:


1
Criticism Mining
  • Exploring Book, Movie and Music Reviews Using
    Text Mining Techniques
  • Xiao Hu, J. Stephen Downie and M. Cameron Jones
  • Graduate School of Library and Information
    Science
  • University of Ilinois at Urbana-Champaign

2
Emerging Domain
  • Many networked resources now provide critical
    consumer-generated reviews of humanities
    materials
  • Public and private
  • blogs
  • mailing lists
  • wikis
  • Online stores
  • Review websites

3
Emerging Opportunity
  • Many of these reviews are quite detailed,
    covering not only the reviewers personal
    opinions but also important background and
    contextual information about the works under
    discussion.

4
Emerging Need
  • Humanities scholars should be given the ability
    to easily gather up and then analytically examine
    these reviews to determine, for example, how
    users are impacted and influenced by humanities
    materials.

5
Addressing the Need
  • The authors have conducted a series of very
    promising large-scale experiments that bring to
    bear powerful text mining techniques to the
    problem of criticism analysis.

6
A Possible Solution to the Need
  • Our experimental results concerning the
    application of the Naïve Bayes text mining
    technique to the criticism analysis domain
    indicate that criticism mining is not only
    feasible but also worthy of further exploration
    and refinement.

7
Experimental Goals
  • Our principal experimental goal was to build and
    then evaluate a prototype criticism mining system
    that could automatically predict the
  • genre of the work being reviewed
  • quality rating assigned to the reviewed item
  • difference between book reviews and movie
    reviews, especially for items in the same genre
  • difference between fiction and non-fiction book
    reviews

8
Data Sets
  • Source epinions.com
  • Book reviews 1800
  • Movie reviews 1650
  • Music review 1800
  • Each review contains a quality rating using 1-5
    stars
  • Each review is associated with a genre

9
(No Transcript)
10
An example of a review from epinions.com
11
Genre Experiments
12
(No Transcript)
13
(No Transcript)
14
(No Transcript)
15
(No Transcript)
16
Quality Rating Experiments
  • Three levels of granularity
  • Fine 1, 2, 3, 4 and 5 stars
  • Medium 1,2 vs. 4,5 stars
  • Large 1 vs. 5 stars

17
(No Transcript)
18
(No Transcript)
19
Book vs. Movie Experiments
  • Two levels of interest
  • All genres
  • Pairing by similar genres

20
(No Transcript)
21
Fiction vs. Non-Fiction Experiments
22
(No Transcript)
23
Conclusions (1)
  • Consumer-generated reviews of humanities
    materials represent a valuable research resource
    for humanities scholars.
  • Our series of experiments on the automated
    classification of reviews verify that important
    information about the materials being reviewed
    can be found using text mining techniques.

24
Conclusions (2)
  • All our experiments were highly successful in
    terms of both classification accuracy and the
    logical placement of confusion in the confusion
    matrices.
  • Thus, the development of criticism mining
    techniques based upon the relatively simple Naïve
    Bayes model has been shown to be simultaneously
    viable and robust.
  • This finding promises to make the ever-growing
    consumer-generated review resources useful to
    humanities scholars.

25
Future Work (1)
  • A broadening of our understanding by exploring
    the application of text mining techniques beyond
    the Naïve Bayes model
  • decision trees
  • neural nets
  • support vector machines
  • We will also work towards the development of a
    system to automatically mine arbitrary bodies of
    critical review text such as blogs, mailing
    lists, and wikis.

26
Future Work (2)
  • We also hope to construct content and
    ethnographic analyses to help answer the why
    questions that pertain to the results.
  • Final comment
  • In the end, it is all about the why questions!
Write a Comment
User Comments (0)
About PowerShow.com