Information Technology Reference
In-Depth Information
This chapter introduces opine, an unsupervised information extraction
system that embodies a solution to each of the above subtasks. Given a partic-
ular product and a corresponding set of reviews, opine outputs a set of product
features , accompanied by a list of associated opinions , which are ranked based
on strength.
Our contributions are as follows:
1. We describe opine's novel use of a relaxation labeling method to find
the semantic orientation of words in the context of given product features and
sentences.
2. We compare opine with the review mining system of Hu and Liu [2]
and find that opine's precision on the feature extraction task is 22% higher
than that of Hu and Liu, although its recall is 3% lower. We show that 1/3
of opine's increase in precision comes from the use of its feature assessment
mechanism on review data while the rest is due to Web statistics.
3. While many other systems have used extracted opinion phrases in order
to determine the polarity of sentences or documents, opine reports its preci-
sion and recall on the tasks of opinion phrase extraction and opinion phrase
polarity extraction in the context of known product features and sentences.
On the first task, opine has a precision of 79% and a recall of 76%. On the
second task, opine has a precision of 86% and a recall of 89%.
4. Finally, opine ranks the opinion phrases corresponding to a particular
property based on their strength and obtains an accuracy of 73%.
The remainder of this chapter is organized as follows: Section 2.2 intro-
duces the basic terminology; Section 2.3 gives an overview of opine,and
describes and evaluates its main components; Section 2.4 describes related
work; and Section 2.5 describes our conclusions and future work.
2.2 Terminology
A product class ( e.g. , Scanner) is a set of products ( e.g. , Epson1200). opine
extracts the following types of product features : properties , parts , features of
product parts , related concepts , parts and properties of related concepts (see
Table 2.1 in subsection 2.3.2 for examples in the Scanner domain). Related
concepts are concepts relevant to the customers' experience with the main
product ( e.g. , the company that manufactures a scanner). The relationships
between the main product and related concepts are typically expressed as
verbs ( e.g. , “the company manufactures scanners”) or prepositions (“scanners
from Epson”). Features can be explicit (“good scan quality ”) or implicit
(“good scans” implies good ScanQuality ).
opine also extracts opinion phrases , which are adjective, noun, verb or
adverb phrases representing customer opinions. Opinions can be positive or
negative and vary in strength ( e.g. , “fantastic” is stronger than “good”).
2.3 opine Overview
This section gives an overview of opine (see Figure 2.1) and describes its
components and their experimental evaluation.
Search WWH ::




Custom Search