Advanced Analytical Theory and Methods: Text Analysis - Data Science and Big Data Analytics

Database Reference

In-Depth Information

9.8

9.9

Precision is defined as the percentage of documents in the results that are

relevant. If by entering keyword bPhone , the search engine returns 100

documents, and 70 of them are relevant, the precision of the search engine result

is 0.7%.

Recall is the percentage of returned documents among all relevant documents

in the corpus. If by entering keyword bPhone , the search engine returns 100

documents, only 70 of which are relevant while failing to return 10 additional,

relevant documents, the recall is

.

Therefore, the naïve Bayes classifier from Table 9.7 receives a recall of

and a precision of

.

Precision and recall are important concepts, whether the task is about information

retrieval of a search engine or text analysis over a finite corpus. A good classifier

ideally should achieve both precision and recall close to 1.0. In information

retrieval, a perfect precision score of 1.0 means that every result retrieved by a

search was relevant (but says nothing about whether all relevant documents were

retrieved), whereas a perfect recall score of 1.0 means that all relevant documents

were retrieved by the search (but says nothing about how many irrelevant

documents were also retrieved). Both precision and recall are therefore based on

an understanding and measure of relevance. In reality, it is difficult for a classifier

to achieve both high precision and high recall. For the example in Table 9.7 , the

naïve Bayes classifier has a high recall but a low precision. Therefore, the Data

Science team needs to check the cleanliness of the data, optimize the classifier, and

find if there are ways to improve the precision while retaining the high recall.

Classifiers determine sentiments solely based on the datasets on which they are

trained. The domain of the datasets and the characteristics of the features

determine what the knowledge classifiers can learn. For example, lightweight

is a positive feature for reviews on laptops but not necessarily for reviews on

wheelbarrows or textbooks. In addition, the training and the testing sets should

share similar traits for classifiers to perform well. For example, classifiers trained

on movie reviews generally should not be tested on tweets or blog comments.

Search WWH ::

Custom Search

Home