Graph Model for Pattern Recognition in Text - Mining and Analyzing Social Networks

Information Technology Reference

In-Depth Information

2 Graph Model for Pattern Recognition

The overall approach of this algorithm begins with the identification of a set

of relevant keywords. Once these are selected, we then aggregate the relative

distances of the keywords with a document.Thisinturnisusedtoconstruct

a weighted directed multigraph that generates representing vectors for each

document in a high dimensional feature space. These vectors can then be

used to determine similarity values for any pair of documents.

2.1 Summary of Our Method

Step 1: Using a weighted directed multigraph to find a signature vector for

each document.

Step 2: Calculate the similarities between any two documents via their sig-

nature vectors.

Step 3: Using Quasi-Clique Merge clustering method to classify all

documents.

We will explain the details of each step by a simple example.

2.2 Details of the Step 1

To have a clear view of the algorithm, we will use the example illustrated in

Figure 1 [26] to explain the procedure.

Fig. 1 A fraudulent email

Search WWH ::

Custom Search

Home