Data Quality Enhancement Technology to Improve Decision Support - Efficient Decision Support Systems: Practice and Challenges from Current to Future - page 69

Information Technology Reference

In-Depth Information

algorithm was used to tune the membership function parameters. The proposed method can

be seen in Figure 1. The details of the proposed method are described in following sections.

Fig. 1 . Flowchart of Proposed method

2.1 Fuzzy C-Mean clustering

In fuzzy clustering, each point has a degree of belonging to clusters, as in fuzzy logic, rather

than belonging completely to one cluster. To manage uncertainty in getting data quality and

accurate model, a new method known as the Fuzzy C-Mean (FCM) clustering is needed.

The FCM computes the distance measured between two vectors, where each component is a

trajectory (function) instead of a real number. Thus, this Fuzzy C-mean clustering is rather

flexible, moveable, creatable, as well as able to eliminate classes and any of their

combination. From the huge number of clustering methods, the fuzzy clustering was

focused on in the methodology of the present study since the degree of membership

function on an object to the classes found provides a strong tool for the identification of

changing class structures. Thus, the Fuzzy C-Means is used in order to build an initial

classifier and to update the classifier in each cycle; nevertheless, the methodology presented

can still be extended to any other techniques which determine such degrees of membership

(e.g. probabilistic clustering, etc.) [14].

Before the FCM could be utilized, the noise was removed from the dataset due to affect on

clustering data. Based on the statistic definition:

According to [15] a noise is considered to be more than three standard deviations away from

the mean which is formulated as below:

Noises = abs (object - MeanMat) > 3*SigmaMat;

In which, Meanmat is mean, SigmaMat is Standard deviation and abs is the functioning in

mathematic for Absolute value, i.e. instance: abs (-5) =5, which was implemented in

MATLAB software.

Fuzzy c-means (FCM) is a method of clustering which allows one piece of data to belong to

two or more clusters. The method developed by [16, 17] is frequently used in pattern

recognition. It is based on minimization of the following objective function:

Next Page

Efficient Decision Support Systems: Practice and Challenges from Current to Future

Search WWH ::

Custom Search

Home