Advantages of Information Granulation in Clustering Algorithms - Agents and Artificial Intelligence

Information Technology Reference

In-Depth Information

6

Conclusions

The article presents an approach to time complexity reduction in the process of cluster-

ing data. The idea is based on preparation of point-type input data to multidimensional

granules in the form of hyperboxes. Formation of the granules maximizes information

density transferred by the hyperboxes. The experiments showed the advantage of the

presented approach: significant time reduction of granulated data clustering in compar-

ison to point-type partitioning. It is particularly visible when data contain large number

of objects. Additionally, the quality of clustering result has not deteriorated when cop-

ing with granulated data, on the contrary - in most of the cases the quality has increased.

This is connected with the generalization ability of the presented method.

In case of SOSIG algorithm, clustering process can be performed on different res-

olution of data. Clustering of hyperboxes has been executed without changing the res-

olution. A three-level structure of data has been constructed by joining original point

(third down level) in hyperboxes (second level), whereas the top level contains divid-

ing of hyperboxes into clusters. Partitioning at the top level of hyperboxes granulation

(clustering) is composed of the same number of groups as partitioning point-type data.

The quality of created clusters is also comparable due to the similar values of quality

indices are similar.

The process of hyperbox creation is a type of aggregation operation, therefore the

major benefit of the presented method is shortening the time of cluster creation in com-

parison to the processing point-type data. It is particularly effective when data contain

large number of objects. Hyperboxes also determine additional level of relationship ex-

isting within data. Finally, the description of granules is more comprehensible since the

hyperboxes contain minimal and maximal values of attributes.

Acknowledgements. The experiments have been performed on the computer cluster at

Faculty of Computer Science, Bialystok University of Technology.

This work was supported by Grant No. S/WI/5/08.

References

1. Bargiela, A., Pedrycz, W.: Classification and Clustering of Granular Data. In: IFSA World

Congress and 20th NAFIPS International Conference, vol. 3, pp. 1696-1701 (2001)

2. Bargiela, A., Pedrycz, W.: Granular Computing: an Introduction. Kluwer Academic Publish-

ers, Boston (2002)

3. Halkidi, M., Batistakis, Y.: On clustering validation techniques. Journal of Intelligent Infor-

mation Systems 17(2/3), 107-145 (2001)

4. Han, J., Kamber, M.: Data Mining: Concepts and Techniques. Morgan Kaufmann (2000)

5. Jain, A.K., Murty, M.N., Flynn, P.J.: Data clustering: a review. ACM Computing Sur-

veys 31(3), 264-323 (1999)

6. Kuzelewska, U.: Clustering with granular information processing. In: Proceedings of

ICAART Conference, vol. 1, pp. 89-97. SciTePress (2011)

7. Li, F., Xie, J., Xie, K.: Granular computing theory in the application of fault diagnosis. In:

Proceeding of Control and Decision Conference CCDC, pp. 595-597 (2008)

Search WWH ::

Custom Search

Home