International Journal of Soft Computing

Year: 2009
Volume: 4
Issue: 4
Page No. 168 - 172

Impact of Normalization in Distributed K-Means Clustering

Authors : N. Karthikeyani Visalakshi and K. Thangavel

References

Bandyopadhyay, S., C. Giannella, U. Maulik, H. Kargupta, K. Liu and S. Datta, 2006. Clustering distributed data streams in peer to peer environments. Inform. Sci., 176: 1952-1985.
CrossRef  |  

De Souto, M.C.P., D.S.A. De Araujo, I.G. Costa, R. Soares, T.B. Ludermir and A. Schliep, 2008. Comparative study on normalization procedures for cluster analysis of gene expression datasets. Proceedings of the IEEE International Joint Conference Neural Networks, (IJCNN'08), China, pp: 2792-2798.

Doherty, K.A.J., R.G. Adams and N. Davey, 2007. Unsupervised learning with normalised data and non-euclidean norms. Applied Soft Comput., 7: 203-210.

Folino, G., A. Forestiero and G. Spezzano, 2006. Swarm Based Distributed Clustering, Peer to Peer Systems, Artificial Evolution, Lecture Notes in Computer Science. Springer-Verlag, USA., pp: 37-48.

Genlin, J. and L. Xiaohan, 2007. Ensemble Learning Based Distributed Clustering. In: Emerging Technology and Knowledge Discovery and Data Mining, Washio, T. et al. (Ed.)., LNCS. Springer-Verlag, USA., pp: 312-321.

Ghosh, J. and S. Merugu, 2003. Distributed clustering with limited knowledge sharing. Proceedings of the 5th International Conference Advance Pattern Recognition, (APR'03), India, pp: 48-53.

Halkidi, M., Y. Batistakis and M. Vazirgiannis, 2002. Cluster validity methods: Part I. ACM SIGMOD Record, 13: 40-45.
CrossRef  |  Direct Link  |  

Jain, A.K., M.N. Murthy and P.J. Flynn, 1999. Data clustering. A review. ACM Comput. Surveys, 31: 265-323.

Januzaj, E., P. Kriegel Hans and M. Pfeifle, 2004. DBDC: Density Based Distributed Clustering, Advances. In: Databases Technology-EDBT 2004, LNCS., Bertino, E., S. Christodoulakis and D. Plexousakis (Eds.)., Springer, Berlin/Heidelberg, pp: 529-530.

Jeong, J., B. Ryu, D. Shin and D. Shin, 2007. Integration of Distributed Biological Data using Modified K-Means Algorithm. In: Emerging Technologies in Knowledge Discovery and Data Mining, LNCS., Washio T. et al. (Eds.)., Springer, Berlin, pp: 469-475.

Jin, R., A. Goswami and G. Agarwal, 2006. Fast and exact out of core and distributed K-means clustering. Knowledge Inform. Syst., 10: 17-40.

Kim, S.Y. and T. Hamasaki, 2008. Evaluation of clustering based on preprocessing in gene expression data. Int. J. Biol. Biomed. Med. Sci., 3: 48-53.
Direct Link  |  

Merz, C.J. and P.M. Murphy, 1998. UCI repository of machine learning databases, Irvine, University of California. http://www.ics.uci.eedu/~mlearn/.

Park, B. and H. Kargupta, 2003. Distributed Data Mining, The Hand Book of Data Mining. Lawrence Erlabum Associates, Mahwah, New Jersey, USA., pp: 341-358.

Shalabi, L.A., Z. Shaaban and B. Kassabeh, 2006. Data mining a preprocessing engine. J. Comput. Sci., 2: 735-739.

Xiong, H., J. Wu and J. Chen, 2006. K-means clustering versus validation measures: A data distribution perspective. Proceedings of the 12th ACM SIGKDD International Conference Knowledge Discovery and Data Mining, (KDDM'06), USA., pp: 779-878.

Design and power by Medwell Web Development Team. © Medwell Publishing 2024 All Rights Reserved