Herd Clustering: A synergistic data clustering approach using collective intelligence
Type
ArticleKAUST Department
Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) DivisionComputer Science Program
Date
2014-10Permanent link to this record
http://hdl.handle.net/10754/563771
Metadata
Show full item recordAbstract
Traditional data mining methods emphasize on analytical abilities to decipher data, assuming that data are static during a mining process. We challenge this assumption, arguing that we can improve the analysis by vitalizing data. In this paper, this principle is used to develop a new clustering algorithm. Inspired by herd behavior, the clustering method is a synergistic approach using collective intelligence called Herd Clustering (HC). The novel part is laid in its first stage where data instances are represented by moving particles. Particles attract each other locally and form clusters by themselves as shown in the case studies reported. To demonstrate its effectiveness, the performance of HC is compared to other state-of-the art clustering methods on more than thirty datasets using four performance metrics. An application for DNA motif discovery is also conducted. The results support the effectiveness of HC and thus the underlying philosophy. © 2014 Elsevier B.V.Citation
Wong, K.-C., Peng, C., Li, Y., & Chan, T.-M. (2014). Herd Clustering: A synergistic data clustering approach using collective intelligence. Applied Soft Computing, 23, 61–75. doi:10.1016/j.asoc.2014.05.034Publisher
Elsevier BVJournal
Applied Soft Computingae974a485f413a2113503eed53cd6c53
10.1016/j.asoc.2014.05.034