Journal article
An integrated K-means – Laplacian cluster ensemble approach for document datasets
Neurocomputing (Amsterdam), Vol.214, pp.495-507
11/19/2016
DOI: 10.1016/j.neucom.2016.06.034
Abstract
Cluster ensemble has become an important extension to traditional clustering algorithms, yet the cluster ensemble problem is very challenging due to the inherent difficulty in resolving the label correspondence problem. We adapted the integrated K-means – Laplacian clustering approach to solve the cluster ensemble problem by exploiting both the attribute information embedded in the cluster labels and the pairwise relations among the objects. The optimal solution of the proposed approach requires computing the pseudo inverse of the normalized Laplacian matrix and the eigenvalue decomposition of a large matrix, which can be computationally burdensome for large scale document datasets. We devised an effective algebraic transformation method for efficiently carrying out the aforementioned computations and proposed an integrated K-means – Laplacian cluster ensemble approach (IKLCEA). Experimental results with benchmark document datasets demonstrate that IKLCEA outperforms other cluster ensemble techniques on most cases. In addition, IKLCEA is computationally efficient and can be readily employed in large scale document applications.
Details
- Title: Subtitle
- An integrated K-means – Laplacian cluster ensemble approach for document datasets
- Creators
- Sen Xu - University of IowaKung-Sik Chan - University of IowaJun Gao - Yancheng Institute of TechnologyXiufang Xu - Yancheng Institute of TechnologyXianfeng Li - Yancheng Institute of TechnologyXiaopeng Hua - Yancheng Institute of TechnologyJing An - Yancheng Institute of Technology
- Resource Type
- Journal article
- Publication Details
- Neurocomputing (Amsterdam), Vol.214, pp.495-507
- Publisher
- Elsevier B.V
- DOI
- 10.1016/j.neucom.2016.06.034
- ISSN
- 0925-2312
- eISSN
- 1872-8286
- Grant note
- DOI: 10.13039/100000002, name: National Institutes of Health, award: U01 HL114494, NIHRO1HL089897; DOI: 10.13039/501100001809, name: National Natural Science Foundation of China, award: 61105057, 61375001; DOI: 10.13039/501100004608, name: Natural Science Foundation of Jiangsu Province, award: BK20151299; name: Jiangsu Province of China, award: BY2014108-20, BY2015057-33; name: Nature Science Foundation of the Jiangsu Higher Education Institutes of China, award: 13KJB520024; DOI: 10.13039/501100004610, name: Science and Technology Support Program of Jiangsu Province, award: BE2014679; name: Yancheng Institute of Technology, award: XKR2011019
- Language
- English
- Date published
- 11/19/2016
- Academic Unit
- Statistics and Actuarial Science; Radiology
- Record Identifier
- 9984257743602771
Metrics
2 Record Views