Conference proceeding
Combining link and content for community detection: a discriminative approach
Proceedings of the 15th ACM SIGKDD international conference on knowledge discovery and data mining, pp.927-936
KDD '09
06/28/2009
DOI: 10.1145/1557019.1557120
Abstract
In this paper, we consider the problem of combining link and content analysis for community detection from networked data, such as paper citation networks and Word Wide Web. Most existing approaches combine link and content information by a generative model that generates both links and contents via a shared set of community memberships. These generative models have some shortcomings in that they failed to consider additional factors that could affect the community memberships and isolate the contents that are irrelevant to community memberships. To explicitly address these shortcomings, we propose a discriminative model for combining the link and content analysis for community detection. First, we propose a conditional model for link analysis and in the model, we introduce hidden variables to explicitly model the popularity of nodes. Second, to alleviate the impact of irrelevant content attributes, we develop a discriminative model for content analysis. These two models are unified seamlessly via the community memberships. We present efficient algorithms to solve the related optimization problems based on bound optimization and alternating projection. Extensive experiments with benchmark data sets show that the proposed framework significantly outperforms the state-of-the-art approaches for combining link and content analysis for community detection.
Details
- Title: Subtitle
- Combining link and content for community detection: a discriminative approach
- Creators
- Tianbao Yang - Michigan State UniversityRong Jin - Michigan State UniversityYun Chi - NEC Laboratories America, Cupertino, CA, USAShenghuo Zhu - NEC (China)
- Resource Type
- Conference proceeding
- Publication Details
- Proceedings of the 15th ACM SIGKDD international conference on knowledge discovery and data mining, pp.927-936
- Series
- KDD '09
- DOI
- 10.1145/1557019.1557120
- Publisher
- ACM
- Language
- English
- Date published
- 06/28/2009
- Academic Unit
- Computer Science
- Record Identifier
- 9984259411302771
Metrics
13 Record Views