Conference proceeding
HACS: Heuristic Algorithm for Clustering Subsets
PROCEEDINGS OF THE SEVENTH SIAM INTERNATIONAL CONFERENCE ON DATA MINING, pp.617-622
01/01/2007
Abstract
The term consideration set is used in marketing to refer to the set of items a customer thought about purchasing before making a choice. While consideration sets are not directly observable, finding common ones is useful for market segmentation and choice prediction. We approach the problem of inducing common consideration sets as a clustering problem on the space of possible item subsets. Our algorithm combines ideas from binary clustering and itemset mining, and differs from other clustering methods by reflecting the inherent structure of subset clusters. Experiments on both real and simulated datasets show that our algorithm clusters effectively and efficiently even for sparse datasets. In addition, a novel evaluation method is developed to compare clusters found by our algorithm with known ones.
Details
- Title: Subtitle
- HACS: Heuristic Algorithm for Clustering Subsets
- Creators
- Ding Yuan - Univ Iowa, Dept Management Sci, Iowa City, IA 52242 USAW. Nick Street - University of Iowa
- Contributors
- C Apte (Editor)B Liu (Editor)S Parthasarathy (Editor)D Skillicorn (Editor)
- Resource Type
- Conference proceeding
- Publication Details
- PROCEEDINGS OF THE SEVENTH SIAM INTERNATIONAL CONFERENCE ON DATA MINING, pp.617-622
- Publisher
- Siam
- Number of pages
- 6
- Language
- English
- Date published
- 01/01/2007
- Academic Unit
- Bus Admin College; Nursing; Computer Science; Business Analytics
- Record Identifier
- 9984380515202771
Metrics
34 Record Views