Logo image
Clustering of Data with Missing Entries
Conference proceeding

Clustering of Data with Missing Entries

Sunrita Poddar and Mathews Jacob
2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Vol.2018-, pp.2831-2835
04/2018
DOI: 10.1109/ICASSP.2018.8462602
PMID: 33633499
url
https://arxiv.org/pdf/1801.01455View
Open Access

Abstract

The analysis of large datasets is often complicated by the presence of missing entries, mainly because most of the current machine learning algorithms are designed to work with full data. The main focus of this work is to introduce a clustering algorithm, that will provide good clustering even in the presence of missing data. The proposed technique solves an l o fusion penalty based optimization problem to recover the clusters. We theoretically analyze the conditions needed for the successful recovery of the clusters. We also propose an algorithm to solve a relaxation of this problem using saturating non-convex fusion penalties. The method is demonstrated on simulated and real datasets, and is observed to perform well in the presence of large fractions of missing entries.
non-convex penalties Upper bound Machine learning algorithms Clustering algorithms Coherence Silicon clustering Gene expression Optimization missing entries

Details

Metrics

18 Record Views
Logo image