Principal component analysis (PCA) is one of the most popular statistical procedures for dimension reduction. A modification of PCA, called robust principal component analysis (RPCA), has been studied to overcome the well-known shortness of PCA: sensitivity to outliers and corrupted data points. Earlier works have proved RPCA can be exactly recovered via semidenite programming. Recently, researchers have provided some provable non-convex solvers for RPCA, based on projected gradient descent or alternating projections, in full or partial observed settings. Yet, we nd the computational complexity of the recent RPCA algorithms can be improved further. We study RPCA in the fully observed setting, which is about separating a low rank matrix L and a sparse matrix S from their given sum D = L + S. In this thesis, a new non-convex algorithm, dubbed accelerated alternating projections, is introduced for solving RPCA rapidly. The proposed new algorithm signicantly improves the computational efficiency of the existing alternating projections based algorithm proposed in [1] when updating the estimate of low rank factor. The acceleration is achieved by rst projecting a matrix onto some low dimensional subspace before obtaining a new estimate of the low rank matrix via truncated singular-value decomposition. Essentially, truncated singular-value decomposition (a.k.a. the best low rank approximation) is replaced by a high-efficiency sub-optimal low rank approximation, while the convergence is retained. Exact recovery guarantee has been established, which shows linear convergence of the proposed algorithm under certain natural assumptions. Empirical performance evaluations establish the advantage of our algorithm over other state-of-the-art algorithms for RPCA. An application experiment on video background subtraction has been also established.
Accelerating truncated singular-value decomposition: a fast and provable method for robust principal component analysis
Abstract
Details
- Title: Subtitle
- Accelerating truncated singular-value decomposition: a fast and provable method for robust principal component analysis
- Creators
- HanQin Cai - University of Iowa
- Contributors
- Jianfeng Cai (Advisor)Weiyu Xu (Advisor)Palle E. T. Jorgensen (Committee Member)Tong Li (Committee Member)Yangbo Ye (Committee Member)
- Resource Type
- Dissertation
- Degree Awarded
- Doctor of Philosophy (PhD), University of Iowa
- Degree in
- Applied Mathematical and Computational Sciences
- Date degree season
- Spring 2018
- DOI
- 10.17077/etd.1jh8ig66
- Publisher
- University of Iowa
- Number of pages
- xii, 91 pages
- Copyright
- Copyright © 2018 HanQin Cai
- Language
- English
- Date submitted
- 09/05/2018
- Description illustrations
- illustrations (some color)
- Description bibliographic
- Includes bibliographical references (pages 87-91).
- Public Abstract (ETD)
Suppose we are given a set of data in form of a large matrix, and would look to extract some underlying data relations from the set. A widely applied tool for such a task is principal component analysis (PCA), which allows us to find the significant data relations that uncorrelated to each other. An example is finding the common purchase patterns from the purchase records of a group of customers, and the uncorrelation ensures that a new-found purchase pattern is not simply a combination of the existing patterns.
However, PCA has a well-known shortness: sensitivity to outliers and corrupted data points. The extracted purchase patterns maybe deviate from the truth because of one unusually large purchase. For overcome such a weakness, a modification of PCA, named robust principal component analysis (RPCA), was raised and has become a hot research topic and popular tool. RPCA appears in a wide range of applications, such as video and voice background subtraction, sparse graphs clustering, 3D reconstruction, and fault isolation. The users desire speedy and stable solver for RPCA, and employ non-convex method is one popular way to acceleration algorithm.
In this thesis, we introduce a novel non-convex algorithm, dubbed accelerated alternating projections, for solving RPCA rapidly. We also provide mathematical proofs that ensure exact recovery with a high convergence rate of the proposed algorithm under certain natural assumptions. Furthermore, numerical experiments show the advantage of our algorithm over other state-of-the-art algorithms for RPCA. Last, we establish an application experiment on video background subtraction for the proposed algorithm.
- Academic Unit
- Interdisciplinary Graduate Program in Applied Mathematical & Computational Sciences
- Record Identifier
- 9983776620402771