Algorithms for sparsity constrained principal component analysis

Aktaş, Fatih Selim

buir.advisor	Pınar, Mustafa Çelebi
dc.contributor.author	Aktaş, Fatih Selim
dc.date.accessioned	2023-08-11T11:28:31Z
dc.date.available	2023-08-11T11:28:31Z
dc.date.copyright	2023-07
dc.date.issued	2023-07
dc.date.submitted	2023-08-09
dc.description	Cataloged from PDF version of article.	en_US
dc.description	Includes bibliographical references (leaves 74-82).	en_US
dc.description.abstract	The classical Principal Component Analysis (PCA) problem consists of finding a linear transform that reduces the dimensionality of a dataset while retaining most of its variation. An additional sparsity constraint sets most of the coefficients to zero, which makes the linear transform easier to interpret. We present two approaches to sparsity constrained PCA. First, we develop computationally cheap heuristics that can be deployed on very high-dimensional problems. These heuristics are justified by linear algebra approximations and theoretical guarantees, and we strengthen them further by exploiting the necessary conditions of the optimization model. Second, we use a non-convex log-sum penalty in the semidefinite space. We show its connection to the cardinality function and develop an algorithm, PCA Sparsified, that solves the problem locally by solving a sequence of convex optimization problems. We analyze the theoretical properties of this algorithm and comment on its numerical implementation. Moreover, we derive a pre-processing method that can be used with previous approaches. Finally, our numerical experiments show that the greedy algorithms scale easily to high-dimensional problems while being highly competitive with state-of-the-art algorithms on many problems and even outperforming them uniformly in some cases. Additionally, we illustrate the effectiveness of PCA Sparsified on small-dimensional problems in terms of variance explained: although it is computationally very demanding, it consistently outperforms local and greedy approaches.
dc.description.statementofresponsibility	by Fatih Selim Aktaş
dc.embargo.release	2024-02-09
dc.format.extent	xi, 92 leaves : charts ; 30 cm.
dc.identifier.itemid	B162307
dc.identifier.uri	https://hdl.handle.net/11693/112649
dc.language.iso	English
dc.rights	info:eu-repo/semantics/openAccess
dc.subject	Sparse PCA
dc.subject	Greedy algorithms
dc.subject	SDP
dc.title	Algorithms for sparsity constrained principal component analysis
dc.title.alternative	Seyrek kısıtlı temel bileşen analizi için algoritmalar
dc.type	Thesis
thesis.degree.discipline	Industrial Engineering
thesis.degree.level	Master's
thesis.degree.name	MS (Master of Science)

Files

Original bundle

Name: B162307.pdf
Size: 1.4 MB
Format: Adobe Portable Document Format

Collections

Graduate School of Engineering and Science