Definition
Principal component analysis (PCA) is a statistical technique aimed to explore and reduce the dimensionality of a multivariate dataset. It is based on the key idea of finding a representation of the p-dimensional data through a smaller set of k < p new variables, defined as linear combinations of the original variables. These are obtained as to explain at best the data variability. PCA was first introduced by Pearson (1901); nowadays its formulation has been developed for varied types of Euclidean data, including compositional data and functional data. Extensions to non-Euclidean data are available as well.
PCA as an Optimization Problem
It is given a dataset made of n observations of p variables, organized in a data matrix \( \mathbbm{X}=\left[{x}_{ij}\right] \) in ℝn,p, whose columns are denoted by x·j = (x1j, …, xnj). It is assumed that the variables have null sample mean (\(...
Bibliography
Chiles J-P, Delfiner P (2009) Geostatistics: modeling spatial uncertainty, vol 497. Wiley
Fletcher P, Lu C, Pizer S, Joshi S (2004) Principal geodesic analysis for the study of nonlinear statistics of shape. IEEE Trans Med Imaging 23:995–1005
Gabriel KR (1971) The biplot graphic display of matrices with application to principal component analysis. Biometrika 58(3):453–467
Hyvärinen A (2013) Independent component analysis: recent advances. Philos Trans R Soc A Math Phys Eng Sci 371(1984):20110534
Johnson R, Wichern D (2002) Applied multivariate statistical analysis, 5th edn. Prentice Hall
Pawlowsky-Glahn V, Egozcue JJ, Tolosana-Delgado R (2015) Modeling and analysis of compositional data. Wiley, Chichester
Pearson KF (1901) LIII. On lines and planes of closest fit to systems of points in space. Lond Edinb Dublin Philos Mag J Sci 2(11):559–572
Ramsay J, Silverman BW (2005) Functional data analysis. Springer, New York
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 Springer Nature Switzerland AG
About this entry
Cite this entry
Menafoglio, A. (2022). Principal Component Analysis. In: Daya Sagar, B.S., Cheng, Q., McKinley, J., Agterberg, F. (eds) Encyclopedia of Mathematical Geosciences. Encyclopedia of Earth Sciences Series. Springer, Cham. https://doi.org/10.1007/978-3-030-26050-7_256-2
Download citation
DOI: https://doi.org/10.1007/978-3-030-26050-7_256-2
Received:
Accepted:
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-26050-7
Online ISBN: 978-3-030-26050-7
eBook Packages: Springer Reference Earth and Environm. ScienceReference Module Physical and Materials ScienceReference Module Earth and Environmental Sciences
Publish with us
Chapter history
-
Latest
Principal Component Analysis- Published:
- 05 August 2022
DOI: https://doi.org/10.1007/978-3-030-26050-7_256-2
-
Original
Principal Component Analysis- Published:
- 11 January 2022
DOI: https://doi.org/10.1007/978-3-030-26050-7_256-1