Principal Component Analysis

  • Living reference work entry
  • Latest version View entry history
  • First Online:
Encyclopedia of Mathematical Geosciences

Part of the book series: Encyclopedia of Earth Sciences Series ((EESS))

  • 23 Accesses

Synonyms

Karhunen-Loève decomposition; Principal orthogonal decomposition

Definition

Principal component analysis (PCA) is a statistical technique aimed to explore and reduce the dimensionality of a multivariate dataset. It is based on the key idea of finding a representation of the p-dimensional data through a smaller set of k < p new variables, defined as linear combinations of the original variables. These are obtained as to explain at best the data variability. PCA was first introduced by Pearson (1901); nowadays its formulation has been developed for varied types of Euclidean data, including compositional data and functional data. Extensions to non-Euclidean data are available as well.

PCA as an Optimization Problem

It is given a dataset made of n observations of p variables, organized in a data matrix \( \mathbbm{X}=\left[{x}_{ij}\right] \) in ℝn,p, whose columns are denoted by x·j = (x1j, …, xnj). It is assumed that the variables have null sample mean (\(...

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Bibliography

  • Chiles J-P, Delfiner P (2009) Geostatistics: modeling spatial uncertainty, vol 497. Wiley

    Google Scholar 

  • Fletcher P, Lu C, Pizer S, Joshi S (2004) Principal geodesic analysis for the study of nonlinear statistics of shape. IEEE Trans Med Imaging 23:995–1005

    Article  Google Scholar 

  • Gabriel KR (1971) The biplot graphic display of matrices with application to principal component analysis. Biometrika 58(3):453–467

    Article  Google Scholar 

  • Hyvärinen A (2013) Independent component analysis: recent advances. Philos Trans R Soc A Math Phys Eng Sci 371(1984):20110534

    Article  Google Scholar 

  • Johnson R, Wichern D (2002) Applied multivariate statistical analysis, 5th edn. Prentice Hall

    Google Scholar 

  • Pawlowsky-Glahn V, Egozcue JJ, Tolosana-Delgado R (2015) Modeling and analysis of compositional data. Wiley, Chichester

    Google Scholar 

  • Pearson KF (1901) LIII. On lines and planes of closest fit to systems of points in space. Lond Edinb Dublin Philos Mag J Sci 2(11):559–572

    Article  Google Scholar 

  • Ramsay J, Silverman BW (2005) Functional data analysis. Springer, New York

    Book  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Alessandra Menafoglio .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 Springer Nature Switzerland AG

About this entry

Check for updates. Verify currency and authenticity via CrossMark

Cite this entry

Menafoglio, A. (2022). Principal Component Analysis. In: Daya Sagar, B.S., Cheng, Q., McKinley, J., Agterberg, F. (eds) Encyclopedia of Mathematical Geosciences. Encyclopedia of Earth Sciences Series. Springer, Cham. https://doi.org/10.1007/978-3-030-26050-7_256-2

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-26050-7_256-2

  • Received:

  • Accepted:

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-26050-7

  • Online ISBN: 978-3-030-26050-7

  • eBook Packages: Springer Reference Earth and Environm. ScienceReference Module Physical and Materials ScienceReference Module Earth and Environmental Sciences

Publish with us

Policies and ethics

Chapter history

  1. Latest

    Principal Component Analysis
    Published:
    05 August 2022

    DOI: https://doi.org/10.1007/978-3-030-26050-7_256-2

  2. Original

    Principal Component Analysis
    Published:
    11 January 2022

    DOI: https://doi.org/10.1007/978-3-030-26050-7_256-1

Navigation