Statnote 27: Principal components analysis (PCA)

Anthony Hilton, Richard A. Armstrong

Research output: Contribution to specialist publicationArticle

Abstract

In Statnotes 24 and 25, multiple linear regression, a statistical method that examines the relationship between a single dependent variable (Y) and two or more independent variables (X), was described. The principle objective of such an analysis was to determine which of the X variables had a significant influence on Y and to construct an equation that predicts Y from the X variables. ‘Principal components analysis’ (PCA) and ‘factor analysis’ (FA) are also methods of examining the relationships between different variables but they differ from multiple regression in that no distinction is made between the dependent and independent variables, all variables being essentially treated the same. Originally, PCA and FA were regarded as distinct methods but in recent times they have been combined into a single analysis, PCA often being the first stage of a FA. The basic objective of a PCA/FA is to examine the relationships between the variables or the ‘structure’ of the variables and to determine whether these relationships can be explained by a smaller number of ‘factors’. This statnote describes the use of PCA/FA in the analysis of the differences between the DNA profiles of different MRSA strains introduced in Statnote 26.
LanguageEnglish
Pages37-39
Number of pages3
Volume12
Specialist publicationMicrobiologist
Publication statusPublished - Dec 2011

Fingerprint

factor analysis
regression
statistical method

Keywords

  • multiple linear regression
  • principal components analysis
  • PCA
  • factor analysis
  • FA
  • DNA profiles
  • MRSA strains

Cite this

Hilton, Anthony ; Armstrong, Richard A. / Statnote 27: Principal components analysis (PCA). In: Microbiologist. 2011 ; Vol. 12. pp. 37-39.
@misc{6f0a43813a12450eba5b2e6be44891ec,
title = "Statnote 27: Principal components analysis (PCA)",
abstract = "In Statnotes 24 and 25, multiple linear regression, a statistical method that examines the relationship between a single dependent variable (Y) and two or more independent variables (X), was described. The principle objective of such an analysis was to determine which of the X variables had a significant influence on Y and to construct an equation that predicts Y from the X variables. ‘Principal components analysis’ (PCA) and ‘factor analysis’ (FA) are also methods of examining the relationships between different variables but they differ from multiple regression in that no distinction is made between the dependent and independent variables, all variables being essentially treated the same. Originally, PCA and FA were regarded as distinct methods but in recent times they have been combined into a single analysis, PCA often being the first stage of a FA. The basic objective of a PCA/FA is to examine the relationships between the variables or the ‘structure’ of the variables and to determine whether these relationships can be explained by a smaller number of ‘factors’. This statnote describes the use of PCA/FA in the analysis of the differences between the DNA profiles of different MRSA strains introduced in Statnote 26.",
keywords = "multiple linear regression, principal components analysis, PCA, factor analysis, FA, DNA profiles, MRSA strains",
author = "Anthony Hilton and Armstrong, {Richard A.}",
year = "2011",
month = "12",
language = "English",
volume = "12",
pages = "37--39",
journal = "Microbiologist",
issn = "1479-2699",

}

Statnote 27: Principal components analysis (PCA). / Hilton, Anthony; Armstrong, Richard A.

In: Microbiologist, Vol. 12, 12.2011, p. 37-39.

Research output: Contribution to specialist publicationArticle

TY - GEN

T1 - Statnote 27: Principal components analysis (PCA)

AU - Hilton, Anthony

AU - Armstrong, Richard A.

PY - 2011/12

Y1 - 2011/12

N2 - In Statnotes 24 and 25, multiple linear regression, a statistical method that examines the relationship between a single dependent variable (Y) and two or more independent variables (X), was described. The principle objective of such an analysis was to determine which of the X variables had a significant influence on Y and to construct an equation that predicts Y from the X variables. ‘Principal components analysis’ (PCA) and ‘factor analysis’ (FA) are also methods of examining the relationships between different variables but they differ from multiple regression in that no distinction is made between the dependent and independent variables, all variables being essentially treated the same. Originally, PCA and FA were regarded as distinct methods but in recent times they have been combined into a single analysis, PCA often being the first stage of a FA. The basic objective of a PCA/FA is to examine the relationships between the variables or the ‘structure’ of the variables and to determine whether these relationships can be explained by a smaller number of ‘factors’. This statnote describes the use of PCA/FA in the analysis of the differences between the DNA profiles of different MRSA strains introduced in Statnote 26.

AB - In Statnotes 24 and 25, multiple linear regression, a statistical method that examines the relationship between a single dependent variable (Y) and two or more independent variables (X), was described. The principle objective of such an analysis was to determine which of the X variables had a significant influence on Y and to construct an equation that predicts Y from the X variables. ‘Principal components analysis’ (PCA) and ‘factor analysis’ (FA) are also methods of examining the relationships between different variables but they differ from multiple regression in that no distinction is made between the dependent and independent variables, all variables being essentially treated the same. Originally, PCA and FA were regarded as distinct methods but in recent times they have been combined into a single analysis, PCA often being the first stage of a FA. The basic objective of a PCA/FA is to examine the relationships between the variables or the ‘structure’ of the variables and to determine whether these relationships can be explained by a smaller number of ‘factors’. This statnote describes the use of PCA/FA in the analysis of the differences between the DNA profiles of different MRSA strains introduced in Statnote 26.

KW - multiple linear regression

KW - principal components analysis

KW - PCA

KW - factor analysis

KW - FA

KW - DNA profiles

KW - MRSA strains

UR - http://issuu.com/societyforappliedmicrobiology/docs/dec2011_micro

M3 - Article

VL - 12

SP - 37

EP - 39

JO - Microbiologist

T2 - Microbiologist

JF - Microbiologist

SN - 1479-2699

ER -