Mixtures of probabilistic principal component analysers

Michael E. Tipping; Christopher M. Bishop

doi:10.1162/089976699300016728

Mixtures of probabilistic principal component analysers

Michael E. Tipping, Christopher M. Bishop

Research output: Contribution to journal › Article › peer-review

Abstract

Principal component analysis (PCA) is one of the most popular techniques for processing, compressing and visualising data, although its effectiveness is limited by its global linearity. While nonlinear variants of PCA have been proposed, an alternative paradigm is to capture data complexity by a combination of local linear PCA projections. However, conventional PCA does not correspond to a probability density, and so there is no unique way to combine PCA models. Previous attempts to formulate mixture models for PCA have therefore to some extent been ad hoc. In this paper, PCA is formulated within a maximum-likelihood framework, based on a specific form of Gaussian latent variable model. This leads to a well-defined mixture model for probabilistic principal component analysers, whose parameters can be determined using an EM algorithm. We discuss the advantages of this model in the context of clustering, density modelling and local dimensionality reduction, and we demonstrate its application to image compression and handwritten digit recognition.

Original language	English
Pages (from-to)	443-482
Number of pages	40
Journal	Neural Computation
Volume	11
Issue number	2
DOIs	https://doi.org/10.1162/089976699300016728
Publication status	Published - 15 Feb 1999

Bibliographical note

Copyright of the Massachusetts Institute of Technology Press (MIT Press)

Keywords

Principal component analysis
projections
non-linear variants
probabilistic
compression
handwritten
digit recognition

Access to Document

10.1162/089976699300016728

Cite this

@article{8980862a69fd4a728a42ae533afd5b8d,

title = "Mixtures of probabilistic principal component analysers",

abstract = "Principal component analysis (PCA) is one of the most popular techniques for processing, compressing and visualising data, although its effectiveness is limited by its global linearity. While nonlinear variants of PCA have been proposed, an alternative paradigm is to capture data complexity by a combination of local linear PCA projections. However, conventional PCA does not correspond to a probability density, and so there is no unique way to combine PCA models. Previous attempts to formulate mixture models for PCA have therefore to some extent been ad hoc. In this paper, PCA is formulated within a maximum-likelihood framework, based on a specific form of Gaussian latent variable model. This leads to a well-defined mixture model for probabilistic principal component analysers, whose parameters can be determined using an EM algorithm. We discuss the advantages of this model in the context of clustering, density modelling and local dimensionality reduction, and we demonstrate its application to image compression and handwritten digit recognition.",

keywords = "Principal component analysis, projections, non-linear variants, probabilistic, compression, handwritten, digit recognition",

author = "Tipping, {Michael E.} and Bishop, {Christopher M.}",

note = "Copyright of the Massachusetts Institute of Technology Press (MIT Press)",

year = "1999",

month = feb,

day = "15",

doi = "10.1162/089976699300016728",

language = "English",

volume = "11",

pages = "443--482",

journal = "Neural Computation",

issn = "0899-7667",

publisher = "MIT Press Journals",

number = "2",

}

TY - JOUR

T1 - Mixtures of probabilistic principal component analysers

AU - Tipping, Michael E.

AU - Bishop, Christopher M.

N1 - Copyright of the Massachusetts Institute of Technology Press (MIT Press)

PY - 1999/2/15

Y1 - 1999/2/15

N2 - Principal component analysis (PCA) is one of the most popular techniques for processing, compressing and visualising data, although its effectiveness is limited by its global linearity. While nonlinear variants of PCA have been proposed, an alternative paradigm is to capture data complexity by a combination of local linear PCA projections. However, conventional PCA does not correspond to a probability density, and so there is no unique way to combine PCA models. Previous attempts to formulate mixture models for PCA have therefore to some extent been ad hoc. In this paper, PCA is formulated within a maximum-likelihood framework, based on a specific form of Gaussian latent variable model. This leads to a well-defined mixture model for probabilistic principal component analysers, whose parameters can be determined using an EM algorithm. We discuss the advantages of this model in the context of clustering, density modelling and local dimensionality reduction, and we demonstrate its application to image compression and handwritten digit recognition.

AB - Principal component analysis (PCA) is one of the most popular techniques for processing, compressing and visualising data, although its effectiveness is limited by its global linearity. While nonlinear variants of PCA have been proposed, an alternative paradigm is to capture data complexity by a combination of local linear PCA projections. However, conventional PCA does not correspond to a probability density, and so there is no unique way to combine PCA models. Previous attempts to formulate mixture models for PCA have therefore to some extent been ad hoc. In this paper, PCA is formulated within a maximum-likelihood framework, based on a specific form of Gaussian latent variable model. This leads to a well-defined mixture model for probabilistic principal component analysers, whose parameters can be determined using an EM algorithm. We discuss the advantages of this model in the context of clustering, density modelling and local dimensionality reduction, and we demonstrate its application to image compression and handwritten digit recognition.

KW - Principal component analysis

KW - projections

KW - non-linear variants

KW - probabilistic

KW - compression

KW - handwritten

KW - digit recognition

UR - http://www.mitpressjournals.org/doi/abs/10.1162/089976699300016728

U2 - 10.1162/089976699300016728

DO - 10.1162/089976699300016728

M3 - Article

SN - 0899-7667

VL - 11

SP - 443

EP - 482

JO - Neural Computation

JF - Neural Computation

IS - 2

ER -

Mixtures of probabilistic principal component analysers

Abstract

Bibliographical note

Keywords

Access to Document

Other files and links

Fingerprint

Cite this