The principled statistical application of Gaussian random field models used in geostatistics has historically been limited to data sets of a small size. This limitation is imposed by the requirement to store and invert the covariance matrix of all the samples to obtain a predictive distribution at unsampled locations, or to use likelihood-based covariance estimation. Various ad hoc approaches to solve this problem have been adopted, such as selecting a neighborhood region and/or a small number of observations to use in the kriging process, but these have no sound theoretical basis and it is unclear what information is being lost. In this article, we present a Bayesian method for estimating the posterior mean and covariance structures of a Gaussian random field using a sequential estimation algorithm. By imposing sparsity in a well-defined framework, the algorithm retains a subset of “basis vectors” that best represent the “true” posterior Gaussian random field model in the relative entropy sense. This allows a principled treatment of Gaussian random field models on very large data sets. The method is particularly appropriate when the Gaussian random field model is regarded as a latent variable model, which may be nonlinearly related to the observations. We show the application of the sequential, sparse Bayesian estimation in Gaussian random field models and discuss its merits and drawbacks.
Bibliographical noteThe definitive version is available at www.onlinelibrary.wiley.com. Permissions sought by third parties must be acquired from Wiley-Blackwell
- Gaussian random field models
- predictive distribution
- covariance estimation
- sequential estimation algorithm
Cornford, D., Csató, L., & Opper, M. (2005). Sequential, Bayesian geostatistics: a principled method for large data sets. Geographical Analysis, 37(2), 183-199. https://doi.org/10.1111/j.1538-4632.2005.00635.x