A Resampling-Based Stochastic Approximation Method for Analysis of Large Geostatistical Data

Handle URI:
http://hdl.handle.net/10754/597394
Title:
A Resampling-Based Stochastic Approximation Method for Analysis of Large Geostatistical Data
Authors:
Liang, Faming; Cheng, Yichen; Song, Qifan; Park, Jincheol; Yang, Ping
Abstract:
The Gaussian geostatistical model has been widely used in modeling of spatial data. However, it is challenging to computationally implement this method because it requires the inversion of a large covariance matrix, particularly when there is a large number of observations. This article proposes a resampling-based stochastic approximation method to address this challenge. At each iteration of the proposed method, a small subsample is drawn from the full dataset, and then the current estimate of the parameters is updated accordingly under the framework of stochastic approximation. Since the proposed method makes use of only a small proportion of the data at each iteration, it avoids inverting large covariance matrices and thus is scalable to large datasets. The proposed method also leads to a general parameter estimation approach, maximum mean log-likelihood estimation, which includes the popular maximum (log)-likelihood estimation (MLE) approach as a special case and is expected to play an important role in analyzing large datasets. Under mild conditions, it is shown that the estimator resulting from the proposed method converges in probability to a set of parameter values of equivalent Gaussian probability measures, and that the estimator is asymptotically normally distributed. To the best of the authors' knowledge, the present study is the first one on asymptotic normality under infill asymptotics for general covariance functions. The proposed method is illustrated with large datasets, both simulated and real. Supplementary materials for this article are available online. © 2013 American Statistical Association.
Citation:
Liang F, Cheng Y, Song Q, Park J, Yang P (2013) A Resampling-Based Stochastic Approximation Method for Analysis of Large Geostatistical Data. Journal of the American Statistical Association 108: 325–339. Available: http://dx.doi.org/10.1080/01621459.2012.746061.
Publisher:
Informa UK Limited
Journal:
Journal of the American Statistical Association
KAUST Grant Number:
KUS-C1-016-04
Issue Date:
Mar-2013
DOI:
10.1080/01621459.2012.746061
Type:
Article
ISSN:
0162-1459; 1537-274X
Sponsors:
Liang's research was partially supported by grants from the National Science Foundation (DMS-1007457 and DMS-1106494) and the award (KUS-C1-016-04) made by King Abdullah University of Science and Technology (KAUST). Yang's research was partially supported by the endowment funds related to the David Bullock Harris Chair in Geosciences at the College of Geosciences, Texas A&M University. The authors thank the editor, associate editor, and the referees for their constructive comments that have led to significant improvement of this article.
Appears in Collections:
Publications Acknowledging KAUST Support

Full metadata record

DC FieldValue Language
dc.contributor.authorLiang, Famingen
dc.contributor.authorCheng, Yichenen
dc.contributor.authorSong, Qifanen
dc.contributor.authorPark, Jincheolen
dc.contributor.authorYang, Pingen
dc.date.accessioned2016-02-25T12:32:18Zen
dc.date.available2016-02-25T12:32:18Zen
dc.date.issued2013-03en
dc.identifier.citationLiang F, Cheng Y, Song Q, Park J, Yang P (2013) A Resampling-Based Stochastic Approximation Method for Analysis of Large Geostatistical Data. Journal of the American Statistical Association 108: 325–339. Available: http://dx.doi.org/10.1080/01621459.2012.746061.en
dc.identifier.issn0162-1459en
dc.identifier.issn1537-274Xen
dc.identifier.doi10.1080/01621459.2012.746061en
dc.identifier.urihttp://hdl.handle.net/10754/597394en
dc.description.abstractThe Gaussian geostatistical model has been widely used in modeling of spatial data. However, it is challenging to computationally implement this method because it requires the inversion of a large covariance matrix, particularly when there is a large number of observations. This article proposes a resampling-based stochastic approximation method to address this challenge. At each iteration of the proposed method, a small subsample is drawn from the full dataset, and then the current estimate of the parameters is updated accordingly under the framework of stochastic approximation. Since the proposed method makes use of only a small proportion of the data at each iteration, it avoids inverting large covariance matrices and thus is scalable to large datasets. The proposed method also leads to a general parameter estimation approach, maximum mean log-likelihood estimation, which includes the popular maximum (log)-likelihood estimation (MLE) approach as a special case and is expected to play an important role in analyzing large datasets. Under mild conditions, it is shown that the estimator resulting from the proposed method converges in probability to a set of parameter values of equivalent Gaussian probability measures, and that the estimator is asymptotically normally distributed. To the best of the authors' knowledge, the present study is the first one on asymptotic normality under infill asymptotics for general covariance functions. The proposed method is illustrated with large datasets, both simulated and real. Supplementary materials for this article are available online. © 2013 American Statistical Association.en
dc.description.sponsorshipLiang's research was partially supported by grants from the National Science Foundation (DMS-1007457 and DMS-1106494) and the award (KUS-C1-016-04) made by King Abdullah University of Science and Technology (KAUST). Yang's research was partially supported by the endowment funds related to the David Bullock Harris Chair in Geosciences at the College of Geosciences, Texas A&M University. The authors thank the editor, associate editor, and the referees for their constructive comments that have led to significant improvement of this article.en
dc.publisherInforma UK Limiteden
dc.subjectAsymptotic normalityen
dc.subjectInfill asymptoticsen
dc.subjectLarge spatial dataen
dc.subjectU-statisticsen
dc.titleA Resampling-Based Stochastic Approximation Method for Analysis of Large Geostatistical Dataen
dc.typeArticleen
dc.identifier.journalJournal of the American Statistical Associationen
dc.contributor.institutionTexas A and M University, College Station, United Statesen
kaust.grant.numberKUS-C1-016-04en
All Items in KAUST are protected by copyright, with all rights reserved, unless otherwise indicated.