Bayesian Levy-Dynamic Spatio-Temporal Process: Towards Big Data Analysis

Enregistré dans:
Détails bibliographiques
Publié dans:arXiv.org (Dec 16, 2024), p. n/a
Auteur principal: Bhattacharya, Sourabh
Publié:
Cornell University Library, arXiv.org
Sujets:
Accès en ligne:Citation/Abstract
Full text outside of ProQuest
Tags: Ajouter un tag
Pas de tags, Soyez le premier à ajouter un tag!

MARC

LEADER 00000nab a2200000uu 4500
001 2529030275
003 UK-CbPIL
022 |a 2331-8422 
035 |a 2529030275 
045 0 |b d20241216 
100 1 |a Bhattacharya, Sourabh 
245 1 |a Bayesian Levy-Dynamic Spatio-Temporal Process: Towards Big Data Analysis 
260 |b Cornell University Library, arXiv.org  |c Dec 16, 2024 
513 |a Working Paper 
520 3 |a In this era of big data, all scientific disciplines are evolving fast to cope up with the enormity of the available information. So is statistics, the queen of science. Big data are particularly relevant to spatio-temporal statistics, thanks to much-improved technology in satellite based remote sensing and Geographical Information Systems. However, none of the existing approaches seem to meet the simultaneous demand of reality emulation and cheap computation. In this article, with the Levy random fields as the starting point, e construct a new Bayesian nonparametric, nonstationary and nonseparable dynamic spatio-temporal model with the additional realistic property that the lagged spatio-temporal correlations converge to zero as the lag tends to infinity. Although our Bayesian model seems to be intricately structured and is variable-dimensional with respect to each time index, we are able to devise a fast and efficient parallel Markov Chain Monte Carlo (MCMC) algorithm for Bayesian inference. Our simulation experiment brings out quite encouraging performance from our Bayesian Levy-dynamic approach. We finally apply our Bayesian Levy-dynamic model and methods to a sea surface temperature dataset consisting of 139,300 data points in space and time. Although not big data in the true sense, this is a large and highly structured data by any standard. Even for this large and complex data, our parallel MCMC algorithm, implemented on 80 processors, generated 110,000 MCMC realizations from the Levy-dynamic posterior within a single day, and the resultant Bayesian posterior predictive analysis turned out to be encouraging. Thus, it is not unreasonable to expect that with significantly more computing resources, it is feasible to analyse terabytes of spatio-temporal data with our new model and methods. 
653 |a Data analysis 
653 |a Big Data 
653 |a Algorithms 
653 |a Geographic information systems 
653 |a Sea surface temperature 
653 |a Fields (mathematics) 
653 |a Markov chains 
653 |a Bayesian analysis 
653 |a Dynamic models 
653 |a Massive data points 
653 |a Statistical inference 
653 |a Data points 
653 |a Remote sensing 
773 0 |t arXiv.org  |g (Dec 16, 2024), p. n/a 
786 0 |d ProQuest  |t Engineering Database 
856 4 1 |3 Citation/Abstract  |u https://www.proquest.com/docview/2529030275/abstract/embedded/6A8EOT78XXH2IG52?source=fedsrch 
856 4 0 |3 Full text outside of ProQuest  |u http://arxiv.org/abs/2105.08451