Developing the Raster Big Data Benchmark: A Comparison of Raster Analysis on Big Data Platforms
Type
ArticleKAUST Department
Energy Resources & Petroleum EngineeringPhysical Science and Engineering (PSE) Division
Date
2020-11-19Submitted Date
2020-09-02Permanent link to this record
http://hdl.handle.net/10754/666094
Metadata
Show full item recordAbstract
Technologies around the world produce and interact with geospatial data instantaneously, from mobile web applications to satellite imagery that is collected and processed across the globe daily. Big raster data allow researchers to integrate and uncover new knowledge about geospatial patterns and processes. However, we are at a critical moment, as we have an ever-growing number of big data platforms that are being co-opted to support spatial analysis. A gap in the literature is the lack of a robust assessment comparing the efficiency of raster data analysis on big data platforms. This research begins to address this issue by establishing a raster data benchmark that employs freely accessible datasets to provide a comprehensive performance evaluation and comparison of raster operations on big data platforms. The benchmark is critical for evaluating the performance of spatial operations on big data platforms. The benchmarking datasets and operations are applied to three big data platforms. We report computing times and performance bottlenecks so that GIScientists can make informed choices regarding the performance of each platform. Each platform is evaluated for five raster operations: pixel count, reclassification, raster add, focal averaging, and zonal statistics using three raster different datasets.Citation
Haynes, D., Mitchell, P., & Shook, E. (2020). Developing the Raster Big Data Benchmark: A Comparison of Raster Analysis on Big Data Platforms. ISPRS International Journal of Geo-Information, 9(11), 690. doi:10.3390/ijgi9110690Sponsors
Research reported in this publication was supported by the National Cancer Institute of the National Institutes of Health under Award Number T32CA163184 (Michele Allen, MD, MS; PI). The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.Publisher
MDPI AGAdditional Links
https://www.mdpi.com/2220-9964/9/11/690ae974a485f413a2113503eed53cd6c53
10.3390/ijgi9110690
Scopus Count
Except where otherwise noted, this item's license is described as This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.