ABSTRACT
We extend the capability of space-time geostatistical modeling using algebraic approximations, illustrating application-expected accuracy worthy of double precision from majority low-precision computations and low-rank matrix approximations. We exploit the mathematical structure of the dense covariance matrix whose inverse action and determinant are repeatedly required in Gaussian log-likelihood optimization. Geostatistics augments first-principles modeling approaches for the prediction of environmental phenomena given the availability of measurements at a large number of locations; however, traditional Cholesky-based approaches grow cubically in complexity, gating practical extension to continental and global datasets now available. We combine the linear algebraic contributions of mixed-precision and low-rank computations within a tile-based Cholesky solver with on-demand casting of precisions and dynamic runtime support from PaRSEC to orchestrate tasks and data movement. Our adaptive approach scales on various systems and leverages the Fujitsu A64FX nodes of Fugaku to achieve up to 12X performance speedup against the highly optimized dense Cholesky implementation.
Supplemental Material
- S. Abdulah, H. Ltaief, Y. Sun, M. G. Genton, and D. E. Keyes, "ExaGeoStat: A High Performance Unified Software for Geostatistics on Manycore Systems," IEEE Transactions on Parallel and Distributed Systems, vol. 29, no. 12, pp. 2771--2784, 2018.Google Scholar
Cross Ref
- R Core Team, R: A Language and Environment for Statistical Computing, R Foundation for Statistical Computing, Vienna, Austria, 2022. [Online]. Available: https://www.R-project.org/Google Scholar
- D. Hammerling, H. Huang, and L. Blake, "Implementing Spatial Statistical Methods for Massive Data," in Joint Statistics Meetings, 2019.Google Scholar
- M. Stein, "Limitations on Low-Rank Approximations for Covariance Matrices of Spatial Data," Spatial Statistics, vol. 8, pp. 1--19, 2014.Google Scholar
Cross Ref
- W. Hackbusch, "A Sparse Matrix Arithmetic Based on H-matrices. Part I: Introduction to H-Matrices," Computing, vol. 62, no. 2, pp. 89--108, 1999.Google Scholar
Digital Library
- Y. Sun, B. Li, and M. G. Genton, "Geostatistics for Large Datasets," in Advances and challenges in space-time modelling of natural events. Springer, 2012, pp. 55--77.Google Scholar
- P. Amestoy, C. Ashcraft, O. Boiteau, A. Buttari, J.-Y. L'Excellent, and C. Weisbecker, "Improving Multifrontal Methods By Means Of Block Low-Rank Representations," SIAM Journal on Scientific Computing, vol. 37, no. 3, pp. A1451--A1474, 2015.Google Scholar
Digital Library
- K. Akbudak, H. Ltaief, A. Mikhalev, and D. Keyes, "Tile Low Rank Cholesky Factorization for Climate/Weather Modeling Applications on Manycore Architectures," in 32nd International Conference on High Performance, Frankfurt, Germany. Springer, 2017, pp. 22--40.Google Scholar
- E. Agullo, J. Demmel, J. Dongarra, B. Hadri, J. Kurzak, J. Langou, H. Ltaief, P. Luszczek, and S. Tomov, "Numerical Linear Algebra on Emerging Architectures: The PLASMA and MAGMA Projects," J. Phys.: Conf. Ser., vol. 180, no. 1, 2009.Google Scholar
Cross Ref
- G. Morton, A Computer Oriented Geodetic Data Base and a New Technique in File Sequencing. IBM, 1966.Google Scholar
- S. Abdulah, H. Ltaief, Y. Sun, M. G. Genton, and D. E. Keyes, "Geostatistical Modeling and Prediction Using Mixed Precision Tile Cholesky Factorization," in 26th International Conference on High Performance Computing, Data, and Analytics (HiPC). IEEE, 2019, pp. 152--162.Google Scholar
- S. Abdulah, Q. Cao, Y. Pei, G. Bosilca, J. Dongarra, M. G. Genton, D. E. Keyes, H. Ltaief, and Y. Sun, "Accelerating Geostatistical Modeling and Prediction With Mixed-Precision Computations: A High-Productivity Approach with PaRSEC," IEEE Transactions on Parallel and Distributed Systems, vol. 33, no. 4, pp. 964--976, 2021.Google Scholar
Cross Ref
- M. G. Genton, "Separable Approximations of Space-time Covariance Matrices," Environmetrics: The Official Journal of the International Environmetrics Society, vol. 18, no. 7, pp. 681--695, 2007.Google Scholar
Cross Ref
- N. Cressie and C. K. Wikle, Statistics for Spatio-Temporal Data. John Wiley & Sons, 2015.Google Scholar
- B. Matérn, Spatial Variation. Springer-Verlag, Berlin,, 1986, vol. 36.Google Scholar
- J.-P. Chiles and P. Delfiner, Geostatistics: Modeling Spatial Uncertainty. John Wiley & Sons, 2009, vol. 497.Google Scholar
- S. Börm and J. Garcke, "Approximating Gaussian Processes with H2-Matrices," in European Conference on Machine Learning. Springer, 2007, pp. 42--53.Google Scholar
- T. Gneiting, "Nonseparable, Stationary Covariance Functions for Space-time Data," Journal of the American Statistical Association, vol. 97, no. 458, pp. 590--600, 2002.Google Scholar
Cross Ref
- C. Augonnet, S. Thibault, R. Namyst, and P.-A. Wacrenier, "StarPU: a unified platform for task scheduling on heterogeneous multicore architectures," Concurrency and Computation: Practice and Experience, vol. 23, no. 2, pp. 187--198, 2011.Google Scholar
Digital Library
- A. Duran, R. Ferrer, E. Ayguade, R. M. Badia, and J. Labarta, "A proposal to extend the OpenMP tasking model with dependent tasks," Intl. Journal of Parallel Programming, vol. 37, no. 3, pp. 292--305, 2009.Google Scholar
Digital Library
- R. Hoque, T. Herault, G. Bosilca, and J. Dongarra, "Dynamic Task Discovery in PaRSEC: A Data-flow Task-based Runtime," in Proceedings of the 8th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems, ser. ScalA '17, 2017.Google Scholar
Digital Library
- T. Heller, H. Kaiser, and K. Iglberger, "Application of the ParalleX execution model to stencil-based problems," Computer Science - Research and Development, vol. 28, no. 2--3, pp. 253--261, 2013.Google Scholar
- M. Bauer, S. Treichler, E. Slaughter, and A. Aiken, "Legion: Expressing locality and independence with logical regions," in International Conference for High Performance Computing, Networking, Storage and Analysis, SC. IEEE, 2012, pp. 1--11.Google Scholar
- L. V. Kale and S. Krishnan, "CHARM++: a portable concurrent object oriented system based on C++," in ACM Sigplan Notices, vol. 28, no. 10. ACM, 1993, pp. 91--108.Google Scholar
- G. Bosilca, A. Bouteiller, A. Danalis, M. Faverge, T. Hérault, and J. J. Dongarra, "PaRSEC: Exploiting Heterogeneity to Enhance Scalability," Computing in Science & Engineering, vol. 15, no. 6, pp. 36--45, 2013.Google Scholar
Digital Library
- M. Asch, T. Moore, R. Badia, M. Beck, P. Beckman, T. Bidot, F. Bodin, F. Cappello, A. Choudhary, B. de Supinski, E. Deelman, J. Dongarra, A. Dubey, G. Fox, H. Fu, S. Girona, W. Gropp, M. Heroux, Y. Ishikawa, K. Keahey, D. Keyes, W. Kramer, J.-F. Lavignon, Y. Lu, S. Matsuoka, B. Mohr, D. Reed, S. Requena, J. Saltz, T. Schulthess, R. Stevens, M. Swany, A. Szalay, W. Tang, G. Varoquaux, J.-P. Vilotte, R. Wisniewski, Z. Xu, and I. Zacharov, "Big Data and Extreme-scale Computing: Pathways to Convergence-toward a Shaping Strategy for a Future Software and Data Ecosystem for Scientific Inquiry," The International Journal of High Performance Computing Applications, vol. 32, no. 4, pp. 435--479, 2018.Google Scholar
Digital Library
- "NVIDIA Tensor Cores," https://www.nvidia.com/en-us/data-center/tensorcore/, 2019, [Online; accessed June 2019].Google Scholar
- "Google Tensor Processing Unit (TPU)," https://cloud.google.com/tpu/, 2019, [Online; accessed June 2019].Google Scholar
- W. Joubert, D. Weighill, D. Kainer, S. Climer, A. Justice, K. Fagnan, and D. Jacobson, "Attacking the Opioid Epidemic: Determining the Epistatic and Pleiotropic Genetic Architectures for Chronic Pain and Opioid Addiction," in SC18: International Conference for High Performance Computing, Networking, Storage and Analysis. IEEE, 2018, pp. 717--730.Google Scholar
- K. Ando, R. Bale, C. Li, S. Matsuoka, K. Onishi, and M. Tsubokura, "Digital Transformation of Droplet/Aerosol Infection Risk Assessment Realized on" Fugaku" for the Fight Against COVID-19," arXiv preprint arXiv:2110.09769, 2021.Google Scholar
- Y. Liu, X. Liu, F. Li, H. Fu, Y. Yang, J. Song, P. Zhao, Z. Wang, D. Peng, H. Chen et al., "Closing the Quantum Supremacy gap: Achieving Realtime Simulation of a Random Quantum Circuit Using a new Sunway Supercomputer," in Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, 2021, pp. 1--12.Google Scholar
- Q. Cao, Y. Pei, K. Akbudak, A. Mikhalev, G. Bosilca, H. Ltaief, D. Keyes, and J. Dongarra, "Extreme-scale Task-based Cholesky Factorization Toward Climate and Weather Prediction Applications," in Proceedings of the Platform for Advanced Scientific Computing Conference (PASC), 2020, pp. 1--11.Google Scholar
- A. Danalis, G. Bosilca, A. Bouteiller, T. Herault, and J. Dongarra, "PTG: an Abstraction for Unhindered Parallelism," in 2014 Fourth International Workshop on Domain-Specific Languages and High-Level Frameworks for High Performance Computing. IEEE, 2014, pp. 21--30.Google Scholar
- Q. Cao, Y. Pei, T. Herault, K. Akbudak, A. Mikhalev, G. Bosilca, H. Ltaief, D. Keyes, and J. Dongarra, "Performance Analysis of Tile Low-Rank Cholesky Factorization Using PaRSEC Instrumentation Tools," in IEEE/ACM International Workshop on Programming and Performance Visualization Tools (ProTools) at SC19, 2019, pp. 25--32.Google Scholar
- Q. Cao, Y. Pei, K. Akbudak, G. Bosilca, H. Ltaief, D. Keyes, and J. Dongarra, "Leveraging PaRSEC Runtime Support to Tackle Challenging 3D Data-Sparse Matrix Problems," in 2021 IEEE International Parallel and Distributed Processing Symposium (IPDPS). IEEE, 2021, pp. 79--89.Google Scholar
- Q. Cao, R. Alomairy, Y. Pei, G. Bosilca, H. Ltaief, D. Keyes, and J. Dongarra, "A Framework to Exploit Data Sparsity in Tile Low-Rank Cholesky Factorization," in 36th IEEE International Parallel & Distributed Processing Symposium (IPDPS), 2022.Google Scholar
- N. W. Chaney, P. Metcalfe, and E. F. Wood, "Hydroblocks: a field-scale resolving land surface model for application over continental extents," Hydrological Processes, vol. 30, no. 20, pp. 3543--3559, 2016.Google Scholar
Cross Ref
- A. McNally, J. Jacob, K. Arsenault, K. Slinski, D. Sarmiento, A. Hoell, S. Pervez, J. Rowland, M. Budde, S. Kumar et al., "A hydrologic monitoring dataset for food and water security applications in central asia," Earth System Science Data Discussions, pp. 1--32, 2021.Google Scholar
- N. J. Higham and T. Mary, "Mixed precision algorithms in numerical linear algebra," Acta Numerica, vol. 31, pp. 347--414, 2022.Google Scholar
Cross Ref
- M. L. O. Salvana, S. Abdulah, H. Ltaief, Y. Sun, M. G. Genton, and D. Keyes, "Parallel Space-Time Likelihood Optimization for Air Pollution Prediction on Large-Scale Systems," in Proceedings of the Platform for Advanced Scientific Computing Conference (PASC), 2022.Google Scholar
- J. Kennedy and R. Eberhart, "Particle swarm optimization," in Proceedings of ICNN'95 - International Conference on Neural Networks, vol. 4. IEEE, 1995, pp. 1942--1948.Google Scholar
Comments