10.5555/3571885.3571888acmconferencesArticle/Chapter ViewAbstractPublication PagesscConference Proceedingsconference-collections
research-article

Reshaping geostatistical modeling and prediction for extreme-scale environmental applications

Authors Info & Claims
Published:18 November 2022Publication History

ABSTRACT

We extend the capability of space-time geostatistical modeling using algebraic approximations, illustrating application-expected accuracy worthy of double precision from majority low-precision computations and low-rank matrix approximations. We exploit the mathematical structure of the dense covariance matrix whose inverse action and determinant are repeatedly required in Gaussian log-likelihood optimization. Geostatistics augments first-principles modeling approaches for the prediction of environmental phenomena given the availability of measurements at a large number of locations; however, traditional Cholesky-based approaches grow cubically in complexity, gating practical extension to continental and global datasets now available. We combine the linear algebraic contributions of mixed-precision and low-rank computations within a tile-based Cholesky solver with on-demand casting of precisions and dynamic runtime support from PaRSEC to orchestrate tasks and data movement. Our adaptive approach scales on various systems and leverages the Fujitsu A64FX nodes of Fugaku to achieve up to 12X performance speedup against the highly optimized dense Cholesky implementation.

Skip Supplemental Material Section

Supplemental Material

reshaping_geostatistical_modeling_and_prediction_for_extreme-scale_environmental_applications.mp4 (1080p).mp4

Presentation at SC '22

References

  1. S. Abdulah, H. Ltaief, Y. Sun, M. G. Genton, and D. E. Keyes, "ExaGeoStat: A High Performance Unified Software for Geostatistics on Manycore Systems," IEEE Transactions on Parallel and Distributed Systems, vol. 29, no. 12, pp. 2771--2784, 2018.Google ScholarGoogle ScholarCross RefCross Ref
  2. R Core Team, R: A Language and Environment for Statistical Computing, R Foundation for Statistical Computing, Vienna, Austria, 2022. [Online]. Available: https://www.R-project.org/Google ScholarGoogle Scholar
  3. D. Hammerling, H. Huang, and L. Blake, "Implementing Spatial Statistical Methods for Massive Data," in Joint Statistics Meetings, 2019.Google ScholarGoogle Scholar
  4. M. Stein, "Limitations on Low-Rank Approximations for Covariance Matrices of Spatial Data," Spatial Statistics, vol. 8, pp. 1--19, 2014.Google ScholarGoogle ScholarCross RefCross Ref
  5. W. Hackbusch, "A Sparse Matrix Arithmetic Based on H-matrices. Part I: Introduction to H-Matrices," Computing, vol. 62, no. 2, pp. 89--108, 1999.Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Y. Sun, B. Li, and M. G. Genton, "Geostatistics for Large Datasets," in Advances and challenges in space-time modelling of natural events. Springer, 2012, pp. 55--77.Google ScholarGoogle Scholar
  7. P. Amestoy, C. Ashcraft, O. Boiteau, A. Buttari, J.-Y. L'Excellent, and C. Weisbecker, "Improving Multifrontal Methods By Means Of Block Low-Rank Representations," SIAM Journal on Scientific Computing, vol. 37, no. 3, pp. A1451--A1474, 2015.Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. K. Akbudak, H. Ltaief, A. Mikhalev, and D. Keyes, "Tile Low Rank Cholesky Factorization for Climate/Weather Modeling Applications on Manycore Architectures," in 32nd International Conference on High Performance, Frankfurt, Germany. Springer, 2017, pp. 22--40.Google ScholarGoogle Scholar
  9. E. Agullo, J. Demmel, J. Dongarra, B. Hadri, J. Kurzak, J. Langou, H. Ltaief, P. Luszczek, and S. Tomov, "Numerical Linear Algebra on Emerging Architectures: The PLASMA and MAGMA Projects," J. Phys.: Conf. Ser., vol. 180, no. 1, 2009.Google ScholarGoogle ScholarCross RefCross Ref
  10. G. Morton, A Computer Oriented Geodetic Data Base and a New Technique in File Sequencing. IBM, 1966.Google ScholarGoogle Scholar
  11. S. Abdulah, H. Ltaief, Y. Sun, M. G. Genton, and D. E. Keyes, "Geostatistical Modeling and Prediction Using Mixed Precision Tile Cholesky Factorization," in 26th International Conference on High Performance Computing, Data, and Analytics (HiPC). IEEE, 2019, pp. 152--162.Google ScholarGoogle Scholar
  12. S. Abdulah, Q. Cao, Y. Pei, G. Bosilca, J. Dongarra, M. G. Genton, D. E. Keyes, H. Ltaief, and Y. Sun, "Accelerating Geostatistical Modeling and Prediction With Mixed-Precision Computations: A High-Productivity Approach with PaRSEC," IEEE Transactions on Parallel and Distributed Systems, vol. 33, no. 4, pp. 964--976, 2021.Google ScholarGoogle ScholarCross RefCross Ref
  13. M. G. Genton, "Separable Approximations of Space-time Covariance Matrices," Environmetrics: The Official Journal of the International Environmetrics Society, vol. 18, no. 7, pp. 681--695, 2007.Google ScholarGoogle ScholarCross RefCross Ref
  14. N. Cressie and C. K. Wikle, Statistics for Spatio-Temporal Data. John Wiley & Sons, 2015.Google ScholarGoogle Scholar
  15. B. Matérn, Spatial Variation. Springer-Verlag, Berlin,, 1986, vol. 36.Google ScholarGoogle Scholar
  16. J.-P. Chiles and P. Delfiner, Geostatistics: Modeling Spatial Uncertainty. John Wiley & Sons, 2009, vol. 497.Google ScholarGoogle Scholar
  17. S. Börm and J. Garcke, "Approximating Gaussian Processes with H2-Matrices," in European Conference on Machine Learning. Springer, 2007, pp. 42--53.Google ScholarGoogle Scholar
  18. T. Gneiting, "Nonseparable, Stationary Covariance Functions for Space-time Data," Journal of the American Statistical Association, vol. 97, no. 458, pp. 590--600, 2002.Google ScholarGoogle ScholarCross RefCross Ref
  19. C. Augonnet, S. Thibault, R. Namyst, and P.-A. Wacrenier, "StarPU: a unified platform for task scheduling on heterogeneous multicore architectures," Concurrency and Computation: Practice and Experience, vol. 23, no. 2, pp. 187--198, 2011.Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. A. Duran, R. Ferrer, E. Ayguade, R. M. Badia, and J. Labarta, "A proposal to extend the OpenMP tasking model with dependent tasks," Intl. Journal of Parallel Programming, vol. 37, no. 3, pp. 292--305, 2009.Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. R. Hoque, T. Herault, G. Bosilca, and J. Dongarra, "Dynamic Task Discovery in PaRSEC: A Data-flow Task-based Runtime," in Proceedings of the 8th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems, ser. ScalA '17, 2017.Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. T. Heller, H. Kaiser, and K. Iglberger, "Application of the ParalleX execution model to stencil-based problems," Computer Science - Research and Development, vol. 28, no. 2--3, pp. 253--261, 2013.Google ScholarGoogle Scholar
  23. M. Bauer, S. Treichler, E. Slaughter, and A. Aiken, "Legion: Expressing locality and independence with logical regions," in International Conference for High Performance Computing, Networking, Storage and Analysis, SC. IEEE, 2012, pp. 1--11.Google ScholarGoogle Scholar
  24. L. V. Kale and S. Krishnan, "CHARM++: a portable concurrent object oriented system based on C++," in ACM Sigplan Notices, vol. 28, no. 10. ACM, 1993, pp. 91--108.Google ScholarGoogle Scholar
  25. G. Bosilca, A. Bouteiller, A. Danalis, M. Faverge, T. Hérault, and J. J. Dongarra, "PaRSEC: Exploiting Heterogeneity to Enhance Scalability," Computing in Science & Engineering, vol. 15, no. 6, pp. 36--45, 2013.Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. M. Asch, T. Moore, R. Badia, M. Beck, P. Beckman, T. Bidot, F. Bodin, F. Cappello, A. Choudhary, B. de Supinski, E. Deelman, J. Dongarra, A. Dubey, G. Fox, H. Fu, S. Girona, W. Gropp, M. Heroux, Y. Ishikawa, K. Keahey, D. Keyes, W. Kramer, J.-F. Lavignon, Y. Lu, S. Matsuoka, B. Mohr, D. Reed, S. Requena, J. Saltz, T. Schulthess, R. Stevens, M. Swany, A. Szalay, W. Tang, G. Varoquaux, J.-P. Vilotte, R. Wisniewski, Z. Xu, and I. Zacharov, "Big Data and Extreme-scale Computing: Pathways to Convergence-toward a Shaping Strategy for a Future Software and Data Ecosystem for Scientific Inquiry," The International Journal of High Performance Computing Applications, vol. 32, no. 4, pp. 435--479, 2018.Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. "NVIDIA Tensor Cores," https://www.nvidia.com/en-us/data-center/tensorcore/, 2019, [Online; accessed June 2019].Google ScholarGoogle Scholar
  28. "Google Tensor Processing Unit (TPU)," https://cloud.google.com/tpu/, 2019, [Online; accessed June 2019].Google ScholarGoogle Scholar
  29. W. Joubert, D. Weighill, D. Kainer, S. Climer, A. Justice, K. Fagnan, and D. Jacobson, "Attacking the Opioid Epidemic: Determining the Epistatic and Pleiotropic Genetic Architectures for Chronic Pain and Opioid Addiction," in SC18: International Conference for High Performance Computing, Networking, Storage and Analysis. IEEE, 2018, pp. 717--730.Google ScholarGoogle Scholar
  30. K. Ando, R. Bale, C. Li, S. Matsuoka, K. Onishi, and M. Tsubokura, "Digital Transformation of Droplet/Aerosol Infection Risk Assessment Realized on" Fugaku" for the Fight Against COVID-19," arXiv preprint arXiv:2110.09769, 2021.Google ScholarGoogle Scholar
  31. Y. Liu, X. Liu, F. Li, H. Fu, Y. Yang, J. Song, P. Zhao, Z. Wang, D. Peng, H. Chen et al., "Closing the Quantum Supremacy gap: Achieving Realtime Simulation of a Random Quantum Circuit Using a new Sunway Supercomputer," in Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, 2021, pp. 1--12.Google ScholarGoogle Scholar
  32. Q. Cao, Y. Pei, K. Akbudak, A. Mikhalev, G. Bosilca, H. Ltaief, D. Keyes, and J. Dongarra, "Extreme-scale Task-based Cholesky Factorization Toward Climate and Weather Prediction Applications," in Proceedings of the Platform for Advanced Scientific Computing Conference (PASC), 2020, pp. 1--11.Google ScholarGoogle Scholar
  33. A. Danalis, G. Bosilca, A. Bouteiller, T. Herault, and J. Dongarra, "PTG: an Abstraction for Unhindered Parallelism," in 2014 Fourth International Workshop on Domain-Specific Languages and High-Level Frameworks for High Performance Computing. IEEE, 2014, pp. 21--30.Google ScholarGoogle Scholar
  34. Q. Cao, Y. Pei, T. Herault, K. Akbudak, A. Mikhalev, G. Bosilca, H. Ltaief, D. Keyes, and J. Dongarra, "Performance Analysis of Tile Low-Rank Cholesky Factorization Using PaRSEC Instrumentation Tools," in IEEE/ACM International Workshop on Programming and Performance Visualization Tools (ProTools) at SC19, 2019, pp. 25--32.Google ScholarGoogle Scholar
  35. Q. Cao, Y. Pei, K. Akbudak, G. Bosilca, H. Ltaief, D. Keyes, and J. Dongarra, "Leveraging PaRSEC Runtime Support to Tackle Challenging 3D Data-Sparse Matrix Problems," in 2021 IEEE International Parallel and Distributed Processing Symposium (IPDPS). IEEE, 2021, pp. 79--89.Google ScholarGoogle Scholar
  36. Q. Cao, R. Alomairy, Y. Pei, G. Bosilca, H. Ltaief, D. Keyes, and J. Dongarra, "A Framework to Exploit Data Sparsity in Tile Low-Rank Cholesky Factorization," in 36th IEEE International Parallel & Distributed Processing Symposium (IPDPS), 2022.Google ScholarGoogle Scholar
  37. N. W. Chaney, P. Metcalfe, and E. F. Wood, "Hydroblocks: a field-scale resolving land surface model for application over continental extents," Hydrological Processes, vol. 30, no. 20, pp. 3543--3559, 2016.Google ScholarGoogle ScholarCross RefCross Ref
  38. A. McNally, J. Jacob, K. Arsenault, K. Slinski, D. Sarmiento, A. Hoell, S. Pervez, J. Rowland, M. Budde, S. Kumar et al., "A hydrologic monitoring dataset for food and water security applications in central asia," Earth System Science Data Discussions, pp. 1--32, 2021.Google ScholarGoogle Scholar
  39. N. J. Higham and T. Mary, "Mixed precision algorithms in numerical linear algebra," Acta Numerica, vol. 31, pp. 347--414, 2022.Google ScholarGoogle ScholarCross RefCross Ref
  40. M. L. O. Salvana, S. Abdulah, H. Ltaief, Y. Sun, M. G. Genton, and D. Keyes, "Parallel Space-Time Likelihood Optimization for Air Pollution Prediction on Large-Scale Systems," in Proceedings of the Platform for Advanced Scientific Computing Conference (PASC), 2022.Google ScholarGoogle Scholar
  41. J. Kennedy and R. Eberhart, "Particle swarm optimization," in Proceedings of ICNN'95 - International Conference on Neural Networks, vol. 4. IEEE, 1995, pp. 1942--1948.Google ScholarGoogle Scholar

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Article Metrics

    • Downloads (Last 12 months)71
    • Downloads (Last 6 weeks)20

    Other Metrics

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader
About Cookies On This Site

We use cookies to ensure that we give you the best experience on our website.

Learn more

Got it!