Advancing Geophysical Data Analysis: HEALML for Efficient Sphere-Based Statistics on Pangeo-EOSC
A significant challenge in data integration and ML methodologies on cloud infrastructures is accurately determining correlated statistics. As a first step, data must be aligned to a consistent pixel grid, which motivates the use of Discrete Global Grid Systems (DGGS). In geophysical studies, data reside on a sphere, and approximating the sphere with tangent planes can distort results. Our solution adopts the HEALPix pixelization as the DGGS framework, standardizing data on a common grid for consistent statistical analysis. HEALPix's features, such as its iso-latitude layout and uniform pixel areas, enable the use of spin-weighted spherical harmonics for handling vector fields. This allows accurate calculation of correlation statistics, for example between velocity and scalar fields on the sphere, while minimizing biases due to spherical approximations. Building on the HEALPix framework, which is well known in cosmology, with TensorFlow or PyTorch as backends, we created the HEALML library. The library supports gradient computation for all derived statistics, for use in AI optimization, and has been validated on the Pangeo-EOSC platform. It parallelizes the computation of localized spherical harmonics and includes features such as scattering covariance calculations, allowing the extraction of more complex nonlinear statistics beyond the power spectrum. We compare these results to traditional 2D planar methods, demonstrating the advantages of sphere-based statistics on platforms like Pangeo-EOSC. Furthermore, we demonstrate HEALML's ability to perform emulation using a substantially smaller dataset. This demonstration highlights how incorporating spherical statistical methods into Pangeo-EOSC fosters innovative and efficient statistical analysis in geophysical research.
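Two quantities behind the argument above can be sketched with standard-library Python. This is an illustration only and does not use or reproduce the HEALML API. The HEALPix relations (12·nside² equal-area pixels, centres on 4·nside − 1 iso-latitude rings) are standard properties of the pixelization. The distortion function assumes that "tangent plane" means a gnomonic projection, which is an assumption on our part, and all function names are hypothetical.

```python
import math


def healpix_npix(nside: int) -> int:
    """Number of pixels in a HEALPix map with resolution parameter nside."""
    return 12 * nside * nside


def healpix_pixel_area(nside: int) -> float:
    """Solid angle of each pixel in steradians (identical for all pixels)."""
    return 4.0 * math.pi / healpix_npix(nside)


def healpix_nrings(nside: int) -> int:
    """Number of iso-latitude rings on which the pixel centres lie."""
    return 4 * nside - 1


def gnomonic_area_inflation(theta: float) -> float:
    """Local area magnification of a gnomonic (tangent-plane) projection.

    theta is the angular distance in radians from the tangent point.
    The radial scale is sec^2(theta) and the tangential scale is
    sec(theta), so areas are inflated by sec^3(theta).
    Assumption: the tangent-plane approximation is a gnomonic projection.
    """
    return 1.0 / math.cos(theta) ** 3


if __name__ == "__main__":
    # Equal-area pixels: the pixel size depends only on nside.
    for nside in (64, 256):
        area = healpix_pixel_area(nside)
        side_deg = math.degrees(math.sqrt(area))
        print(f"nside={nside}: npix={healpix_npix(nside)}, "
              f"rings={healpix_nrings(nside)}, "
              f"approx. pixel side={side_deg:.3f} deg")

    # Tangent-plane pixels: the effective area grows away from the centre.
    for deg in (5, 20, 45, 60):
        factor = gnomonic_area_inflation(math.radians(deg))
        print(f"{deg} deg from tangent point: area inflated x{factor:.3f}")
```

On a tangent plane, a regular pixel grid covers less and less of the sphere per pixel as one moves away from the tangent point: the inflation factor is sec³θ, which reaches 8 at 60°. On the HEALPix grid the per-pixel solid angle is constant, so statistics computed over pixels weight every part of the sphere equally.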