Javascript must be enabled to continue!
Advancing Geophysical Data Analysis: HEALML for Efficient Sphere-Based Statistics on Pangeo-EOSC
View through CrossRef
A significant challenge in data integration and ML methodologies on cloud infrastructures is accurately determining correlated statistics. Initially, aligning data to a consistent pixel grid is essential, motivating the use of Discrete Global Grid Systems (DGGS). In geophysical studies, data reside on a sphere, and approximating with tangent planes can distort results. Our solution is the HEALPix pixelization as our DGGS framework, standardizing data on a common grid for consistent statistical analysis. HEALPix's unique features, such as its iso-latitude layout and uniform pixel areas, enable the use of spin-weighted spherical harmonics in managing vector fields. This enables the accurate calculation of  correlation statistics, such as between velocity and scalar fields on the sphere, while minimizing biases due to spherical approximations. By utilizing the HEALPix framework, known in cosmology, with TensorFlow or PyTorch as backends, we created the: HEALML library. This library facilitates gradient computations of all derived statistics for AI optimization, and has been validated on the Pangeo-EOSC platform. This library parallelizes the computation of localized spherical harmonics and includes features like scattering covariance calculations, allowing the extraction of more complex nonlinear statistics beyond the power spectrum. We compare these results to traditional 2D planar methods, demonstrating the advantages of sphere-based statistics on platforms like Pangeo-EOSC. Furthermore, we demonstrate: HEALML's ability to emulate using a substantially smaller dataset. This demonstration emphasizes the ways in which incorporating spherical statistical methods into Pangeo-EOSC fosters innovative and efficient statistical analysis within geophysical research.
Title: Advancing Geophysical Data Analysis: HEALML for Efficient Sphere-Based Statistics on Pangeo-EOSC
Description:
A significant challenge in data integration and ML methodologies on cloud infrastructures is accurately determining correlated statistics.
Initially, aligning data to a consistent pixel grid is essential, motivating the use of Discrete Global Grid Systems (DGGS).
In geophysical studies, data reside on a sphere, and approximating with tangent planes can distort results.
Our solution is the HEALPix pixelization as our DGGS framework, standardizing data on a common grid for consistent statistical analysis.
HEALPix's unique features, such as its iso-latitude layout and uniform pixel areas, enable the use of spin-weighted spherical harmonics in managing vector fields.
This enables the accurate calculation of  correlation statistics, such as between velocity and scalar fields on the sphere, while minimizing biases due to spherical approximations.
By utilizing the HEALPix framework, known in cosmology, with TensorFlow or PyTorch as backends, we created the: HEALML library.
This library facilitates gradient computations of all derived statistics for AI optimization, and has been validated on the Pangeo-EOSC platform.
This library parallelizes the computation of localized spherical harmonics and includes features like scattering covariance calculations, allowing the extraction of more complex nonlinear statistics beyond the power spectrum.
We compare these results to traditional 2D planar methods, demonstrating the advantages of sphere-based statistics on platforms like Pangeo-EOSC.
Furthermore, we demonstrate: HEALML's ability to emulate using a substantially smaller dataset.
This demonstration emphasizes the ways in which incorporating spherical statistical methods into Pangeo-EOSC fosters innovative and efficient statistical analysis within geophysical research.
Related Results
Pangeo for everyone with Galaxy
Pangeo for everyone with Galaxy
<p>Pangeo has been deployed on a number of diverse infrastructures and learning resources are available with for instance the Pangeo Tutorial Gallery (http://gallery....
Enhancing Pangeo-Fish with HEALPix Convolution: Impact Evaluation and Benefits
Enhancing Pangeo-Fish with HEALPix Convolution: Impact Evaluation and Benefits
The Pangeo-Fish project processes biologging data to analyze fish movement and migration patterns.  While SciPy’s convolution methods are robust, they are not op...
Predictors of Statistics Anxiety Among Graduate Students in Saudi Arabia
Predictors of Statistics Anxiety Among Graduate Students in Saudi Arabia
Problem The problem addressed in this study is the anxiety experienced by graduate students toward statistics courses, which often causes students to delay taking statistics cours...
ELIXIR-Italy Laniakea: results and future perspectives
ELIXIR-Italy Laniakea: results and future perspectives
ELIXIR-Italy led the development of Laniakea [1-3], a software framework that facilitates the provisioning of on-demand Galaxy instances as a cloud service over e-infrastructures. ...
Pangeo for geolocating fish using biologging data
Pangeo for geolocating fish using biologging data
<p>In biologging, a small device attached to an animal is used to track its behaviour and environment. This data enables biologists to gain a better understanding of ...
Frequency of Common Chromosomal Abnormalities in Patients with Idiopathic Acquired Aplastic Anemia
Frequency of Common Chromosomal Abnormalities in Patients with Idiopathic Acquired Aplastic Anemia
Objective: To determine the frequency of common chromosomal aberrations in local population idiopathic determine the frequency of common chromosomal aberrations in local population...
VESPA-Cloud
VESPA-Cloud
VESPA (Virtual European Solar and Planetary Access, Erard et al. EPSC2020-190, 2020) is a network of interoperable data services covering all fields of Solar System Sciences. It is...
FAIR Digital Objects in Official Statistics
FAIR Digital Objects in Official Statistics
Introduction*1
Statistical offices on national and international scale provide statistics on demography, labour, income, society, economy, environment and othe...

