An Architecture-Aware Heterogeneous Multigrid Solver for Geodynamic Simulations on the New-Generation Tianhe Supercomputer
Large-scale mantle convection simulations repeatedly solve sparse velocity-pressure systems, and the multigrid velocity solver often dominates the total runtime. This paper presents an MT-3000-oriented heterogeneous multigrid solver for CitcomCU on the new-generation Tianhe supercomputer. The solver keeps the original governing equations, finite-element discretization, Uzawa iteration, and geometric multigrid framework, while redesigning the dominant kernels and data movement for the explicitly managed memory hierarchy of MT-3000. The central design is an 8-color block Gauss-Seidel smoother co-designed with ELL-like node-based storage, topology-derived neighbor indexing, AM-aware row-wise streaming, ping-pong double buffering, host-accelerator collaboration, and communication-computation overlap. These techniques expose block-level parallelism, improve locality, and reduce avoidable DMA traffic without increasing the number of multigrid cycles in the tested cases. For the Busse1993 benchmark, the optimized solver remains consistent with the original CitcomCU implementation: after 4000 time steps, the full-domain relative L2 errors are 0.47% for temperature and 2.27% for the three-component velocity vector. Compared with the unmodified production CPU-based CitcomCU baseline, the complete heterogeneous execution mode achieves system-level speedups of up to 4.91x in average multigrid-cycle time and 4.71x in average complete-Uzawa-solve time per simulation step. Weak scaling maintains 59.54% efficiency on 12,800 processes, demonstrating the effectiveness of MT-3000-specific algorithmic and data-movement redesign for matrix-based geodynamic multigrid solvers.
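To make the multicolor smoothing idea concrete, the following is a minimal sketch of an 8-color Gauss-Seidel sweep on a structured 3D grid, not the solver described in the abstract. It rests on several assumptions that do not come from the paper: a scalar unknown per node instead of a three-component velocity block, a model 27-point operator (diagonal 26, off-diagonal -1 toward each of the 26 neighbors, zero Dirichlet halo) instead of the CitcomCU stiffness matrix, and NumPy on a single host. The names `neighbor_sum`, `residual_norm`, and `eight_color_sweep` are hypothetical. Colors are assigned by the parity of the three node indices, which is one common way to obtain 8 colors: two nodes of the same color differ by an even offset in every direction, so they are never neighbors and can be updated together.

```python
import itertools
import numpy as np

# All 26 neighbour offsets of a node in a 27-point stencil.
OFFSETS = [d for d in itertools.product((-1, 0, 1), repeat=3) if d != (0, 0, 0)]
DIAG = 26.0  # assumed model operator: (A u)_i = 26*u_i - sum of the 26 neighbours


def neighbor_sum(padded):
    """Sum of the 26 neighbours of every interior node of a zero-padded array."""
    nx, ny, nz = (s - 2 for s in padded.shape)
    total = np.zeros((nx, ny, nz))
    for di, dj, dk in OFFSETS:
        total += padded[1 + di:1 + di + nx, 1 + dj:1 + dj + ny, 1 + dk:1 + dk + nz]
    return total


def residual_norm(u, f):
    """Euclidean norm of f - A u for the assumed model operator."""
    padded = np.pad(u, 1)  # zero halo acts as a homogeneous Dirichlet boundary
    return np.linalg.norm(f - (DIAG * u - neighbor_sum(padded)))


def eight_color_sweep(u, f):
    """One Gauss-Seidel sweep that visits the 8 parity colours in turn."""
    padded = np.pad(u, 1)
    interior = padded[1:-1, 1:-1, 1:-1]  # a view: writes land in `padded`
    for ci, cj, ck in itertools.product((0, 1), repeat=3):
        sel = (slice(ci, None, 2), slice(cj, None, 2), slice(ck, None, 2))
        # Recomputing the full neighbour sum is wasteful but keeps the sketch short.
        # It uses the latest values, so earlier colours feed into later ones.
        s = neighbor_sum(padded)
        # Same-colour nodes are never neighbours, so they update simultaneously.
        interior[sel] = (f[sel] + s[sel]) / DIAG
    return interior.copy()


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    f = rng.standard_normal((8, 8, 8))
    u = np.zeros_like(f)
    print("initial residual:", residual_norm(u, f))
    for _ in range(20):
        u = eight_color_sweep(u, f)
    print("after 20 sweeps: ", residual_norm(u, f))
```

Within one color, every update reads only values of other colors, which is what allows a whole color to be processed in parallel. A block variant would replace the scalar division by a small per-node solve.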
Related Results
The architecture of differences
Following in the footsteps of the protagonists of the Italian architectural debate is a mark of culture and proactivity. The synthesis deriving from the artistic-humanistic factors...
Large-Scale Heterogeneous Computing for 3D Deterministic Particle Transport on Tianhe-2A Supercomputer
Scalable parallel algorithm for particle transport is one of the main application fields in high-performance computing. Discrete ordinate method (Sn) is one of the most popular det...
Radiation and ablation coupling applied to the study of the Lost City bolide.
Introduction: Meteor phenomena involve a series of complex aspects, from multiphase physics of the meteoroid (melting and evaporation) to non-equilibrium effects within the flow. The ...
Parallelizing and optimizing large‐scale 3D multi‐phase flow simulations on the Tianhe‐2 supercomputer
Summary: The lattice Boltzmann method (LBM) is a widely used computational fluid dynamics method for flow problems with complex geometries and various boundary conditions. Large‐scal...
Performance‐influence models of multigrid methods: A case study on triangular grids
Summary: Multigrid methods are among the most efficient algorithms for solving discretized partial differential equations. Typically, a multigrid system offers various configuration ...
A Modified Multilevel Scheme for Internal and External Constraints in Virtual Environments
Multigrid algorithms are gaining popularity in virtual reality simulations as they have a theoretically optimal performance that scales linearly with the number of degrees of freed...
Multigrid solvers for isogeometric discretizations of the second biharmonic problem
We develop a multigrid solver for the second biharmonic problem in the context of Isogeometric Analysis (IgA), where we also allow a zero-order term. In a previous paper, the autho...
On Classification of the Earth's Crust Areas by the Level of Geodynamic Threat
Abstract: It is accepted as a well-known fact that a similar anthropogenic impact on the Earth's crust in different places causes dissimilar response. Seismic zoning maps ar...

