Javascript must be enabled to continue!
Large-Scale Heterogeneous Computing for 3D Deterministic Particle Transport on Tianhe-2A Supercomputer
View through CrossRef
Scalable parallel algorithm for particle transport is one of the main application fields in high-performance computing. Discrete ordinate method (Sn) is one of the most popular deterministic numerical methods for solving particle transport equations. In this paper, we introduce a new method of large-scale heterogeneous computing of one energy group time-independent deterministic discrete ordinates neutron transport in 3D Cartesian geometry (Sweep3D) on Tianhe-2A supercomputer. In heterogeneous programming, we use customized Basic Communication Library (BCL) and Accelerated Computing Library (ACL) to control and communicate between CPU and the Matrix2000 accelerator. We use OpenMP instructions to exploit the parallelism of threads based on Matrix 2000. The test results show that the optimization of applying OpenMP on particle transport algorithm modified by our method can get 11.3 times acceleration at most. On Tianhe-2A supercomputer, the parallel efficiency of 1.01 million cores compared with 170 thousand cores is 52%.
Title: Large-Scale Heterogeneous Computing for 3D Deterministic Particle Transport on Tianhe-2A Supercomputer
Description:
Scalable parallel algorithm for particle transport is one of the main application fields in high-performance computing.
Discrete ordinate method (Sn) is one of the most popular deterministic numerical methods for solving particle transport equations.
In this paper, we introduce a new method of large-scale heterogeneous computing of one energy group time-independent deterministic discrete ordinates neutron transport in 3D Cartesian geometry (Sweep3D) on Tianhe-2A supercomputer.
In heterogeneous programming, we use customized Basic Communication Library (BCL) and Accelerated Computing Library (ACL) to control and communicate between CPU and the Matrix2000 accelerator.
We use OpenMP instructions to exploit the parallelism of threads based on Matrix 2000.
The test results show that the optimization of applying OpenMP on particle transport algorithm modified by our method can get 11.
3 times acceleration at most.
On Tianhe-2A supercomputer, the parallel efficiency of 1.
01 million cores compared with 170 thousand cores is 52%.
Related Results
Parallelizing and optimizing large‐scale 3D multi‐phase flow simulations on the Tianhe‐2 supercomputer
Parallelizing and optimizing large‐scale 3D multi‐phase flow simulations on the Tianhe‐2 supercomputer
SummaryThe lattice Boltzmann method (LBM) is a widely used computational fluid dynamics method for flow problems with complex geometries and various boundary conditions. Large‐scal...
A simplified Python-based kinematic model of particle transport in rivers
A simplified Python-based kinematic model of particle transport in rivers
We present results from a particle-scale numerical model inspired by the idea that a majority of the time during transport capable floods, bedload transport in rivers is rarefied, ...
Research on Approval of Domestic and International Transport Container Application of Radioactive Material
Research on Approval of Domestic and International Transport Container Application of Radioactive Material
Due to the potentially dangerous properties of radioactive material, it is during the transport that the process of nuclear energy and technology uses are prone to nuclear and radi...
Deploying and scaling distributed parallel deep neural networks on the Tianhe-3 prototype system
Deploying and scaling distributed parallel deep neural networks on the Tianhe-3 prototype system
AbstractDue to the increase in computing power, it is possible to improve the feature extraction and data fitting capabilities of DNN networks by increasing their depth and model c...
Mechanism and stochastic dynamics of transport in Darcy-scale heterogeneous porous media
Mechanism and stochastic dynamics of transport in Darcy-scale heterogeneous porous media
Solute transport in heterogeneous porous media in general exhibits anomalous behaviors, in the sense that it is characterized by features that cannot be explained in terms of trad...
An Architecture-Aware Heterogeneous Multigrid Solver for Geodynamic Simulations on the New-Generation Tianhe Supercomputer
An Architecture-Aware Heterogeneous Multigrid Solver for Geodynamic Simulations on the New-Generation Tianhe Supercomputer
Large-scale mantle convection simulations repeatedly solve sparse velocity-pressure systems, and the multigrid velocity solver often dominates the total runtime. This paper present...
Experimental and numerical investigation into the effect of surface roughness on particle rebound
Experimental and numerical investigation into the effect of surface roughness on particle rebound
Erosion damage and particle deposition are crucial wear phenomena in gas turbine engines. As a result, compressor efficiency decreases, stability margin reduces, and maintenance co...
SYSTEMATIZATION OF THE REGULATORY FRAMEWORK OF ENSURING THE WATER TRANSPORT COMPETITIVENESS IN UKRAINE
SYSTEMATIZATION OF THE REGULATORY FRAMEWORK OF ENSURING THE WATER TRANSPORT COMPETITIVENESS IN UKRAINE
Topicality. Business entities in the field of water transport can gain competitive advantages and ensure their competitiveness through the introduction of innovations into the proc...

