Javascript must be enabled to continue!
Fine Grain Algorithm Parallelization on a Hybrid Control-flow and Dataflow Processor
View through CrossRef
Abstract
The execution time of a high performance computing algorithm depends on multiple factors: the algorithm scalability, the chosen hardware, the communication speed between processing elements, etc. This work is based on a hybrid processor consisting of both the control-flow and dataflow hardware, where the control-flow hardware includes manycore architecture. The algorithm decomposition into portions suitable for different architectures is presented. The results are presented by decomposing the Lattice-Boltzmann method implemented for both control-flow and dataflow hardware. The total acceleration factor of the decomposed Lattice-Boltzmann method comparing to the execution time using the control-flow and the dataflow hardware is obtained by the analysis based on the speed of the communication between both hardware equal to the speed of shared cache memories. Results indicate the advantage of using the proposed hybrid architecture and the capability of accelerating considerably even the suitable for dataflow architectures. The analysis of the problem size needed to justify the utilization of control-flow hardware is also presented. The main benefit is in accelerating algorithms for which only some algorithm portions are suitable for the dataflow hardware.
Title: Fine Grain Algorithm Parallelization on a Hybrid Control-flow and Dataflow Processor
Description:
Abstract
The execution time of a high performance computing algorithm depends on multiple factors: the algorithm scalability, the chosen hardware, the communication speed between processing elements, etc.
This work is based on a hybrid processor consisting of both the control-flow and dataflow hardware, where the control-flow hardware includes manycore architecture.
The algorithm decomposition into portions suitable for different architectures is presented.
The results are presented by decomposing the Lattice-Boltzmann method implemented for both control-flow and dataflow hardware.
The total acceleration factor of the decomposed Lattice-Boltzmann method comparing to the execution time using the control-flow and the dataflow hardware is obtained by the analysis based on the speed of the communication between both hardware equal to the speed of shared cache memories.
Results indicate the advantage of using the proposed hybrid architecture and the capability of accelerating considerably even the suitable for dataflow architectures.
The analysis of the problem size needed to justify the utilization of control-flow hardware is also presented.
The main benefit is in accelerating algorithms for which only some algorithm portions are suitable for the dataflow hardware.
Related Results
Software synthesis from dataflow schedule graphs
Software synthesis from dataflow schedule graphs
AbstractThe dataflow-model of computation is widely used in design and implementation of signal processing systems. In dataflow-based design processes, scheduling—the assignment an...
Grain size evolution and heat transfer regime in the shells of icy moons
Grain size evolution and heat transfer regime in the shells of icy moons
IntroductionTogether with the ice shell thickness, grain size due to its effect on viscosity is perhaps the most crucial parameter determining the heat transfer regime inside the ...
Grain growth of ice doped with soluble impurities
Grain growth of ice doped with soluble impurities
Abstract. The grain size of polycrystalline ice affects key parameters related to the dynamics of ice masses, such as the rheological and dielectric properties of terrestrial ice f...
Modeling Hybrid Metaheuristic Optimization Algorithm for Convergence Prediction
Modeling Hybrid Metaheuristic Optimization Algorithm for Convergence Prediction
The project aims at the design and development of six hybrid nature inspired algorithms based on Grey Wolf Optimization algorithm with Artificial Bee Colony Optimization algorithm ...
Modeling Hybrid Metaheuristic Optimization Algorithm for Convergence Prediction
Modeling Hybrid Metaheuristic Optimization Algorithm for Convergence Prediction
The project aims at the design and development of six hybrid nature inspired algorithms based on Grey Wolf Optimization algorithm with Artificial Bee Colony Optimization algorithm ...
Dataflow Processing in Memory Achieves Significant Energy Efficiency
Dataflow Processing in Memory Achieves Significant Energy Efficiency
The large difference between processor CPU cycle time and memory access time, often referred to as the memory wall, severely limits the performance of streaming applications. Some ...
Multiphase Flow Metering:An Evaluation of Discharge Coefficients
Multiphase Flow Metering:An Evaluation of Discharge Coefficients
Abstract
The orifice discharge coefficient (CD) is the constant required to correct theoretical flow rate to actual flow rate. It is known that single phase orifi...
Experimental and Numerical Analysis of the Flow Field in the Integrated Valve for the Control Rod Hydraulic Drive System
Experimental and Numerical Analysis of the Flow Field in the Integrated Valve for the Control Rod Hydraulic Drive System
Control Rod Hydraulic Drive System (CRHDS) is a new type of built-in control rod drive technology, and the Integrated Valve (IV) is the key control component of it. The pulse water...

