Javascript must be enabled to continue!
Fine Grain Algorithm Parallelization on a Hybrid Control-flow and Dataflow Processor
View through CrossRef
Abstract
The execution time of a high performance computing algorithm depends on multiple factors: the algorithm scalability, the chosen hardware, the communication speed between processing elements, etc. This work is based on a hybrid processor consisting of both the control-flow and dataflow hardware, where the control-flow hardware includes manycore architecture. The algorithm decomposition into portions suitable for different architectures is presented. The results are presented by decomposing the Lattice-Boltzmann method implemented for both control-flow and dataflow hardware. The total acceleration factor of the decomposed Lattice-Boltzmann method comparing to the execution time using the control-flow and the dataflow hardware is obtained by the analysis based on the speed of the communication between both hardware equal to the speed of shared cache memories. Results indicate the advantage of using the proposed hybrid architecture and the capability of accelerating considerably even the suitable for dataflow architectures. The analysis of the problem size needed to justify the utilization of control-flow hardware is also presented. The main benefit is in accelerating algorithms for which only some algorithm portions are suitable for the dataflow hardware.
Title: Fine Grain Algorithm Parallelization on a Hybrid Control-flow and Dataflow Processor
Description:
Abstract
The execution time of a high performance computing algorithm depends on multiple factors: the algorithm scalability, the chosen hardware, the communication speed between processing elements, etc.
This work is based on a hybrid processor consisting of both the control-flow and dataflow hardware, where the control-flow hardware includes manycore architecture.
The algorithm decomposition into portions suitable for different architectures is presented.
The results are presented by decomposing the Lattice-Boltzmann method implemented for both control-flow and dataflow hardware.
The total acceleration factor of the decomposed Lattice-Boltzmann method comparing to the execution time using the control-flow and the dataflow hardware is obtained by the analysis based on the speed of the communication between both hardware equal to the speed of shared cache memories.
Results indicate the advantage of using the proposed hybrid architecture and the capability of accelerating considerably even the suitable for dataflow architectures.
The analysis of the problem size needed to justify the utilization of control-flow hardware is also presented.
The main benefit is in accelerating algorithms for which only some algorithm portions are suitable for the dataflow hardware.
Related Results
HAD:A Prototype Of Dataflow Compute Architecture
HAD:A Prototype Of Dataflow Compute Architecture
Abstract
To investigate the features, implementation, and applications of data flow architecture, a novel dataflow computing system, HAD (Hardware Accelerated Dataflow), is...
Software synthesis from dataflow schedule graphs
Software synthesis from dataflow schedule graphs
AbstractThe dataflow-model of computation is widely used in design and implementation of signal processing systems. In dataflow-based design processes, scheduling—the assignment an...
DataFlow based- Automatic parallelization of MATLAB/Simulink models for Fitting Modern Multicore Architectures
DataFlow based- Automatic parallelization of MATLAB/Simulink models for Fitting Modern Multicore Architectures
Abstract
In many fields including aerospace, automotive, and telecommunications, Math-Works’ MATLAB/Simulink is the current de facto standard for model-based design. The st...
Identification of Potential QTLs Related to Grain Size in Rice
Identification of Potential QTLs Related to Grain Size in Rice
Rice is a major crop, providing calories and food for most of the world’s population. Currently, the global population is rapidly increasing, and securing a yield of rice that can ...
Grain size evolution and heat transfer regime in the shells of icy moons
Grain size evolution and heat transfer regime in the shells of icy moons
IntroductionTogether with the ice shell thickness, grain size due to its effect on viscosity is perhaps the most crucial parameter determining the heat transfer regime inside the ...
Grain growth of ice doped with soluble impurities
Grain growth of ice doped with soluble impurities
Abstract. The grain size of polycrystalline ice affects key parameters related to the dynamics of ice masses, such as the rheological and dielectric properties of terrestrial ice f...
Modeling Hybrid Metaheuristic Optimization Algorithm for Convergence Prediction
Modeling Hybrid Metaheuristic Optimization Algorithm for Convergence Prediction
The project aims at the design and development of six hybrid nature inspired algorithms based on Grey Wolf Optimization algorithm with Artificial Bee Colony Optimization algorithm ...
Modeling Hybrid Metaheuristic Optimization Algorithm for Convergence Prediction
Modeling Hybrid Metaheuristic Optimization Algorithm for Convergence Prediction
The project aims at the design and development of six hybrid nature inspired algorithms based on Grey Wolf Optimization algorithm with Artificial Bee Colony Optimization algorithm ...

