Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

A general tridiagonal solver for coprocessors: Adapting g-Spike for the Intel Xeon Phi

View through CrossRef
Manycores like the Intel Xeon Phi and graphics processing units like the NVIDIA Tesla series are prime examples of systems for accelerating applications that run on current CPU multicores. It is therefore of interest to build fast, reliable linear system solvers targeting these architectures. Moreover, it is of interest to conduct cross comparisons between algorithmic implementations in order to organize the types of optimizations and transformations that are necessary when porting in order to succeed in obtaining performance portability. In this work we aim to present a detailed study of the adaptation and implementation of g-Spike for the Xeon Phi. g-Spike was originally developed to solve general tridiagonal systems on GPUs, on which it returns high performance while also solving systems for which other state-of-the-art general tridiagonal GPU solvers do not succeed. The solver is based on the Spike framework, using QR factorization without pivoting implemented via Givens rotations. We show the necessary adaptations on the Xeon Phi because of the significant differences in the programming models and the underlying architectures as well as the relative performance differences for data access and processing operations.
Title: A general tridiagonal solver for coprocessors: Adapting g-Spike for the Intel Xeon Phi
Description:
Manycores like the Intel Xeon Phi and graphics processing units like the NVIDIA Tesla series are prime examples of systems for accelerating applications that run on current CPU multicores.
It is therefore of interest to build fast, reliable linear system solvers targeting these architectures.
Moreover, it is of interest to conduct cross comparisons between algorithmic implementations in order to organize the types of optimizations and transformations that are necessary when porting in order to succeed in obtaining performance portability.
In this work we aim to present a detailed study of the adaptation and implementation of g-Spike for the Xeon Phi.
g-Spike was originally developed to solve general tridiagonal systems on GPUs, on which it returns high performance while also solving systems for which other state-of-the-art general tridiagonal GPU solvers do not succeed.
The solver is based on the Spike framework, using QR factorization without pivoting implemented via Givens rotations.
We show the necessary adaptations on the Xeon Phi because of the significant differences in the programming models and the underlying architectures as well as the relative performance differences for data access and processing operations.

Related Results

HPC-BLAST: Distributed BLAST for Modern HPC Clusters.
HPC-BLAST: Distributed BLAST for Modern HPC Clusters.
The near exponential growth in sequence data available to bioinformaticists, and the emergence of new fields of biological research, continue to fuel an incessant need for in- crea...
Un manoscritto equivocato del copista santo Theophilos († 1548)
Un manoscritto equivocato del copista santo Theophilos († 1548)
<p><font size="3"><span class="A1"><span style="font-family: 'Times New Roman','serif'">&Epsilon;&Nu;&Alpha; &Lambda;&Alpha;&Nu;&...
Improving decision tree and neural network learning for evolving data-streams
Improving decision tree and neural network learning for evolving data-streams
High-throughput real-time Big Data stream processing requires fast incremental algorithms that keep models consistent with most recent data. In this scenario, Hoeffding Trees are c...
North Syrian Mortaria and Other Late Roman Personal and Utility Objects Bearing Inscriptions of Good Luck
North Syrian Mortaria and Other Late Roman Personal and Utility Objects Bearing Inscriptions of Good Luck
<span style="font-size: 11pt; color: black; font-family: 'Times New Roman','serif'">&Pi;&Eta;&Lambda;&Iota;&Nu;&Alpha; &Iota;&Gamma;&Delta...
Coinheritance Studies of Yield and Yield Related Traits in Wheat (Triticum Aestivum L.) Preliminary Lines
Coinheritance Studies of Yield and Yield Related Traits in Wheat (Triticum Aestivum L.) Preliminary Lines
A field trial was performed to explore the genetic diversity and coheritability among the 64 bread wheat genotypes. These lines/genotypes were analyzed for days to heading, days to...
L᾽«unilinguisme» officiel de Constantinople byzantine (VIIe-XIIe s.)
L᾽«unilinguisme» officiel de Constantinople byzantine (VIIe-XIIe s.)
&nbsp; <p>&Nu;ί&kappa;&omicron;&sigmaf; &Omicron;&iota;&kappa;&omicron;&nu;&omicron;&mu;ί&delta;&eta;&sigmaf;</...
Intracellular pH regulation in rat Schwann cells
Intracellular pH regulation in rat Schwann cells
AbstractWe examined H+ and HCO3− transport mechanisms that are involved in the regulation of intracellular pH of Schwann cells. Primary cultures of Schwann cells were prepared from...
THE ZINC AGRONOMIC BIOFORTIFICATION AND ITS INFLUENCE ON WINTER WHEAT PRODUCTIVITY AND QUALITY
THE ZINC AGRONOMIC BIOFORTIFICATION AND ITS INFLUENCE ON WINTER WHEAT PRODUCTIVITY AND QUALITY
This study evaluates the influence of agronomic biofortification with zinc on the productivity and quality of winter wheat. The large number of people, globally, affected by the nu...

Back to Top