Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

R-GPU

View through CrossRef
Over the last decade, Graphics Processing Unit (GPU) architectures have evolved from a fixed-function graphics pipeline to a programmable, energy-efficient compute accelerator for massively parallel applications. The compute power arises from the GPU’s Single Instruction/Multiple Threads architecture: concurrently running many threads and executing them as Single Instruction/Multiple Data--style vectors. However, compute power is still lost due to cycles spent on data movement and control instructions instead of data computations. Even more cycles are lost on pipeline stalls resulting from long latency (memory) operations. To improve not only performance but also energy efficiency, we introduce R-GPU: a reconfigurable GPU architecture with communicating cores. R-GPU is an addition to a GPU, which can still be used as such, but also has the ability to reorganize the cores of a GPU in a reconfigurable network. In R-GPU data movement and control is implicit in the configuration of the network. Each core executes a fixed instruction, reducing instruction decode count and increasing energy efficiency. On a number of benchmarks we show an average performance improvement of 2.1 × over the same GPU without modifications. We further make a conservative power estimation of R-GPU which shows that power consumption can be reduced by 6%, leading to an energy consumption reduction of 55%, while area only increases by a mere 4%.
Title: R-GPU
Description:
Over the last decade, Graphics Processing Unit (GPU) architectures have evolved from a fixed-function graphics pipeline to a programmable, energy-efficient compute accelerator for massively parallel applications.
The compute power arises from the GPU’s Single Instruction/Multiple Threads architecture: concurrently running many threads and executing them as Single Instruction/Multiple Data--style vectors.
However, compute power is still lost due to cycles spent on data movement and control instructions instead of data computations.
Even more cycles are lost on pipeline stalls resulting from long latency (memory) operations.
To improve not only performance but also energy efficiency, we introduce R-GPU: a reconfigurable GPU architecture with communicating cores.
R-GPU is an addition to a GPU, which can still be used as such, but also has the ability to reorganize the cores of a GPU in a reconfigurable network.
In R-GPU data movement and control is implicit in the configuration of the network.
Each core executes a fixed instruction, reducing instruction decode count and increasing energy efficiency.
On a number of benchmarks we show an average performance improvement of 2.
1 × over the same GPU without modifications.
We further make a conservative power estimation of R-GPU which shows that power consumption can be reduced by 6%, leading to an energy consumption reduction of 55%, while area only increases by a mere 4%.

Related Results

Parallel metaheuristics on GPU
Parallel metaheuristics on GPU
Métaheuristiques parallèles sur GPU Les problèmes d'optimisation issus du monde réel sont souvent complexes et NP-difficiles. Leur modélisation est en constante évo...
Heat transfer in supercritical fluids: computational approaches & studies
Heat transfer in supercritical fluids: computational approaches & studies
(English) This thesis delves into investigating the complexities of heat transfer in supercritical fluids through the application of advanced theoretical and computational methodol...
Vina-GPU 2.1: towards further optimizing docking speed and precision of AutoDock Vina and its derivatives
Vina-GPU 2.1: towards further optimizing docking speed and precision of AutoDock Vina and its derivatives
Abstract AutoDock Vina and its derivatives have established themselves as a prevailing pipeline for virtual screening in contemporary drug discov...
Parallel Monte Carlo Tree Search on GPU
Parallel Monte Carlo Tree Search on GPU
Monte Carlo Tree Search (MCTS) is a method for making optimal decisions in artificial intelligence (AI) problems, typically move planning in combinatorial games. It combines the ge...
Pembuatan System GPU (Ground Power Unit) Untuk Alat Peraga Engine Doryz-95
Pembuatan System GPU (Ground Power Unit) Untuk Alat Peraga Engine Doryz-95
Seiring dengan berkembangnya industri kedirgantaraan di sekala internasional maupun nasional. Begitu banyak sekali aspek penting yang dapat diketahui terlebih ...
Vina-GPU 2.0:further accelerating AutoDock Vina and its derivatives with GPUs
Vina-GPU 2.0:further accelerating AutoDock Vina and its derivatives with GPUs
Modern drug discovery typically faces large virtual screens from huge compound databases where multiple docking tools are involved for meeting various real scenes or improving the ...
GPU-I-TASSER: a GPU accelerated I-TASSER protein structure prediction tool
GPU-I-TASSER: a GPU accelerated I-TASSER protein structure prediction tool
Abstract Motivation Accurate and efficient predictions of protein structures play an important role in understanding their funct...
Advancing Computational Intelligence: AI-Based Algorithm Design and Optimization in Programming
Advancing Computational Intelligence: AI-Based Algorithm Design and Optimization in Programming
The research explores AI technique implementations in algorithm optimization and design frameworks to understand their crucial impact on programming challenges and efficiency incre...

Back to Top