Binary Instrumentation for Scalable Performance Measurement of OpenMP Applications
In this paper we present a binary instrumentation methodology to monitor runtime events. We demonstrate our approach on OpenMP constructs for the Intel and GNU compilers. A binary-level static analysis detects the compiler patterns and the runtime function calls corresponding to OpenMP regions. To this end, we integrate the software tool MAQAO with the scalable measurement infrastructure Score-P. We design a new interface and modify both tools to support the new events. The main advantages of binary instrumentation are that it can retrieve implicit runtime events, instrument without recompilation, remain independent of the source language, and avoid interfering with compiler optimizations. Our validation experiments and first results show that binary instrumentation does not introduce any additional overhead.
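As a rough illustration of what detecting "runtime function calls corresponding to OpenMP regions" can look like, the sketch below scans objdump-style disassembly text for calls into the GNU (libgomp) and Intel/LLVM OpenMP runtimes. It is not MAQAO's analysis and is not taken from the paper: the symbol table, the function name `find_openmp_call_sites`, and the text-based matching are all assumptions chosen for illustration, and a real binary analyzer would work on decoded instructions rather than on printed text.

```python
import re

# Assumed, illustrative table (not from the paper): a few runtime entry
# points that compilers emit for OpenMP constructs.
#   GOMP_*   -> GNU libgomp
#   __kmpc_* -> Intel / LLVM OpenMP runtime
RUNTIME_CALLS = {
    "GOMP_parallel": ("gnu", "parallel"),
    "GOMP_parallel_start": ("gnu", "parallel"),
    "GOMP_barrier": ("gnu", "barrier"),
    "__kmpc_fork_call": ("intel", "parallel"),
    "__kmpc_barrier": ("intel", "barrier"),
}

# Matches an objdump-style call line such as
#   401136:  e8 f5 fe ff ff   call   401030 <GOMP_parallel@plt>
# and captures the instruction address and the target symbol name.
CALL_RE = re.compile(
    r"^\s*([0-9a-f]+):.*\bcallq?\s+[0-9a-f]+\s+"
    r"<([A-Za-z_][\w.]*)(?:@plt)?(?:\+0x[0-9a-f]+)?>"
)


def find_openmp_call_sites(disassembly):
    """Return (address, symbol, runtime, event) for each known runtime call."""
    sites = []
    for line in disassembly.splitlines():
        match = CALL_RE.match(line)
        if match is None:
            continue  # not a direct call instruction
        address, symbol = match.groups()
        if symbol in RUNTIME_CALLS:
            runtime, event = RUNTIME_CALLS[symbol]
            sites.append((int(address, 16), symbol, runtime, event))
    return sites
```

Each returned address is a candidate location where an instrumentation tool could insert enter and exit probes for the corresponding OpenMP event. How MAQAO and Score-P actually exchange these events is defined by the interface described in the paper, not by this sketch.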

