Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Architectural support for high-performing hardware transactional memory systems

View through CrossRef
Parallel programming presents an efficient solution to exploit future multicore processors. Unfortunately, traditional programming models depend on programmer’s skills for synchronizing concurrent threads, which makes the development of parallel software a hard and errorprone task. In addition to this, current synchronization techniques serialize the execution of those critical sections that conflict in shared memory and thus limit the scalability of multithreaded applications. Transactional Memory (TM) has emerged as a promising programming model that solves the trade-off between high performance and ease of use. In TM, the system is in charge of scheduling transactions (atomic blocks of instructions) and guaranteeing that they are executed in isolation, which simplifies writing parallel code and, at the same time, enables high concurrency when atomic regions access different data. Among all forms of TM environments, Hardware TM (HTM) systems is the only one that offers fast execution at the cost of adding dedicated logic in the processor. Existing HTMsystems suffer considerable delays when they execute complex transactional workloads, especially when they deal with large and contending transactions because they lack adaptability. Furthermore, most HTM implementations are ad hoc and require cumbersome hardware structures to be effective, which complicates the feasibility of the design. This thesis makes several contributions in the design and analysis of low-cost HTMsystems that yield good performance for any kind of TM program. Our first contribution, FASTM, introduces a novel mechanism to elegantly manage speculative (and already validated) versions of transactional data by slightly modifying on-chip memory engine. This approach permits fast recovery when a transaction that fits in private caches is discarded. At the same time, it keeps non-speculative values in software, which allows in-place x memory updates. Thus, FASTM is not hurt from capacity issues nor slows down when it has to undo transactional modifications. Our second contribution includes two different HTM systems that integrate deferred resolution of conflicts in a conventional multicore processor, which reduces the complexity of the system with respect to previous proposals. The first one, FUSETM, combines different-mode transactions under a unified infrastructure to gracefully handle resource overflow. As a result, FUSETM brings fast transactional computation without requiring additional hardware nor extra communication at the end of speculative execution. The second one, SPECTM, introduces a two-level data versioning mechanism to resolve conflicts in a speculative fashion even in the case of overflow. Our third and last contribution presents a couple of truly flexible HTM systems that can dynamically adapt their underlying mechanisms according to the characteristics of the program. DYNTM records statistics of previously executed transactions to select the best-suited strategy each time a new instance of a transaction starts. SWAPTM takes a different approach: it tracks information of the current transactional instance to change its priority level at runtime. Both alternatives obtain great performance over existing proposals that employ fixed transactional policies, especially in applications with phase changes.
Universitat Politècnica de Catalunya
Title: Architectural support for high-performing hardware transactional memory systems
Description:
Parallel programming presents an efficient solution to exploit future multicore processors.
Unfortunately, traditional programming models depend on programmer’s skills for synchronizing concurrent threads, which makes the development of parallel software a hard and errorprone task.
In addition to this, current synchronization techniques serialize the execution of those critical sections that conflict in shared memory and thus limit the scalability of multithreaded applications.
Transactional Memory (TM) has emerged as a promising programming model that solves the trade-off between high performance and ease of use.
In TM, the system is in charge of scheduling transactions (atomic blocks of instructions) and guaranteeing that they are executed in isolation, which simplifies writing parallel code and, at the same time, enables high concurrency when atomic regions access different data.
Among all forms of TM environments, Hardware TM (HTM) systems is the only one that offers fast execution at the cost of adding dedicated logic in the processor.
Existing HTMsystems suffer considerable delays when they execute complex transactional workloads, especially when they deal with large and contending transactions because they lack adaptability.
Furthermore, most HTM implementations are ad hoc and require cumbersome hardware structures to be effective, which complicates the feasibility of the design.
This thesis makes several contributions in the design and analysis of low-cost HTMsystems that yield good performance for any kind of TM program.
Our first contribution, FASTM, introduces a novel mechanism to elegantly manage speculative (and already validated) versions of transactional data by slightly modifying on-chip memory engine.
This approach permits fast recovery when a transaction that fits in private caches is discarded.
At the same time, it keeps non-speculative values in software, which allows in-place x memory updates.
Thus, FASTM is not hurt from capacity issues nor slows down when it has to undo transactional modifications.
Our second contribution includes two different HTM systems that integrate deferred resolution of conflicts in a conventional multicore processor, which reduces the complexity of the system with respect to previous proposals.
The first one, FUSETM, combines different-mode transactions under a unified infrastructure to gracefully handle resource overflow.
As a result, FUSETM brings fast transactional computation without requiring additional hardware nor extra communication at the end of speculative execution.
The second one, SPECTM, introduces a two-level data versioning mechanism to resolve conflicts in a speculative fashion even in the case of overflow.
Our third and last contribution presents a couple of truly flexible HTM systems that can dynamically adapt their underlying mechanisms according to the characteristics of the program.
DYNTM records statistics of previously executed transactions to select the best-suited strategy each time a new instance of a transaction starts.
SWAPTM takes a different approach: it tracks information of the current transactional instance to change its priority level at runtime.
Both alternatives obtain great performance over existing proposals that employ fixed transactional policies, especially in applications with phase changes.

Related Results

Performance simulation methodologies for hardware/software co-designed processors
Performance simulation methodologies for hardware/software co-designed processors
Recently the community started looking into Hardware/Software (HW/SW) co-designed processors as potential solutions to move towards the less power consuming and the less complex de...
Virtualizable hardware/software design infrastructure for dynamically partially reconfigurable systems
Virtualizable hardware/software design infrastructure for dynamically partially reconfigurable systems
In most existing works, reconfigurable hardware modules are still managed as conventional hardware devices. Further, the software reconfiguration overhead incurred by loading corre...
Management and Transactional Analysis Approach in Group Counseling
Management and Transactional Analysis Approach in Group Counseling
The purpose of this study is to explore the application of Transactional Analysis in the context of group counseling. Transactional Analysis, developed by Eric Berne, is a psycholo...
The Observatory
The Observatory
<p><b>This thesis investigation looks at how transformative heritage stories linked to abandoned architectural sites can be reawakened through an allegorical architectu...
Transactional Memory without Hardware Support: A Comprehensive Software-Driven Approach
Transactional Memory without Hardware Support: A Comprehensive Software-Driven Approach
Transactional memory (TM) simplifies concurrent programming by enabling code blocks to execute in isolation, eliminating the complexities of locks and ensuring data con- sistency. ...
NO moral qualms about transactional sex? Materialism and objectification
NO moral qualms about transactional sex? Materialism and objectification
I explored the psychological mechanism underlying the exchanging of sex for tangible benefits such as drugs, money, and career opportunities, that is, transactional sex. I conducte...
Publishing as architectural practice
Publishing as architectural practice
Abstract Appropriating its title from the recent collection of essays, Publishing as Artistic Practice (ed. Annette Gilbert, Sternberg Press, 2016) – in which publis...
The City of Ladies
The City of Ladies
<p><strong>In The Book of the City of Ladies, 15th century medieval author and first-person narrator Christine de Pizan describes constructing an imaginary city, a utop...

Back to Top