Oasis: Controlling Data Migration in Expansion of Object-based Storage Systems
Object-based storage systems have been widely used for various scenarios such as file storage, block storage, blob (e.g., large videos) storage, and so on, where the data is placed among a large number of object storage devices (OSDs). Data placement is critical for the scalability of decentralized object-based storage systems. The state-of-the-art CRUSH placement method is a decentralized algorithm that deterministically places object replicas onto storage devices without relying on a central directory. While enjoying the benefits of decentralization such as high scalability, robustness, and performance, CRUSH-based storage systems suffer from uncontrolled data migration when expanding the capacity of the storage clusters (i.e., adding new OSDs), which is determined by the nature of CRUSH and will cause significant performance degradation when the expansion is nontrivial.
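The migration effect described above can be illustrated with a small sketch. It does not implement CRUSH; it uses rendezvous (highest-random-weight) hashing as a simplified stand-in for a deterministic, directory-free placement function, and all names (`score`, `place`, `migrated_fraction`, the `osd.N` labels, the cluster sizes) are assumptions made for illustration only.

```python
import hashlib
from typing import Sequence


def score(obj: str, osd: str) -> int:
    """Deterministic pseudo-random weight for an (object, OSD) pair."""
    digest = hashlib.sha256(f"{obj}:{osd}".encode()).digest()
    return int.from_bytes(digest[:8], "big")


def place(obj: str, osds: Sequence[str]) -> str:
    """Pick the OSD with the highest score; no central directory is consulted."""
    return max(osds, key=lambda osd: score(obj, osd))


def migrated_fraction(objects: Sequence[str],
                      before: Sequence[str],
                      after: Sequence[str]) -> float:
    """Fraction of objects whose location differs between two cluster maps."""
    moved = sum(1 for o in objects if place(o, before) != place(o, after))
    return moved / len(objects)


# Hypothetical cluster: 8 OSDs expanded by 4 new ones.
objects = [f"obj-{i}" for i in range(10_000)]
old_osds = [f"osd.{i}" for i in range(8)]
new_osds = old_osds + [f"osd.{i}" for i in range(8, 12)]

fraction = migrated_fraction(objects, old_osds, new_osds)
print(f"{fraction:.1%} of existing objects move after the expansion")
```

In this stand-in scheme, roughly the share of capacity that was added (here about one third) is rebalanced onto the new devices as soon as the map changes, and the placement function itself offers no way to defer or limit that movement.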
This article presents MapX, a novel extension to CRUSH that uses an extra time-dimension mapping (from object creation times to cluster expansion times) for controlling data migration after cluster expansions. Each expansion is viewed as a new layer of the CRUSH map represented by a virtual node beneath the CRUSH root. MapX controls the mapping from objects onto layers by manipulating the timestamps of the intermediate placement groups (PGs). MapX is applicable to a large variety of object-based storage scenarios where object timestamps can be maintained as higher-level metadata. We have applied MapX to the state-of-the-art Ceph-RBD (RADOS Block Device) to implement a migration-controllable, decentralized object-based block store (called Oasis). Oasis extends the RBD metadata structure to maintain and retrieve approximate object creation times (for migration control) at the granularity of expansion layers. Experimental results show that the MapX-based Oasis block store outperforms the CRUSH-based Ceph-RBD (which is busy migrating objects after expansions) by 3.17× to 4.31× in tail latency, and by 76.3% (respectively, 83.8%) in IOPS for reads (respectively, writes).
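The time-dimension mapping can be sketched as follows. This is a minimal illustration under stated assumptions, not the MapX implementation: it maps an object's creation time directly to the most recent expansion layer that existed at that time, and it omits placement groups and the timestamp manipulation that the abstract describes. The class and function names (`Layer`, `LayeredMap`, `expand`, `layer_for`, `place`) and the hash-based choice within a layer are hypothetical.

```python
import bisect
import hashlib
from dataclasses import dataclass, field
from typing import List


def _score(obj: str, osd: str) -> int:
    """Deterministic pseudo-random weight for an (object, OSD) pair."""
    digest = hashlib.sha256(f"{obj}:{osd}".encode()).digest()
    return int.from_bytes(digest[:8], "big")


@dataclass
class Layer:
    expansion_time: int   # when this group of OSDs joined the cluster
    osds: List[str]       # devices added by this expansion


@dataclass
class LayeredMap:
    layers: List[Layer] = field(default_factory=list)

    def expand(self, expansion_time: int, osds: List[str]) -> None:
        """Record an expansion as a new layer; times must strictly increase."""
        if self.layers and expansion_time <= self.layers[-1].expansion_time:
            raise ValueError("expansion times must be strictly increasing")
        self.layers.append(Layer(expansion_time, list(osds)))

    def layer_for(self, creation_time: int) -> Layer:
        """Latest layer whose expansion time is not after the creation time."""
        times = [layer.expansion_time for layer in self.layers]
        index = bisect.bisect_right(times, creation_time) - 1
        if index < 0:
            raise ValueError("object predates the first layer")
        return self.layers[index]

    def place(self, obj: str, creation_time: int) -> str:
        """Choose an OSD inside the object's layer with a deterministic hash."""
        layer = self.layer_for(creation_time)
        return max(layer.osds, key=lambda osd: _score(obj, osd))


cluster = LayeredMap()
cluster.expand(0, [f"osd.{i}" for i in range(4)])

# Hypothetical objects created at times 1..1000, before the next expansion.
old_objects = {f"obj-{i}": i + 1 for i in range(1000)}
before = {name: cluster.place(name, t) for name, t in old_objects.items()}

cluster.expand(5000, [f"osd.{i}" for i in range(4, 8)])
after = {name: cluster.place(name, t) for name, t in old_objects.items()}
new_home = cluster.place("obj-new", 6000)

print("objects moved by the expansion:",
      sum(before[n] != after[n] for n in before))
print("new object placed on:", new_home)
```

Because each object is resolved through the layer that matches its creation time, adding a layer in this sketch leaves existing placements untouched while new objects land on the new devices. Rebalancing older data onto a new layer would then be an explicit decision rather than a side effect of the map change.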
Association for Computing Machinery (ACM)

