Javascript must be enabled to continue!
TriJoin: A Time-Efficient and Scalable Three-Way Distributed Stream Join System
View through CrossRef
<p>Stream join is one of the most fundamental operations in data stream processing applications. Existing distributed stream join systems can support efficient two-way join, which is a join operation between two streams. Based the two-way join, implementing a three-way join require to be split into double two-way joins, where the second two-way join needs to wait for the join result transmitted from the first two-way join. We show through experiments that such a design raises prohibitively high processing latency. To solve this problem, we propose TriJoin, a time-efficient three-way distributed stream join system. We design a symmetric wait-free structure by symmetrically partitioning tuples and reused join. TriJoin utilizes reused join to join each new tuple with the intermediate result of the other two streams and stored tuples locally. For a new tuple, TriJoin only joins it with the intermediate result to generate the final result without waiting, greatly reducing the processing latency. In TriJoin, we design two partitioning and storage schemes according to two different forms of three-way stream join. We implement TriJoin and conduct comprehensive experiments to evaluate the performance using real-world traces. Results show that TriJoin significantly reduces the processing latency by up to 68%, compared to existing designs.</p>
<p> </p>
Journal of Internet Technology
Title: TriJoin: A Time-Efficient and Scalable Three-Way Distributed Stream Join System
Description:
<p>Stream join is one of the most fundamental operations in data stream processing applications.
Existing distributed stream join systems can support efficient two-way join, which is a join operation between two streams.
Based the two-way join, implementing a three-way join require to be split into double two-way joins, where the second two-way join needs to wait for the join result transmitted from the first two-way join.
We show through experiments that such a design raises prohibitively high processing latency.
To solve this problem, we propose TriJoin, a time-efficient three-way distributed stream join system.
We design a symmetric wait-free structure by symmetrically partitioning tuples and reused join.
TriJoin utilizes reused join to join each new tuple with the intermediate result of the other two streams and stored tuples locally.
For a new tuple, TriJoin only joins it with the intermediate result to generate the final result without waiting, greatly reducing the processing latency.
In TriJoin, we design two partitioning and storage schemes according to two different forms of three-way stream join.
We implement TriJoin and conduct comprehensive experiments to evaluate the performance using real-world traces.
Results show that TriJoin significantly reduces the processing latency by up to 68%, compared to existing designs.
</p>
<p> </p>.
Related Results
Lahar simulation using Laharz_py program for the Mt. Halla volcano, Jeju, Korea
Lahar simulation using Laharz_py program for the Mt. Halla volcano, Jeju, Korea
This study using Laharz_py program, was performed schematic prediction
on the impact area of lahar hazards at the Mt. Halla volcano, Jeju
island volcanic field, Korea. In order to ...
Using join.me to help library patrons
Using join.me to help library patrons
PurposeAs the Informatics Librarian at Olivet Nazarene University, my staff and I are often responsible for troubleshooting our patrons' technology issues. My experience with join....
Along-stream evolution of Gulf Stream volume transport and water properties from underwater glider observations
Along-stream evolution of Gulf Stream volume transport and water properties from underwater glider observations
<p>The Gulf Stream is the western boundary current in the subtropical North Atlantic and a principal component of the upper limb of the Atlantic Meridional Overturnin...
Geochemical Survey of Stream Sediments and Stream Water for Ion-Adsorption Type Rare Earth Deposits (IAREDs): A Pilot Study in Jiaping IARED, Guangxi, South China
Geochemical Survey of Stream Sediments and Stream Water for Ion-Adsorption Type Rare Earth Deposits (IAREDs): A Pilot Study in Jiaping IARED, Guangxi, South China
Rare earth elements (REEs) are critical mineral resources that play a pivotal role in modern technology and industry. Currently, the global supply of light rare earth elements (LRE...
Influence of diurnal variations in stream temperature on streamflow loss and groundwater recharge
Influence of diurnal variations in stream temperature on streamflow loss and groundwater recharge
We demonstrate that for losing reaches with significant diurnal variations in stream temperature, the effect of stream temperature on streambed seepage is a major factor contributi...
Resource efficient distributed computing
Resource efficient distributed computing
There is a surge of interests in distributed computing thanks to advances in clustered computing and big data technology. My research explores topics on machine learning and big da...
Subglacial Conditions of the Kamb Ice Stream and its Response to Environmental Change
Subglacial Conditions of the Kamb Ice Stream and its Response to Environmental Change
<p>The Siple Coast ice streams, which drain the West Antarctic Ice Sheet into the Ross Ice Shelf, are susceptible to temporal changes in flow dynamics. The Kamb Ice Stream on...
Enhancing Water Level Estimates with DEM-derived Stream Geomorphometry
Enhancing Water Level Estimates with DEM-derived Stream Geomorphometry
Accurate water level predictions are increasingly crucial for mitigating flood risks. Hydrological and hydrodynamic models provide water level predictions, but their accuracy depen...

