Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Determination of the Optimal Window Size for the Spatial XOR Filter

View through CrossRef
ABSTRACTIntroductionAn XOR filter is a probabilistic data structure representing a set of keys for membership queries. Given a set of keys, and hash functions , the filter relies on filling in an array such that, for all , equals a special value, called the fingerprint of . A common approach to fill in is by using a greedy algorithm, which may or may not succeed, and whose probability of failing increases as the load factor increases. The Spatial XOR filter is a variant proposing that the hash functions map each key only into a smaller contiguous portion of , called window, and empirical results show that it is possible to achieve a larger for the same chance of succeeding in filling in using the greedy approach. A result in the literature conjectures that the optimal window size is for .MethodsIn this work, we comprehensively test this conjecture, considering various values of and . Using the leave‐one‐out validation process of machine learning, and Occam's razor principle, to determine the optimal window size and maximum load factor as a function of .ResultsWe find that the optimal window size of is confirmed by our methodology, and we provide the concrete function behind the asymptotic notation; as a byproduct of the methodology, we suggest alternative candidate functions for this window optimal size. We also propose the maximum load factor function possible to achieve in terms of .ConclusionsUsing the optimal window size empirically provided by our methodology, the results show that the Spatial XOR filter is competitive among its peers. Other filters have very close space savings compared to the Spatial XOR filter. The derived functions extrapolated well in the experiments, although the theoretical optimal window size remains an open problem.
Title: Determination of the Optimal Window Size for the Spatial XOR Filter
Description:
ABSTRACTIntroductionAn XOR filter is a probabilistic data structure representing a set of keys for membership queries.
Given a set of keys, and hash functions , the filter relies on filling in an array such that, for all , equals a special value, called the fingerprint of .
A common approach to fill in is by using a greedy algorithm, which may or may not succeed, and whose probability of failing increases as the load factor increases.
The Spatial XOR filter is a variant proposing that the hash functions map each key only into a smaller contiguous portion of , called window, and empirical results show that it is possible to achieve a larger for the same chance of succeeding in filling in using the greedy approach.
A result in the literature conjectures that the optimal window size is for .
MethodsIn this work, we comprehensively test this conjecture, considering various values of and .
Using the leave‐one‐out validation process of machine learning, and Occam's razor principle, to determine the optimal window size and maximum load factor as a function of .
ResultsWe find that the optimal window size of is confirmed by our methodology, and we provide the concrete function behind the asymptotic notation; as a byproduct of the methodology, we suggest alternative candidate functions for this window optimal size.
We also propose the maximum load factor function possible to achieve in terms of .
ConclusionsUsing the optimal window size empirically provided by our methodology, the results show that the Spatial XOR filter is competitive among its peers.
Other filters have very close space savings compared to the Spatial XOR filter.
The derived functions extrapolated well in the experiments, although the theoretical optimal window size remains an open problem.

Related Results

On Flores Island, do "ape-men" still exist? https://www.sapiens.org/biology/flores-island-ape-men/
On Flores Island, do "ape-men" still exist? https://www.sapiens.org/biology/flores-island-ape-men/
<span style="font-size:11pt"><span style="background:#f9f9f4"><span style="line-height:normal"><span style="font-family:Calibri,sans-serif"><b><spa...
Hubungan Perilaku Pola Makan dengan Kejadian Anak Obesitas
Hubungan Perilaku Pola Makan dengan Kejadian Anak Obesitas
<p><em><span style="font-size: 11.0pt; font-family: 'Times New Roman',serif; mso-fareast-font-family: 'Times New Roman'; mso-ansi-language: EN-US; mso-fareast-langua...
Enhancement of Rabin-Karp Algorithmusing XOR Filter
Enhancement of Rabin-Karp Algorithmusing XOR Filter
Purpose–Thestudy aims to enhance the Rabin-Karp Algorithm that underlinesthe problem encountered wherein the algorithm’s runtimeperformanceis affected due tothe continuous rap...
Even Star Decomposition of Complete Bipartite Graphs
Even Star Decomposition of Complete Bipartite Graphs
<p><span lang="EN-US"><span style="font-family: 宋体; font-size: medium;">A decomposition (</span><span><span style="font-family: 宋体; font-size: medi...
Nigella sativa L. oil: Study of its toxicity, antiradical activity, and effect on circulating xanthine oxidoreductase
Nigella sativa L. oil: Study of its toxicity, antiradical activity, and effect on circulating xanthine oxidoreductase
Introduction: Nigella sativa L. is a widely used medicinal plant throughout the world. The low toxic effects and low price of this plant make it an excellent treatment choice for m...
Synthesis and design of dissipative filters with improved performance
Synthesis and design of dissipative filters with improved performance
Connect, upload, download, share and transfer anything at anytime and anywhere is not a futuristic vision and is indeed a real demand on current and future wireless and fixed commu...
Xor-Magic Graphs
Xor-Magic Graphs
Abstract A connected graph on 2n vertices is defined to be xor-magic if the vertices can be labeled with distinct n-bit binary numbers in such a way that the label a...

Back to Top