Javascript must be enabled to continue!

Optimized Distributed Subgraph Matching Algorithm Based on Partition Replication

At present, with the explosive growth of data scale, subgraph matching for massive graph data is difficult to satisfy with efficiency. Meanwhile, the graph index used in existing subgraph matching algorithm is difficult to update and maintain when facing dynamic graphs. We propose a distributed subgraph matching algorithm based on Partition Replica (noted as PR-Match) to process the partition and storage of large-scale data graphs. The PR-Match algorithm first splits the query graph into sub-queries, then assigns the sub-query to each node for sub-graph matching, and finally merges the matching results. In the PR-Match algorithm, we propose a heuristic rule based on prediction cost to select the optimal merging plan, which greatly reduces the cost of merging. In order to accelerate the matching speed of the sub-query graph, a vertex code based on the vertex neighbor label signature is proposed, which greatly reduces the search space for the subquery. As the vertex code is based on the increment, the problem that the feature-based graph index is difficult to maintain in the face of the dynamic graph is solved. An abundance of experiments on real and synthetic datasets demonstrate the high efficiency and strong scalability of the PR-Match algorithm when handling large-scale data graphs.

MDPI AG

Ling Yuan Jiali Bin Peng Pan

Electronics

2020

Title: Optimized Distributed Subgraph Matching Algorithm Based on Partition Replication

Description:

At present, with the explosive growth of data scale, subgraph matching for massive graph data is difficult to satisfy with efficiency.

Meanwhile, the graph index used in existing subgraph matching algorithm is difficult to update and maintain when facing dynamic graphs.

We propose a distributed subgraph matching algorithm based on Partition Replica (noted as PR-Match) to process the partition and storage of large-scale data graphs.

The PR-Match algorithm first splits the query graph into sub-queries, then assigns the sub-query to each node for sub-graph matching, and finally merges the matching results.

In the PR-Match algorithm, we propose a heuristic rule based on prediction cost to select the optimal merging plan, which greatly reduces the cost of merging.

In order to accelerate the matching speed of the sub-query graph, a vertex code based on the vertex neighbor label signature is proposed, which greatly reduces the search space for the subquery.

As the vertex code is based on the increment, the problem that the feature-based graph index is difficult to maintain in the face of the dynamic graph is solved.

An abundance of experiments on real and synthetic datasets demonstrate the high efficiency and strong scalability of the PR-Match algorithm when handling large-scale data graphs.

Back

Partition of the Indian subcontinent is the darkest chapter in our history. India was divided into two halves and the reason of this fateful division was a consequence of many even...

Common Cases of Partition Recovery

A number of automatic operations are carried out by partition recovery tools in an effort to repair damaged or erased partitions and/or recover data from them. A deleted partition ...

A truss‐based approach for densest homogeneous subgraph mining in node‐attributed graphs

AbstractIn a wide range of graph analysis tasks such as community detection and event detection, densest subgraph mining is important and primitive. With the development of social ...

Chromatin-dependent pre-replication complex positioning and activation in mammals

Positionnement et activation du complexe de pré-réplication dépendant de la chromatine dans les mammifères Chaque division cellulaire requiert une duplication préci...

CIDER: Counterfactual-Invariant Diffusion-based GNN Explainer for Causal Subgraph Inference

Abstract Inferring causal links or subgraphs corresponding to a specific phenotype or label based solely on measured data is an important yet challenging task, which is als...

Distributed subgraph matching on timely dataflow

Recently there emerge many distributed algorithms that aim at solving subgraph matching at scale. Existing algorithm-level comparisons failed to provide a systematic view of distri...

RDF Subgraph Matching by Means of Star Decomposition

<p>With the continuous development of the network, the scale of RDF data is becoming larger and larger. In the face of large-scale RDF data processing, the traditional databa...

A Fast Pattern Matching Algorithm Based on Middle Characters of Pattern String

String pattern matching is one of the important string operation. At present, the pattern matching algorithm of strings mainly includes BF algorithm, KMP algorithm, and improved KM...

Email:
Password:

Email:

Optimized Distributed Subgraph Matching Algorithm Based on Partition Replication

Related Results