Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Dynamic Top-K Interesting Subgraph Query on Large-Scale Labeled Graphs

View through CrossRef
A labeled graph is a special structure with node identification capability, which is often used in information networks, biological networks, and other fields. The subgraph query is widely used as an important means of graph data analysis. As the size of the labeled graph increases and changes dynamically, users tend to focus on the high-match results that are of interest to them, and they want to take advantage of the relationship and number of results to get the results of the query quickly. For this reason, we consider the individual needs of users and propose a dynamic Top-K interesting subgraph query. This method establishes a novel graph topology feature index (GTSF index) including a node topology feature index (NTF index) and an edge feature index (EF index), which can effectively prune and filter the invalid nodes and edges that do not meet the restricted condition. The multi-factor candidate set filtering strategy is proposed based on the GTSF index, which can be further pruned to obtain fewer candidate sets. Then, we propose a dynamic Top-K interesting subgraph query method based on the idea of the sliding window to realize the dynamic modification of the matching results of the subgraph in the dynamic evolution of the label graph, to ensure real-time and accurate results of the query. In addition, considering the factors, such as frequent Input/Output (I/O) and network communication overheads, the optimization mechanism of the graph changes and an incremental maintenance strategy for the index are proposed to reduce the huge cost of redundant operation and global updates. The experimental results show that the proposed method can effectively deal with a dynamic Top-K interesting subgraph query on a large-scale labeled graph, at the same time the optimization mechanism of graph changes and the incremental maintenance strategy of the index can effectively reduce the maintenance overheads.
Title: Dynamic Top-K Interesting Subgraph Query on Large-Scale Labeled Graphs
Description:
A labeled graph is a special structure with node identification capability, which is often used in information networks, biological networks, and other fields.
The subgraph query is widely used as an important means of graph data analysis.
As the size of the labeled graph increases and changes dynamically, users tend to focus on the high-match results that are of interest to them, and they want to take advantage of the relationship and number of results to get the results of the query quickly.
For this reason, we consider the individual needs of users and propose a dynamic Top-K interesting subgraph query.
This method establishes a novel graph topology feature index (GTSF index) including a node topology feature index (NTF index) and an edge feature index (EF index), which can effectively prune and filter the invalid nodes and edges that do not meet the restricted condition.
The multi-factor candidate set filtering strategy is proposed based on the GTSF index, which can be further pruned to obtain fewer candidate sets.
Then, we propose a dynamic Top-K interesting subgraph query method based on the idea of the sliding window to realize the dynamic modification of the matching results of the subgraph in the dynamic evolution of the label graph, to ensure real-time and accurate results of the query.
In addition, considering the factors, such as frequent Input/Output (I/O) and network communication overheads, the optimization mechanism of the graph changes and an incremental maintenance strategy for the index are proposed to reduce the huge cost of redundant operation and global updates.
The experimental results show that the proposed method can effectively deal with a dynamic Top-K interesting subgraph query on a large-scale labeled graph, at the same time the optimization mechanism of graph changes and the incremental maintenance strategy of the index can effectively reduce the maintenance overheads.

Related Results

Query expansion by relying on the structure of knowledge bases
Query expansion by relying on the structure of knowledge bases
Query expansion techniques aim at improving the results achieved by a user's query by means of introducing new expansion terms, called expansion features. Expansion features introd...
RDF Subgraph Matching by Means of Star Decomposition
RDF Subgraph Matching by Means of Star Decomposition
<p>With the continuous development of the network, the scale of RDF data is becoming larger and larger. In the face of large-scale RDF data processing, the traditional databa...
Categorical Multi-Query Subgraph Matching on Labeled Graph
Categorical Multi-Query Subgraph Matching on Labeled Graph
Subgraph matching stands as a fundamental issue within the research realm of graph analysis. In this paper, we investigate a novel combinatorial problem that encompasses both multi...
A truss‐based approach for densest homogeneous subgraph mining in node‐attributed graphs
A truss‐based approach for densest homogeneous subgraph mining in node‐attributed graphs
AbstractIn a wide range of graph analysis tasks such as community detection and event detection, densest subgraph mining is important and primitive. With the development of social ...
A Survey of Query Auto Completion in Information Retrieval
A Survey of Query Auto Completion in Information Retrieval
In information retrieval, query auto completion (QAC), also known as type-ahead [Xiao et al., 2013, Cai et al., 2014b] and auto-complete suggestion [Jain and Mishne, 2010], refers ...
Discriminative Subgraph Mining for Protein Classification
Discriminative Subgraph Mining for Protein Classification
Protein classification can be performed by representing 3-D protein structures by graphs and then classifying the corresponding graphs. One effective way to classify such graphs is...
Optimized Distributed Subgraph Matching Algorithm Based on Partition Replication
Optimized Distributed Subgraph Matching Algorithm Based on Partition Replication
At present, with the explosive growth of data scale, subgraph matching for massive graph data is difficult to satisfy with efficiency. Meanwhile, the graph index used in existing s...
CIDER: Counterfactual-Invariant Diffusion-based GNN Explainer for Causal Subgraph Inference
CIDER: Counterfactual-Invariant Diffusion-based GNN Explainer for Causal Subgraph Inference
Abstract Inferring causal links or subgraphs corresponding to a specific phenotype or label based solely on measured data is an important yet challenging task, which is als...

Back to Top