Javascript must be enabled to continue!
Empirical study of knowledge network based on complex network theory
View through CrossRef
Knowledge graph is a hot topic in artificial intelligence area and has been widely adopted in intelligent search and question-and-answer system. Knowledge graph can be regarded as a complex network system and analyzed by complex network theory, which studies the interaction or relationship between various factors and basic characteristics of complex system. Its characteristics and their physical meanings are very helpful in understanding the nature of the knowledge graph. Concept graph is a large-scaled knowledge graph published by Microsoft. In this paper, we construct a huge complex network according to Microsoft’s concept graph. Its complex network characteristics, such as degree distribution, average shortest distance, clustering coefficient and degree correlation, are calculated and analyzed. The concept graph is not a connected network and its scale is very large; an approach is proposed to extract its largest connected subnet. The method has obvious advantages in both time complexity and space complexity. In this paper, we also present a method of calculating the approximate average shortest path of the largest connected subnet. The method estimates the maximum and minimum value of the shortest distance between nodes according to the distance between the central node and the network layer that the node belongs to and the distance between different layers. In order to calculate the clustering coefficient, different methods are introduced for nodes with different degree values and Map/Reduce idea is adopted to reduce the time cost. The experimental results show that the largest subnet of the concept graph is an ultra-small world network with the characteristics of scale-free. The average shortest path length decreases towards 4 with the network size increasing, which can be easily explained by the diamond-shaped network structure. The concept graph is a disassortative network where low degree nodes tend to connect to high degree nodes. The subConcepts account for 99.5% of nodes in the innermost <i>k</i>-core after <i>k</i>-shell decomposition. It shows that the subConcepts play an important role in the connectivity of network. The absence of subConcept affects the complexness of concept graph most, the concept next, and the instance least. The 82% instance nodes and 40% concept nodes of the concept graph each have a degree value of 1. It is believed that compared with the concept words, the instance words do not lead to the ambiguity in the understanding of natural language, caused by polysemy.
Acta Physica Sinica, Chinese Physical Society and Institute of Physics, Chinese Academy of Sciences
Title: Empirical study of knowledge network based on complex network theory
Description:
Knowledge graph is a hot topic in artificial intelligence area and has been widely adopted in intelligent search and question-and-answer system.
Knowledge graph can be regarded as a complex network system and analyzed by complex network theory, which studies the interaction or relationship between various factors and basic characteristics of complex system.
Its characteristics and their physical meanings are very helpful in understanding the nature of the knowledge graph.
Concept graph is a large-scaled knowledge graph published by Microsoft.
In this paper, we construct a huge complex network according to Microsoft’s concept graph.
Its complex network characteristics, such as degree distribution, average shortest distance, clustering coefficient and degree correlation, are calculated and analyzed.
The concept graph is not a connected network and its scale is very large; an approach is proposed to extract its largest connected subnet.
The method has obvious advantages in both time complexity and space complexity.
In this paper, we also present a method of calculating the approximate average shortest path of the largest connected subnet.
The method estimates the maximum and minimum value of the shortest distance between nodes according to the distance between the central node and the network layer that the node belongs to and the distance between different layers.
In order to calculate the clustering coefficient, different methods are introduced for nodes with different degree values and Map/Reduce idea is adopted to reduce the time cost.
The experimental results show that the largest subnet of the concept graph is an ultra-small world network with the characteristics of scale-free.
The average shortest path length decreases towards 4 with the network size increasing, which can be easily explained by the diamond-shaped network structure.
The concept graph is a disassortative network where low degree nodes tend to connect to high degree nodes.
The subConcepts account for 99.
5% of nodes in the innermost <i>k</i>-core after <i>k</i>-shell decomposition.
It shows that the subConcepts play an important role in the connectivity of network.
The absence of subConcept affects the complexness of concept graph most, the concept next, and the instance least.
The 82% instance nodes and 40% concept nodes of the concept graph each have a degree value of 1.
It is believed that compared with the concept words, the instance words do not lead to the ambiguity in the understanding of natural language, caused by polysemy.
Related Results
Network Automation
Network Automation
Purpose: The article "Network Automation in the Contemporary Economy" explores the concepts and methods of effective network management. The application stack, Jinja template engin...
KNOWLEDGE IN PRACTICE
KNOWLEDGE IN PRACTICE
Knowledge is an understanding of someone or something, such as facts, information, descriptions or skills, which is acquired by individuals through education, learning, experience ...
Detection of gene communities in multi-networks reveals cancer drivers
Detection of gene communities in multi-networks reveals cancer drivers
In the past years the advent of high-throughput experimental technologies provided biologists with a flood of molecular data. This huge amount of information requires the design of...
Sygeplejevidenskab myte eller virkelighed?
Sygeplejevidenskab myte eller virkelighed?
ENGLISH SUMMARY: The aim of this study is to explore the feasibility and productivity of a model for description and analysis of what Bourdieu has called “the genesis and structure...
Extending Post-Interpretive Criticism: Additional Diagnostic Indices for Enhanced Phenomenological Fidelity in Art Criticism
Extending Post-Interpretive Criticism: Additional Diagnostic Indices for Enhanced Phenomenological Fidelity in Art Criticism
This paper extends Post-Interpretive Criticism (PIC) by introducing a second layer of diagnostic indices designed to evaluate the phenomenological fidelity of art criticism. While ...
The impact of employees’ relationships on tacit knowledge sharing
The impact of employees’ relationships on tacit knowledge sharing
Purpose– This paper aims to study the impact of individual relationships on tacit knowledge sharing in the company setting of compulsory bond, expressive bond, instrumental bond an...
Analysis of the characteristics and evolution of knowledge label networks in the Q&A community: taking the Zhihu platform as an example
Analysis of the characteristics and evolution of knowledge label networks in the Q&A community: taking the Zhihu platform as an example
PurposeIn the era of mobile internet, the social Q&A community has built a large-scale and complex knowledge label network through its internal knowledge units, and the scale a...
Theory of Misplacement
Theory of Misplacement
Theory of Misplacement
By Dorian Vale
— A Treatise in the Post-Interpretive Movement
Theory of Misplacement is a foundational treatise in the Post-Interpretive canon developed by...

