Javascript must be enabled to continue!
Abnormal data detection method for multidimensional datasets based on spectral clustering
View through CrossRef
Aiming at the problem that the detection of abnormal data in multidimensional data sets is not processed by dimensionality reduction, which leads to low detection accuracy, high false positive rate and long detection time, a method for detecting abnormal data in multidimensional data sets based on spectral clustering is proposed. Firstly, the data in the multidimensional data set is clustered by Laplacian matrix to preliminarily classify the data; secondly, the local linear embedding (LLE) algorithm is used to reduce the dimensionality of the classified data, and the high-dimensional data set is expressed by feature vectors to remove redundant information in the multidimensional data set; finally, the processed multidimensional data set is input into the support vector machine model, and the abnormal data is detected according to the calculation of regression estimation value. Experimental results show that the proposed algorithm has higher accuracy, lower false positive rate and shorter detection time for detecting abnormal data in multidimensional data sets.
Title: Abnormal data detection method for multidimensional datasets based on spectral clustering
Description:
Aiming at the problem that the detection of abnormal data in multidimensional data sets is not processed by dimensionality reduction, which leads to low detection accuracy, high false positive rate and long detection time, a method for detecting abnormal data in multidimensional data sets based on spectral clustering is proposed.
Firstly, the data in the multidimensional data set is clustered by Laplacian matrix to preliminarily classify the data; secondly, the local linear embedding (LLE) algorithm is used to reduce the dimensionality of the classified data, and the high-dimensional data set is expressed by feature vectors to remove redundant information in the multidimensional data set; finally, the processed multidimensional data set is input into the support vector machine model, and the abnormal data is detected according to the calculation of regression estimation value.
Experimental results show that the proposed algorithm has higher accuracy, lower false positive rate and shorter detection time for detecting abnormal data in multidimensional data sets.
Related Results
The Kernel Rough K-Means Algorithm
The Kernel Rough K-Means Algorithm
Background:
Clustering is one of the most important data mining methods. The k-means
(c-means ) and its derivative methods are the hotspot in the field of clustering research in re...
Image clustering using exponential discriminant analysis
Image clustering using exponential discriminant analysis
Local learning based image clustering models are usually employed to deal with images sampled from the non‐linear manifold. Recently, linear discriminant analysis (LDA) based vario...
Abnormal Status Detection of Catenary Based on TSNE Dimensionality
Reduction Method and IGWO-LSSVM Model
Abnormal Status Detection of Catenary Based on TSNE Dimensionality
Reduction Method and IGWO-LSSVM Model
Background:
Catenary is a crucial component of an electrified railroad's traction power supply system.
There is a considerable incidence of abnormal status and failures due to prol...
Anomaly Detection of Dam Monitoring Data based on Improved Spectral Clustering
Anomaly Detection of Dam Monitoring Data based on Improved Spectral Clustering
<p>In response to the abnormal data mining in dam safety monitoring, and based on the traditional spectral clustering, this paper presents an anomaly detection method based o...
The Application of S‐transform Spectrum Decomposition Technique in Extraction of Weak Seismic Signals
The Application of S‐transform Spectrum Decomposition Technique in Extraction of Weak Seismic Signals
AbstractIn processing of deep seismic reflection data, when the frequency band difference between the weak useful signal and noise both from the deep subsurface is very small and h...
Investigating Outlier Detection Techniques Based on Kernel Rough
Clustering
Investigating Outlier Detection Techniques Based on Kernel Rough
Clustering
Background:
Data quality is crucial to the success of big data analytics. However, the
presence of outliers affects data quality and data analysis. Employing effective outlier dete...
A COMPARATIVE ANALYSIS OF K-MEANS AND HIERARCHICAL CLUSTERING
A COMPARATIVE ANALYSIS OF K-MEANS AND HIERARCHICAL CLUSTERING
Clustering is the process of arranging comparable data elements into groups. One of the most frequent data mining analytical techniques is clustering analysis; the clustering algor...
IDCUP Algorithm to Classifying Arbitrary Shapes and Densities for Center-based Clustering Performance Analysis
IDCUP Algorithm to Classifying Arbitrary Shapes and Densities for Center-based Clustering Performance Analysis
Aim/Purpose: The clustering techniques are normally considered to determine the significant and meaningful subclasses purposed in datasets. It is an unsupervised type of Machine Le...

