Javascript must be enabled to continue!
Rayleigh-quotient-based Data Classification Method for Machine Learning
View through CrossRef
Abstract
Inspired by the ability of human ears to distinguish between different kinds of music and the ability of an ancient instrument to play countless beautiful melodies, this paper proposes the Rayleigh-quotient-based data classification method, which implements data classification by selecting an appropriate eigenfunction coordinate system to represent different categories of data and making different categories of data show significant differences in the coordinate components, meanwhile, transforming the selection problem of an eigenfunction coordinate system into a design problem of physical parameters of music instruments, such as designing a non-uniform string or membrane with distributed supports. In theory, the method is based on the eigenvalue problem for Sturm-Liouville differential equation, the eigenvalue problem for functional minimization, and the generalized eigenvalue problem for matrices, which have a time-honored history and can provide strong theoretical foundation and support. In addition, each parameter in the method has a clear physical interpretation and can be adjusted under the guidance of a set of complete theory, and their influence on classification results is definite, avoiding the large time consumption due to randomly adjusting parameters. The data classification procedure is divided into 2 steps, including mass density function design and stiffness function design, which not only simplifies the process of parameter adjustment, but also enables the optimization algorithm to be implemented by the linear programming with a low computational complexity, theoretically guaranteeing a high computational efficiency. Considering the amount of data being classified is large, two kinds of algorithms are developed to implement data classification, where one is a statistical averaging algorithm and the other is an incremental iterative algorithm. Finally, by combining the image database MNIST, the Rayleigh-quotient-based data classification method is employed to build an image classifier, which is trained on the MNIST training set including 60,000 images, and is tested on the MNIST test set including 10,000 images. The classification results show that by selecting an appropriate eigenfunction coordinate system, the Rayleigh quotient distribution of each category of image can be controlled within a very narrow range, while different categories of images show significant difference in the Rayleigh quotient distribution; the classification accuracy on the MNIST training set and on the MNIST test set are stable at about 99.50%, and the maximum accuracy can reach to 100.00% on the MNIST training set and 99.76% on the MNIST test set; and the Rayleigh-quotient threshold can be selected within a very wide range, beneficial for the robustness and stability of data classification.
Title: Rayleigh-quotient-based Data Classification Method for Machine Learning
Description:
Abstract
Inspired by the ability of human ears to distinguish between different kinds of music and the ability of an ancient instrument to play countless beautiful melodies, this paper proposes the Rayleigh-quotient-based data classification method, which implements data classification by selecting an appropriate eigenfunction coordinate system to represent different categories of data and making different categories of data show significant differences in the coordinate components, meanwhile, transforming the selection problem of an eigenfunction coordinate system into a design problem of physical parameters of music instruments, such as designing a non-uniform string or membrane with distributed supports.
In theory, the method is based on the eigenvalue problem for Sturm-Liouville differential equation, the eigenvalue problem for functional minimization, and the generalized eigenvalue problem for matrices, which have a time-honored history and can provide strong theoretical foundation and support.
In addition, each parameter in the method has a clear physical interpretation and can be adjusted under the guidance of a set of complete theory, and their influence on classification results is definite, avoiding the large time consumption due to randomly adjusting parameters.
The data classification procedure is divided into 2 steps, including mass density function design and stiffness function design, which not only simplifies the process of parameter adjustment, but also enables the optimization algorithm to be implemented by the linear programming with a low computational complexity, theoretically guaranteeing a high computational efficiency.
Considering the amount of data being classified is large, two kinds of algorithms are developed to implement data classification, where one is a statistical averaging algorithm and the other is an incremental iterative algorithm.
Finally, by combining the image database MNIST, the Rayleigh-quotient-based data classification method is employed to build an image classifier, which is trained on the MNIST training set including 60,000 images, and is tested on the MNIST test set including 10,000 images.
The classification results show that by selecting an appropriate eigenfunction coordinate system, the Rayleigh quotient distribution of each category of image can be controlled within a very narrow range, while different categories of images show significant difference in the Rayleigh quotient distribution; the classification accuracy on the MNIST training set and on the MNIST test set are stable at about 99.
50%, and the maximum accuracy can reach to 100.
00% on the MNIST training set and 99.
76% on the MNIST test set; and the Rayleigh-quotient threshold can be selected within a very wide range, beneficial for the robustness and stability of data classification.
Related Results
Correlates of adaptive behavior profiles in a large cohort of children with autism: The autism speaks Autism Treatment Network registry data
Correlates of adaptive behavior profiles in a large cohort of children with autism: The autism speaks Autism Treatment Network registry data
Children with autism spectrum disorder have deficits in adaptive functioning. This study examines the adaptive behavior, its association with cognitive ability, gender, age, and sy...
Research and Application of Rayleigh Wave 3-D Phase Velocity Imaging Based on Wavelet Transform
Research and Application of Rayleigh Wave 3-D Phase Velocity Imaging Based on Wavelet Transform
Rayleigh wave detection is a recently developed method for shallow seismic exploration. Current Rayleigh wave data processing and interpretation methods can only provide the transv...
<strong>Kabir Wealth Code: The combined effect of emotional awareness (EA) and intelligence quotient (IQ)</strong>
<strong>Kabir Wealth Code: The combined effect of emotional awareness (EA) and intelligence quotient (IQ)</strong>
Background: Intelligence quotient (IQ) is a measure of intellectual ability of performing, comprehension, and learning. Previous studies reported that intelligence measures predict...
Non-Rayleigh statistics of ultrasonic backscattered echoes
Non-Rayleigh statistics of ultrasonic backscattered echoes
Till recently, many of the tissue characterization models have been based on the Rayleigh distribution. However, there have been many instances cited in literature suggesting that ...
Enhancing Non-Formal Learning Certificate Classification with Text Augmentation: A Comparison of Character, Token, and Semantic Approaches
Enhancing Non-Formal Learning Certificate Classification with Text Augmentation: A Comparison of Character, Token, and Semantic Approaches
Aim/Purpose: The purpose of this paper is to address the gap in the recognition of prior learning (RPL) by automating the classification of non-formal learning certificates using d...
Analysis of Influencing Factors and Numerical Simulation of Horizontal-to-Vertical Spectral Ratio Method
Analysis of Influencing Factors and Numerical Simulation of Horizontal-to-Vertical Spectral Ratio Method
To improve the calculation accuracy of the horizontal-to-vertical spectral ratio (HVSR) method, this study theoretically analyzed the influencing factors of Rayleigh wave polarizab...
Optimising tool wear and workpiece condition monitoring via cyber-physical systems for smart manufacturing
Optimising tool wear and workpiece condition monitoring via cyber-physical systems for smart manufacturing
Smart manufacturing has been developed since the introduction of Industry 4.0. It consists of resource sharing and networking, predictive engineering, and material and data analyti...
APPLICATIONS OF INVERSE WEIBULL RAYLEIGH DISTRIBUTION TO FAILURE RATES AND VINYL CHLORIDE DATA SETS
APPLICATIONS OF INVERSE WEIBULL RAYLEIGH DISTRIBUTION TO FAILURE RATES AND VINYL CHLORIDE DATA SETS
In this work, a new three parameter distribution called the Inverse Weibull Rayleigh distribution is proposed. Some of its statistical properties were presented. The PDF plot of In...

