Javascript must be enabled to continue!
Small Stochastic Data Compactification Concept Justified in the Entropy Basis
View through CrossRef
Measurement is a typical way of gathering information about an investigated object, generalized by a finite set of characteristic parameters. The result of each iteration of the measurement is an instance of the class of the investigated object in the form of a set of values of characteristic parameters. An ordered set of instances forms a collection whose dimensionality for a real object is a factor that cannot be ignored. Managing the dimensionality of data collections, as well as classification, regression, and clustering, are fundamental problems for machine learning. Compactification is the approximation of the original data collection by an equivalent collection (with a reduced dimension of characteristic parameters) with the control of accompanying information capacity losses. Related to compactification is the data completeness verifying procedure, which is characteristic of the data reliability assessment. If there are stochastic parameters among the initial data collection characteristic parameters, the compactification procedure becomes more complicated. To take this into account, this study proposes a model of a structured collection of stochastic data defined in terms of relative entropy. The compactification of such a data model is formalized by an iterative procedure aimed at maximizing the relative entropy of sequential implementation of direct and reverse projections of data collections, taking into account the estimates of the probability distribution densities of their attributes. The procedure for approximating the relative entropy function of compactification to reduce the computational complexity of the latter is proposed. To qualitatively assess compactification this study undertakes a formal analysis that uses data collection information capacity and the absolute and relative share of information losses due to compaction as its metrics. Taking into account the semantic connection of compactification and completeness, the proposed metric is also relevant for the task of assessing data reliability. Testing the proposed compactification procedure proved both its stability and efficiency in comparison with previously used analogues, such as the principal component analysis method and the random projection method.
Title: Small Stochastic Data Compactification Concept Justified in the Entropy Basis
Description:
Measurement is a typical way of gathering information about an investigated object, generalized by a finite set of characteristic parameters.
The result of each iteration of the measurement is an instance of the class of the investigated object in the form of a set of values of characteristic parameters.
An ordered set of instances forms a collection whose dimensionality for a real object is a factor that cannot be ignored.
Managing the dimensionality of data collections, as well as classification, regression, and clustering, are fundamental problems for machine learning.
Compactification is the approximation of the original data collection by an equivalent collection (with a reduced dimension of characteristic parameters) with the control of accompanying information capacity losses.
Related to compactification is the data completeness verifying procedure, which is characteristic of the data reliability assessment.
If there are stochastic parameters among the initial data collection characteristic parameters, the compactification procedure becomes more complicated.
To take this into account, this study proposes a model of a structured collection of stochastic data defined in terms of relative entropy.
The compactification of such a data model is formalized by an iterative procedure aimed at maximizing the relative entropy of sequential implementation of direct and reverse projections of data collections, taking into account the estimates of the probability distribution densities of their attributes.
The procedure for approximating the relative entropy function of compactification to reduce the computational complexity of the latter is proposed.
To qualitatively assess compactification this study undertakes a formal analysis that uses data collection information capacity and the absolute and relative share of information losses due to compaction as its metrics.
Taking into account the semantic connection of compactification and completeness, the proposed metric is also relevant for the task of assessing data reliability.
Testing the proposed compactification procedure proved both its stability and efficiency in comparison with previously used analogues, such as the principal component analysis method and the random projection method.
Related Results
Entropy and Wealth
Entropy and Wealth
While entropy was introduced in the second half of the 19th century in the international vocabulary as a scientific term, in the 20th century it became common in colloquial use. Po...
Compactifications of horospheric products
Compactifications of horospheric products
We define and study a new compactification, called the height compactification of the horospheric product of two infinite trees. We will provide a complete description of this comp...
A Generalized Measure of Cumulative Residual Entropy
A Generalized Measure of Cumulative Residual Entropy
In this work, we introduce a generalized measure of cumulative residual entropy and study its properties. We show that several existing measures of entropy such as cumulative resid...
Numerical Study on Entropy Generation of the Multi-Stage Centrifugal Pump
Numerical Study on Entropy Generation of the Multi-Stage Centrifugal Pump
The energy loss of the multi-stage centrifugal pump was investigated by numerical analysis using the entropy generation method with the RNG k-ε turbulence model. Entropy generation...
Influence of ideals in compactifications
Influence of ideals in compactifications
Abstract
One point compactification is studied in the light of ideal of subsets of ℕ. ????-proper map is introduced and showed that a continuous map can be extended ...
Implementasi Metode SAW dan Entropy pada Pemilihan Armada Travel
Implementasi Metode SAW dan Entropy pada Pemilihan Armada Travel
Abstract. Travel is a mode of transportation that can be used to travel the Jakarta – Bandung route. Many travel fleets that can be the user's choice. Errors in selecting travel fl...
Discussion on the Full Entropy Assumption of the SP 800-90 Series
Discussion on the Full Entropy Assumption of the SP 800-90 Series
NIST SP 800-90 series support the generation of high-quality random bits for cryptographic and non-cryptographic use. The security of a random number generator depends on the unpre...
Can remotely-sensed Earth’s entropy production reveal its ecological fitness?
Can remotely-sensed Earth’s entropy production reveal its ecological fitness?
It is straightforward to analyse Earth´s fitness in terms of controlling and governing global warming due to human emissions of greenhouse gasses. We make room, however, f...

