Javascript must be enabled to continue!
A Framework on Data Mining on Uncertain Data with Related Research Issues in Service Industry
View through CrossRef
There has been a large amount of research work done on mining on relational databases that store data in exact values. However, in many real-life applications such as those commonly used in service industry, the raw data are usually uncertain when they are collected or produced. Sources of uncertain data include readings from sensors (such as RFID tagged in products in retail stores), classification results (e.g., identities of products or customers) of image processing using statistical classifiers, results from predictive programs used for stock market or targeted marketing as well as predictive churn model in customer relationship management. However, since traditional databases only store exact values, uncertain data are usually transformed into exact data by, for example, taking the mean value (for quantitative attributes) or by taking the value with the highest frequency or possibility. The shortcomings are obvious: (1) by approximating the uncertain source data values, the results from the mining tasks will also be approximate and may be wrong; (2) useful probabilistic information may be omitted from the results. Research on probabilistic databases began in 1980s. While there has been a great deal of work on supporting uncertainty in databases, there is increasing work on mining on such uncertain data. By classifying uncertain data into different categories, a framework is proposed to develop different probabilistic data mining techniques that can be applied directly on uncertain data in order to produce results that preserve the accuracy. In this chapter, we introduce the framework with a scheme to categorize uncertain data with different properties. We also propose a variety of definitions and approaches for different mining tasks on uncertain data with different properties. The advances in data mining application in this aspect are expected to improve the quality of services provided in various service industries.
Title: A Framework on Data Mining on Uncertain Data with Related Research Issues in Service Industry
Description:
There has been a large amount of research work done on mining on relational databases that store data in exact values.
However, in many real-life applications such as those commonly used in service industry, the raw data are usually uncertain when they are collected or produced.
Sources of uncertain data include readings from sensors (such as RFID tagged in products in retail stores), classification results (e.
g.
, identities of products or customers) of image processing using statistical classifiers, results from predictive programs used for stock market or targeted marketing as well as predictive churn model in customer relationship management.
However, since traditional databases only store exact values, uncertain data are usually transformed into exact data by, for example, taking the mean value (for quantitative attributes) or by taking the value with the highest frequency or possibility.
The shortcomings are obvious: (1) by approximating the uncertain source data values, the results from the mining tasks will also be approximate and may be wrong; (2) useful probabilistic information may be omitted from the results.
Research on probabilistic databases began in 1980s.
While there has been a great deal of work on supporting uncertainty in databases, there is increasing work on mining on such uncertain data.
By classifying uncertain data into different categories, a framework is proposed to develop different probabilistic data mining techniques that can be applied directly on uncertain data in order to produce results that preserve the accuracy.
In this chapter, we introduce the framework with a scheme to categorize uncertain data with different properties.
We also propose a variety of definitions and approaches for different mining tasks on uncertain data with different properties.
The advances in data mining application in this aspect are expected to improve the quality of services provided in various service industries.
Related Results
Optimisation of potash mining technology for cell and pillar mining method
Optimisation of potash mining technology for cell and pillar mining method
The diverse demand for inorganic fertilizers has predetermined the intensification of potash mining, which is a raw material for their production. In this regard, it has become nec...
PENGEMBANGAN MASYARAKAT LINGKAR TAMBANG DALAM PENGUSAHAAN PERTAMBANGAN
PENGEMBANGAN MASYARAKAT LINGKAR TAMBANG DALAM PENGUSAHAAN PERTAMBANGAN
Indonesia is a country rich in mining resources. Mining resources include gold, silver, copper, oil and gas, coal and others. There are a large number of companies operating in the...
Potential for increasing the efficiency of design processes for mining the solid mineral deposits based on digitalization and advanced analytics
Potential for increasing the efficiency of design processes for mining the solid mineral deposits based on digitalization and advanced analytics
Purpose. The research purpose is to develop and adapt the existing scientific-methodological, as well as software and information base for managing the geotechnological complexes t...
Domain Driven Data Mining
Domain Driven Data Mining
Quantitative intelligence based traditional data mining is facing grand challenges from real-world enterprise and cross-organization applications. For instance, the usual demonstra...
The Hazards of Data Mining in Healthcare
The Hazards of Data Mining in Healthcare
From the mid-1990s, data mining methods have been used to explore and find patterns and relationships in healthcare data. During the 1990s and early 2000's, data mining was a topic...
Research on Chinese Stock Market during COVID-19—Based on Random Matrix Theory
Research on Chinese Stock Market during COVID-19—Based on Random Matrix Theory
This paper focuses on the three industries that are greatly impacted by COVID-19, including the consumption industry, the pharmaceutical industry, and the financial industry. The d...
ON THE DEVELOPMENT OF A GENERAL METHOD FOR FORECASTING THE DANGEROUS PROPERTIES OF COAL SEAMS
ON THE DEVELOPMENT OF A GENERAL METHOD FOR FORECASTING THE DANGEROUS PROPERTIES OF COAL SEAMS
Purpose: to establish a quantitative effect on the dust-generating ability of mine layers of the degree of metamorphic transformations of fossil coals, mining-geological and mining...
Environmental History of Mining
Environmental History of Mining
To the casual observer, the topic of mining history is a natural fit with the field of environmental history. Mining, after all, has caused massive landscape changes; mines and the...

