Javascript must be enabled to continue!

A Distributed Algorithm for Mining Fuzzy Association Rules in Traditional Databases

The mining of fuzzy association rules has been proposed in the literature recently. Many of the ensuing algorithms are developed to make use of only a single processor or machine. They can be further enhanced by taking advantage of the scalability of parallel or distributed computer systems. The increasing ability to collect data and the resulting huge data volume make the exploitation of parallel or distributed systems become more and more important to the success of fuzzy association rule mining algorithms. This chapter proposes a new distributed algorithm, called DFARM, for mining fuzzy association rules from very large databases. Unlike many existing algorithms that adopt the support-confidence framework such that an association is considered interesting if it satisfies some user-specified minimum percentage thresholds, DFARM embraces an objective measure to distinguish interesting associations from uninteresting ones. This measure is defined as a function of the difference in the actual and the expected number of tuples characterized by different linguistic variables (attributes) and linguistic terms (attribute values). Given a database, DFARM first divides it into several horizontal partitions and assigns them to different sites in a distributed system. It then has each site scan its own database partition to obtain the number of tuples characterized by different linguistic variables and linguistic terms (i.e., the local counts), and exchange the local counts with all the other sites to find the global counts. Based on the global counts, the values of the interestingness measure are computed, and the sites can uncover interesting associations. By repeating this process of counting, exchanging counts, and calculating the interestingness measure, it unveils the underlying interesting associations hidden in the data. We implemented DFARM in a distributed system and used a popular benchmark data set to evaluate its performance. The results show that it has very good size-up, speedup, and scale-up performance. We also evaluated the effectiveness of the proposed interestingness measure on two synthetic data sets. The experimental results show that it is very effective in differentiating between interesting and uninteresting associations.

IGI Global

Wai-Ho Au

Handbook of Research on Fuzzy Information Processing in Databases

2011

Title: A Distributed Algorithm for Mining Fuzzy Association Rules in Traditional Databases

Description:

The mining of fuzzy association rules has been proposed in the literature recently.

Many of the ensuing algorithms are developed to make use of only a single processor or machine.

They can be further enhanced by taking advantage of the scalability of parallel or distributed computer systems.

The increasing ability to collect data and the resulting huge data volume make the exploitation of parallel or distributed systems become more and more important to the success of fuzzy association rule mining algorithms.

This chapter proposes a new distributed algorithm, called DFARM, for mining fuzzy association rules from very large databases.

Unlike many existing algorithms that adopt the support-confidence framework such that an association is considered interesting if it satisfies some user-specified minimum percentage thresholds, DFARM embraces an objective measure to distinguish interesting associations from uninteresting ones.

This measure is defined as a function of the difference in the actual and the expected number of tuples characterized by different linguistic variables (attributes) and linguistic terms (attribute values).

Given a database, DFARM first divides it into several horizontal partitions and assigns them to different sites in a distributed system.

It then has each site scan its own database partition to obtain the number of tuples characterized by different linguistic variables and linguistic terms (i.

, the local counts), and exchange the local counts with all the other sites to find the global counts.

Based on the global counts, the values of the interestingness measure are computed, and the sites can uncover interesting associations.

By repeating this process of counting, exchanging counts, and calculating the interestingness measure, it unveils the underlying interesting associations hidden in the data.

We implemented DFARM in a distributed system and used a popular benchmark data set to evaluate its performance.

The results show that it has very good size-up, speedup, and scale-up performance.

We also evaluated the effectiveness of the proposed interestingness measure on two synthetic data sets.

The experimental results show that it is very effective in differentiating between interesting and uninteresting associations.

Back

Abstract. Fuzzy Inference System requires several stages to get the output, 1) formation of fuzzy sets, 2) formation of rules, 3) application of implication functions, 4) compositi...

Generated Fuzzy Quasi-ideals in Ternary Semigroups

Here in this paper, we provide characterizations of fuzzy quasi-ideal in terms of level and strong level subsets. Along with it, we provide expression for the generated fuzzy quasi...

New Approaches of Generalised Fuzzy Soft sets on fuzzy Codes and Its Properties on Decision-Makings

Background Several scholars defined the concepts of fuzzy soft set theory and their application on decision-making problem. Based on this concept, researchers defined the generalis...

New Approaches of Generalised Fuzzy Soft sets on fuzzy Codes and Its Properties on Decision-Makings

Background Several scholars defined the concepts of fuzzy soft set theory and their application on decision-making problem. Based on this concept, researchers defined the generalis...

Fuzzy Chaotic Neural Networks

An understanding of the human brain’s local function has improved in recent years. But the cognition of human brain’s working process as a whole is still obscure. Both fuzzy logic ...

FUZZY RINGS AND ITS PROPERTIES

Abstract One of algebraic structure that involves a binary operation is a group that is defined an un empty set (classical) with an associative binary operation, it has identity e...

Fuzzy Semantic Models of Fuzzy Concepts in Fuzzy Systems

The fuzzy properties of language semantics are a central problem towards machine-enabled natural language processing in cognitive linguistics, fuzzy systems, and computational ling...

Explainable Itemset Utility Maximization with Fuzzy Set

<>Recently, fuzzy utility pattern mining has received much attention for its practicality and comprehensibility. It aims to discover high fuzzy utility itemsets (HFUIs) by co...

Email:
Password:

Email:

A Distributed Algorithm for Mining Fuzzy Association Rules in Traditional Databases

Related Results