Research and Improvement of Apriori Algorithm Based on Hadoop
Abstract
Association rule mining is a powerful way to uncover relationships among items in big data, and Apriori is one of the most important association rule mining algorithms. Traditional mining based on parallel Apriori algorithms spends more and more time on data I/O as the transaction database grows. This paper improves the Apriori algorithm in three ways: compressing transactions, reducing the number of database scans, and simplifying candidate set generation. The improved algorithm is then parallelized on the Hadoop framework. Experiments show that the improved algorithm is suitable for large-scale data mining and has good scalability and effectiveness.
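The abstract does not give the details of the paper's improvements, so the following is only a minimal single-machine sketch of textbook Apriori with one common form of transaction compression: infrequent items are dropped after the first pass, and transactions too short to contain a k-itemset are skipped. The function name `apriori` and the parameter `min_support` (an absolute count) are illustrative assumptions, not the authors' code, and no Hadoop parallelization is shown.

```python
from collections import Counter
from itertools import combinations


def apriori(transactions, min_support):
    """Return {frozenset(itemset): support_count} for all frequent itemsets.

    Hypothetical helper for illustration; min_support is an absolute count.
    """
    # Pass 1: count single items (each item once per transaction).
    item_counts = Counter(item for t in transactions for item in set(t))
    frequent = {frozenset([i]): c for i, c in item_counts.items()
                if c >= min_support}
    result = dict(frequent)

    # Transaction compression (assumed variant): drop infrequent items,
    # since they cannot appear in any frequent itemset.
    kept = {i for itemset in frequent for i in itemset}
    db = [frozenset(t) & kept for t in transactions]

    k = 2
    while frequent:
        # A transaction with fewer than k items cannot contain a k-itemset.
        db = [t for t in db if len(t) >= k]

        # Candidate generation: join frequent (k-1)-itemsets, then prune any
        # candidate that has an infrequent (k-1)-subset.
        prev = set(frequent)
        candidates = set()
        for a in prev:
            for b in prev:
                union = a | b
                if len(union) == k and all(
                    frozenset(s) in prev for s in combinations(union, k - 1)
                ):
                    candidates.add(union)

        # One scan of the compressed database per level.
        counts = Counter()
        for t in db:
            for c in candidates:
                if c <= t:
                    counts[c] += 1

        frequent = {c: n for c, n in counts.items() if n >= min_support}
        result.update(frequent)
        k += 1
    return result
```

In a MapReduce setting, the counting scan is the natural step to distribute: mappers would emit (candidate, 1) pairs for their share of the transactions and reducers would sum them. That mapping is a general observation about Apriori, not a description of this paper's design.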
Walter de Gruyter GmbH
Related Results
Hadoop Tools
As the name indicates, this chapter explains the various additional tools provided by Hadoop. The additional tools provided by Hadoop distribution are Hadoop Streaming, Hadoop Arch...
Secure Cloud Data with Attribute-based Honey Encryption
Encryption is a technique for converting plain text into cipher text, which is unreadable without an appropriate decryption key. Hadoop is a platform to process and st...
Clustering and Pattern Mining of Customer Transaction Data using Apriori Algorithm
Clustering customer transaction data is an important procedure for analyzing customer behavior in retail and e-Commerce. Clustering of trading data with finding patterns using Apri...
Hadoop Ecosystem and Cloud Integration
The integration of the Hadoop ecosystem with cloud computing marks a transformative evolution in the way organizations manage and analyze large-scale data. This study examines how ...
Parallel Design of Apriori Algorithm Based on the Method of “Determine Infrequent Items & Remove Infrequent Itemsets”
In the method of fault association rule diagnosis, Apriori algorithm has low efficiency for big data processing. In this paper, aiming at the defects of Apr...
ANALISIS KERANJANG PASAR UNTUK PENINGKATAN PENJUALAN MENGUNAKAN ALGORITMA APRIORI (Market Basket Analysis for Increasing Sales Using the Apriori Algorithm)
This study aims to improve sales strategy through market basket analysis using the Apriori algorithm. Currently, companies face challenges in ...
A Novel Approach to Enhance the Efficiency of Apriori Algorithm
Data mining is the process of obtaining valuable or significant information from a large-scale database. One significant area of research in the field of data mining is association ...
Optimization of Association Rule Using Ant Colony Optimization (ACO) Approach
The Apriori algorithm creates all possible association rules between items in the database using the Association Rule Mining and Apriori Algorithm. Using Ant Colony Optimization, a...

