Javascript must be enabled to continue!
An algebraic semigroup method for discovering maximal frequent itemsets
View through CrossRef
Abstract
Discovering maximal frequent itemsets is an important issue and key technique in many data mining problems such as association rule mining. In the literature, generating maximal frequent itemsets proves either to be NP-hard or to have
O
(
l
3
4
l
(
m
+
n
)
)
O\left({l}^{3}{4}^{l}\left(m+n))
complexity in the worst case from the perspective of generating maximal complete bipartite graphs of a bipartite graph, where
m
m
,
n
n
are the item number and the transaction number, respectively, and
l
l
denotes the maximum of
∣
C
∣
∣
Ψ
(
C
)
∣
/
(
∣
C
∣
+
∣
Ψ
(
C
)
∣
−
1
)
| C| | \Psi \left(C)| \hspace{0.1em}\text{/}\hspace{0.1em}\left(| C| +| \Psi \left(C)| -1)
, with the maximum taken over all maximal frequent itemsets
C
C
. In this article, we put forward a method for discovering maximal frequent itemsets, whose complexity is
O
(
3
m
n
2
β
+
4
β
n
)
O\left(3mn{2}^{\beta }+{4}^{\beta }n)
, lower than the known complexity both in the worst case, from the perspective of semigroup algebra, where
β
\beta
is the number of items whose support is more than the minimum support threshold. Experiments also show that an algorithm based on the algebraic method performs better than the other three well-known algorithms. Meanwhile, we explore some algebraic properties with respect to items and transactions, prove that the maximal frequent itemsets are exactly the simplified generators of frequent itemsets, give a necessary and sufficient condition for a maximal
i
+
1
i+1
-frequent itemset being a subset of a closed
i
i
-frequent itemset, and provide a recurrence formula of maximal frequent itemsets.
Walter de Gruyter GmbH
Title: An algebraic semigroup method for discovering maximal frequent itemsets
Description:
Abstract
Discovering maximal frequent itemsets is an important issue and key technique in many data mining problems such as association rule mining.
In the literature, generating maximal frequent itemsets proves either to be NP-hard or to have
O
(
l
3
4
l
(
m
+
n
)
)
O\left({l}^{3}{4}^{l}\left(m+n))
complexity in the worst case from the perspective of generating maximal complete bipartite graphs of a bipartite graph, where
m
m
,
n
n
are the item number and the transaction number, respectively, and
l
l
denotes the maximum of
∣
C
∣
∣
Ψ
(
C
)
∣
/
(
∣
C
∣
+
∣
Ψ
(
C
)
∣
−
1
)
| C| | \Psi \left(C)| \hspace{0.
1em}\text{/}\hspace{0.
1em}\left(| C| +| \Psi \left(C)| -1)
, with the maximum taken over all maximal frequent itemsets
C
C
.
In this article, we put forward a method for discovering maximal frequent itemsets, whose complexity is
O
(
3
m
n
2
β
+
4
β
n
)
O\left(3mn{2}^{\beta }+{4}^{\beta }n)
, lower than the known complexity both in the worst case, from the perspective of semigroup algebra, where
β
\beta
is the number of items whose support is more than the minimum support threshold.
Experiments also show that an algorithm based on the algebraic method performs better than the other three well-known algorithms.
Meanwhile, we explore some algebraic properties with respect to items and transactions, prove that the maximal frequent itemsets are exactly the simplified generators of frequent itemsets, give a necessary and sufficient condition for a maximal
i
+
1
i+1
-frequent itemset being a subset of a closed
i
i
-frequent itemset, and provide a recurrence formula of maximal frequent itemsets.
Related Results
BA-Semigroup
BA-Semigroup
Several algebraic structures have been studied by many authors to discuss the relationships among them. This article aims to study two algebraic structures, namely semigroup and BA...
Skew compact semigroups
Skew compact semigroups
<p>Skew compact spaces are the best behaving generalization of compact Hausdorff spaces to non-Hausdorff spaces. They are those (X ; τ ) such that there is another topology τ...
A Semigroup Is Finite Iff It Is Chain-Finite and Antichain-Finite
A Semigroup Is Finite Iff It Is Chain-Finite and Antichain-Finite
A subset A of a semigroup S is called a chain (antichain) if ab∈{a,b} (ab∉{a,b}) for any (distinct) elements a,b∈A. A semigroup S is called periodic if for every element x∈S there ...
Regular elements and the BQ - Property of transformation semigroups and rings of linear transformations
Regular elements and the BQ - Property of transformation semigroups and rings of linear transformations
An element x of a semigroup [ring] A is said to regular if there is an element y of A such that x = xyx, and A is called a regular semigroup [(Von Neumann) regular ring] if every e...
An efficient algorithm to extract Skyline itemsets
An efficient algorithm to extract Skyline itemsets
Mining skyline frequent-utility patterns (SFUPs) is the discovery of itemsets that surpasses all other itemsets in both frequency and utility in transactional database. The discove...
Generated Fuzzy Quasi-ideals in Ternary Semigroups
Generated Fuzzy Quasi-ideals in Ternary Semigroups
Here in this paper, we provide characterizations of fuzzy quasi-ideal in terms of level and strong level subsets. Along with it, we provide expression for the generated fuzzy quasi...
Masticatory muscle activation patterns manifested by changes in index values
Masticatory muscle activation patterns manifested by changes in index values
Relevance. Surface electromyography (sEMG) is a method used to record the bioelectrical activity of masticatory muscles both at rest and during movement. This method generates rela...
Distributed frequent hierarchical pattern mining for robust and efficient large-scale association discovery
Distributed frequent hierarchical pattern mining for robust and efficient large-scale association discovery
Frequent pattern mining is a classic data mining technique, generally applicable to a wide range of application domains, and a mature area of research. The fundamental challenge ar...

