Javascript must be enabled to continue!
An algebraic semigroup method for discovering maximal frequent itemsets
View through CrossRef
Abstract
Discovering maximal frequent itemsets is an important issue and key technique in many data mining problems such as association rule mining. In the literature, generating maximal frequent itemsets proves either to be NP-hard or to have
O
(
l
3
4
l
(
m
+
n
)
)
O\left({l}^{3}{4}^{l}\left(m+n))
complexity in the worst case from the perspective of generating maximal complete bipartite graphs of a bipartite graph, where
m
m
,
n
n
are the item number and the transaction number, respectively, and
l
l
denotes the maximum of
∣
C
∣
∣
Ψ
(
C
)
∣
/
(
∣
C
∣
+
∣
Ψ
(
C
)
∣
−
1
)
| C| | \Psi \left(C)| \hspace{0.1em}\text{/}\hspace{0.1em}\left(| C| +| \Psi \left(C)| -1)
, with the maximum taken over all maximal frequent itemsets
C
C
. In this article, we put forward a method for discovering maximal frequent itemsets, whose complexity is
O
(
3
m
n
2
β
+
4
β
n
)
O\left(3mn{2}^{\beta }+{4}^{\beta }n)
, lower than the known complexity both in the worst case, from the perspective of semigroup algebra, where
β
\beta
is the number of items whose support is more than the minimum support threshold. Experiments also show that an algorithm based on the algebraic method performs better than the other three well-known algorithms. Meanwhile, we explore some algebraic properties with respect to items and transactions, prove that the maximal frequent itemsets are exactly the simplified generators of frequent itemsets, give a necessary and sufficient condition for a maximal
i
+
1
i+1
-frequent itemset being a subset of a closed
i
i
-frequent itemset, and provide a recurrence formula of maximal frequent itemsets.
Walter de Gruyter GmbH
Title: An algebraic semigroup method for discovering maximal frequent itemsets
Description:
Abstract
Discovering maximal frequent itemsets is an important issue and key technique in many data mining problems such as association rule mining.
In the literature, generating maximal frequent itemsets proves either to be NP-hard or to have
O
(
l
3
4
l
(
m
+
n
)
)
O\left({l}^{3}{4}^{l}\left(m+n))
complexity in the worst case from the perspective of generating maximal complete bipartite graphs of a bipartite graph, where
m
m
,
n
n
are the item number and the transaction number, respectively, and
l
l
denotes the maximum of
∣
C
∣
∣
Ψ
(
C
)
∣
/
(
∣
C
∣
+
∣
Ψ
(
C
)
∣
−
1
)
| C| | \Psi \left(C)| \hspace{0.
1em}\text{/}\hspace{0.
1em}\left(| C| +| \Psi \left(C)| -1)
, with the maximum taken over all maximal frequent itemsets
C
C
.
In this article, we put forward a method for discovering maximal frequent itemsets, whose complexity is
O
(
3
m
n
2
β
+
4
β
n
)
O\left(3mn{2}^{\beta }+{4}^{\beta }n)
, lower than the known complexity both in the worst case, from the perspective of semigroup algebra, where
β
\beta
is the number of items whose support is more than the minimum support threshold.
Experiments also show that an algorithm based on the algebraic method performs better than the other three well-known algorithms.
Meanwhile, we explore some algebraic properties with respect to items and transactions, prove that the maximal frequent itemsets are exactly the simplified generators of frequent itemsets, give a necessary and sufficient condition for a maximal
i
+
1
i+1
-frequent itemset being a subset of a closed
i
i
-frequent itemset, and provide a recurrence formula of maximal frequent itemsets.
Related Results
Editorial Messages
Editorial Messages
Just as it has been continually happening in the world of mathematical sciences, the group of mathematical scientists led by (for example) Professor Eyup Cetin and his colleagues (...
Order-preserving generalized transformation semigroups
Order-preserving generalized transformation semigroups
For a set X, let P(X), T(X) and I(X) denote respectively the partial transformation semigroup on X, the full transformation semigroup on X and the 1-1 partial transformation semigr...
Semiprimeness of semigroup algebras
Semiprimeness of semigroup algebras
AbstractAbundant semigroups originate from p.p. rings and are generalizations of regular semigroups. The main aim of this paper is to study the primeness and the primitivity of abu...
Declarative Approaches for Mining Frequent Itemsets over Transactional Databases
Declarative Approaches for Mining Frequent Itemsets over Transactional Databases
Approches déclaratives pour l'extraction des itemsets fréquents à partir des bases de données transactionnelles
La fouille de données est une étape primordiale du p...
Fouille de représentations concises des motifs fréquents à travers les espaces de recherche conjonctif et disjonctif
Fouille de représentations concises des motifs fréquents à travers les espaces de recherche conjonctif et disjonctif
Durant ces dernières années, les quantités de données collectées, dans divers domaines d'application de l'informatique, deviennent de plus en plus importantes. Cela suscite le beso...
Letter from the Editors
Letter from the Editors
“The present moment seems a very appropriate one to launch a new journal on Algebraic Statistics”Fabrizio Catanese, Editor of the Journal of Algebraic GeometryMany classical statis...
The honest embedding dimension of a numerical semigroup
The honest embedding dimension of a numerical semigroup
Abstract
To a curve germ in
d
-space we can associate a numerical semigroup called its value semigrou...
CONCISE: An Algorithm for Mining Positive and Negative Non-Redundant Association Rules
CONCISE: An Algorithm for Mining Positive and Negative Non-Redundant Association Rules
One challenge problem in association rules mining is the huge size of the extracted rule set many of which are uninteresting and redundant. In this paper, we propose an efficient a...

