Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Approximate Integrity Constraints in Incomplete Databases With Limited Domains 

View through CrossRef
Abstract In case of incomplete database tables, a possible world is obtained by replacing any missing value by a value from the corresponding attribute's domain that can be infinite. A possible key or possible functional dependency constraint is satisfied by an incomplete table if we can obtain a possible world that satisfies the given key or functional dependency. On the other hand, a certain key or certain functional dependency holds if all possible worlds satisfy the constraint, A strongly possible constraint is an intermediate concept between possible and certain constraints, based on the strongly possible world approach (a strongly possible world is obtained by replacing \nul's by a value from the ones appearing in the corresponding attribute of the table).A strongly possible key or functional dependency holds in an incomplete table if there exists a strongly possible world that satisfies the given constraint. In the present paper, we introduce strongly possible versions of multivalued dependencies and cross joins, and we analyse the complexity of checking the validity of a given strongly possible cross joins.We also study approximation measures of strongly possible keys (spKeys), functional dependencies (spFDs), multivalued dependencies (spMVDs) and cross joins (spCJs). $g_3$ and $g_5$ measures are used to measure how close a table $Y$ satisfies a constraint if it is violated in $T$. Where the two measures $g_3$ and $g_5$ represent the ratio of the minimum number of tuples that are required to be removed from or added to, respectively, the table so that the constraint holds. Removing tuples may remove the cases that caused the constraint violation and adding tuples can extend the values shown on an attribute. For spKeys and spFDs, We show that the $g_3$ value is always an upper bound of the $g_5$ value for a given constraint in a table. However, there are tables of arbitrarily large number of tuples and a constant number of attributes that satisfy $g_3-g_5=\frac{p}{q}$ for any rational number $0\le\frac{p}{q}<1$. On the other hand, we show that the two measures values are independent of each other in the case of spMVDs and spCJs.We also treat complexity questions of determination of the approximation values.
Springer Science and Business Media LLC
Title: Approximate Integrity Constraints in Incomplete Databases With Limited Domains 
Description:
Abstract In case of incomplete database tables, a possible world is obtained by replacing any missing value by a value from the corresponding attribute's domain that can be infinite.
A possible key or possible functional dependency constraint is satisfied by an incomplete table if we can obtain a possible world that satisfies the given key or functional dependency.
On the other hand, a certain key or certain functional dependency holds if all possible worlds satisfy the constraint, A strongly possible constraint is an intermediate concept between possible and certain constraints, based on the strongly possible world approach (a strongly possible world is obtained by replacing \nul's by a value from the ones appearing in the corresponding attribute of the table).
A strongly possible key or functional dependency holds in an incomplete table if there exists a strongly possible world that satisfies the given constraint.
In the present paper, we introduce strongly possible versions of multivalued dependencies and cross joins, and we analyse the complexity of checking the validity of a given strongly possible cross joins.
We also study approximation measures of strongly possible keys (spKeys), functional dependencies (spFDs), multivalued dependencies (spMVDs) and cross joins (spCJs).
$g_3$ and $g_5$ measures are used to measure how close a table $Y$ satisfies a constraint if it is violated in $T$.
Where the two measures $g_3$ and $g_5$ represent the ratio of the minimum number of tuples that are required to be removed from or added to, respectively, the table so that the constraint holds.
Removing tuples may remove the cases that caused the constraint violation and adding tuples can extend the values shown on an attribute.
For spKeys and spFDs, We show that the $g_3$ value is always an upper bound of the $g_5$ value for a given constraint in a table.
However, there are tables of arbitrarily large number of tuples and a constant number of attributes that satisfy $g_3-g_5=\frac{p}{q}$ for any rational number $0\le\frac{p}{q}<1$.
On the other hand, we show that the two measures values are independent of each other in the case of spMVDs and spCJs.
We also treat complexity questions of determination of the approximation values.

Related Results

Actualització consistent de bases de dades deductives
Actualització consistent de bases de dades deductives
En aquesta tesi, proposem un nou mètode per a l'actualització consistent de bases de dades deductives. Donada una petició d'actualització, aquest mètode tradueix de forma automàtic...
Developing guidelines for research institutions
Developing guidelines for research institutions
As introduced in Chapter 1, in this thesis, I developed guidelines to research institutions on how to foster research integrity. I did this by exploring how research institutions c...
Autoinhibition of cMyBP-C by its middle domains
Autoinhibition of cMyBP-C by its middle domains
AbstractCardiac myosin binding protein-C (cMyBP-C) is a sarcomere regulatory protein consisting of 11 well-folded immunoglobulin-like (Ig-like) and fibronectin type-III domains wit...
Non-Recommended Publishing Lists: Strategies for Detecting Deceitful Journals
Non-Recommended Publishing Lists: Strategies for Detecting Deceitful Journals
Abstract The rapid growth of open access publishing (OAP) has significantly improved the accessibility and dissemination of scientific knowledge. However, this expansion has also c...
Genetic Programming for Symbolic Regression on Incomplete Data
Genetic Programming for Symbolic Regression on Incomplete Data
<p><b>Symbolic regression is the process of constructing mathematical expressions that best fit given data sets, where a target variable is expressed in terms of input ...
Incremental checking and maintenance of UML/OCL integrity constraints
Incremental checking and maintenance of UML/OCL integrity constraints
Ensuring the data correctness of some information system is a crucial task. So, software engineers specify sets of integrity constraints that should be satisfied by the system's da...
libFLASM: a software library for fixed-length approximate string matching
libFLASM: a software library for fixed-length approximate string matching
Abstract Background Approximate string matching is the problem of finding all factors of a given text that are at a distance at most k from a given ...

Back to Top