Towards Approximating Incomplete Queries over Partially Complete Databases (Extended Abstract)
Abstract
Motivation. Building reliable systems over partially complete data poses significant challenges because the queries they send to the available data retrieve answers that may differ significantly from the real answers. This may lead to a wrong understanding of the data and of the events and processes it describes. The problem is especially critical for analytical systems that aggregate retrieved data, since missing answers may significantly change the results of analytical computations; e.g., the computation of minimal or average values is sensitive to missing values [2,7]. One way to ensure the reliability of (analytical) systems over partially complete data is to guarantee that whatever data they touch is complete w.r.t. the real data. A possible way to model partial data completeness is with tuple generating dependencies (TGDs) [1] that specify which parts of a relation are complete [8–10]:
Similar resources
Expanding Queries to Incomplete Databases by Interpolating General Logic Programs
In databases, queries are usually defined on complete databases. In this paper we introduce and motivate the notion of extended queries that are defined on incomplete databases. We argue that the language of extended logic program is appropriate for representing extended queries. We show through examples that given a query, a particular extension of it has important characteristics which correspo...
Efficient Algorithms for Approximating Answers to Queries Against Incomplete Relational Databases
Computing Possible and Certain Answers over Order-Incomplete Data
This paper studies the complexity of query evaluation for databases whose relations are partially ordered; the problem commonly arises when combining ordered data from multiple sources. We focus on queries in a useful fragment of SQL, namely positive relational algebra with aggregates, whose bag semantics we extend to the partially ordered setting. Our semantics leads to the study of two main c...
Ontology-Mediated Queries for Probabilistic Databases (Extended Abstract)
The semantics of large-scale knowledge bases like NELL and Google’s Knowledge Vault is founded on (tuple-independent) probabilistic databases (PDBs) [3]. As for ordinary databases, they employ the closed-world assumption, i.e., missing facts are treated as being false (having the probability 0), which leads to unintuitive results when querying PDBs. Recently, open-world probabilistic databases ...
Efficient Evaluation of Well-designed Pattern Trees (Extended Abstract)
Conjunctive queries (CQs) constitute the core of the query languages for relational databases and also the most intensively studied querying mechanism in the database theory community. But CQs suffer from a serious drawback when dealing with incomplete information: If it is not possible to match the complete query with the data, they return no answer at all. The semantic web therefore provides ...
Publication date: 2017