A Data Warehouse (DW) is a data repository that organizes and physically integrates data from multiple sources under special kinds of schemas. A DW is composed by a set of dimensions that reflect the way the data is structured, and the facts that correspond to quantitative data related with the dimensions. A dimension schema is a hierarchical graph of categories. A dimension instance is strict if every element of the dimension has a unique ancestor element in each of the ancestor categories. This property is crucial for the efficiency of the system since it allows for the correct computation of aggregate queries using pre-computed views. A dimension instance may become non-strict after update operations. When this happens, the instance can be minimally repaired in several ways. In this paper we characterize consistent answers to aggregate queries by means of smallest ranges that contain the answers obtained from every minimal repair. We also introduce the notion of canonical dimension which captures information about all the minimal repairs. We use this dimension to approximate consistent query answers.

Additional Metadata
Conference 3rd Alberto Mendelzon International Workshop on Foundations of Data Management, AMW 2009
Citation
Bertossi, L, Bravo, L. (Loreto), & Caniupán, M. (Mónica). (2009). Consistent query answering in Data Warehouses. Presented at the 3rd Alberto Mendelzon International Workshop on Foundations of Data Management, AMW 2009.