Summarizability in a multidimensional (MD) database refers to the correct reusability of pre-computed aggregate queries (or views) when computing higher-level aggregations or roll-ups. A dimension instance has this property if and only if it is strict and homogeneous. A dimension instance may fail to satisfy either of these two semantics conditions, and has to be repaired, restoring strictness and homogeneity. In this work, we take a relational approach to the problem of repairing dimension instances. A dimension repair is obtained by translating the dimension instance into a relational instance, repairing the latter using established techniques in the relational framework, and properly inverting the process. We show that the common relational star and snowflake schemas for MD databases are not the best choice for this process. Actually, for this purpose, we propose and formalize the path relational schema, which becomes the basis for obtaining dimensional repairs. The path schema turns out to have useful properties in general, as a basis for a relational representation and implementation of MD databases and data warehouses. It is also particularly suitable for restoring MD summarizability through relational repairs. We compare the dimension repairs so obtained with existing repair approaches for MD databases.

Additional Metadata
Keywords multidimensional DBs, repairs, semantic constraints
Persistent URL
Conference 15th International Conference on Extending Database Technology, EDBT 2012
Yaghmaie, M. (Mahkameh), Bertossi, L, & Ariyan, S. (Sina). (2012). Repair-oriented relational schemas for multidimensional databases. Presented at the 15th International Conference on Extending Database Technology, EDBT 2012. doi:10.1145/2247596.2247644