An optimal workload-based data allocation approach for multidisk databases

作者:

Highlights:

摘要

Parallel processing mechanisms and data layout approaches that significantly affect access performance of database systems have received increased attention in the last few decades. Multidisk allocation problems try to find an allocation of relations to disks such that expected query cost is minimized. Solving this NP-complete problem is extremely time-consuming, especially because the need for solution time rises exponentially as the number of 0–1 variables increases. This study presents a novel and efficient approach for deriving an optimal layout of relations on disks based on database statistics of access patterns and relation sizes. In addition to minimizing query cost, the proposed model allows replication of relations, minimizes storage cost, and enhances computational efficiency by reducing the number of 0–1 variables and constraints. Illustrative examples and experimental results demonstrate the advantages and efficiency of the proposed method.

论文关键词:Multi DBs,Data allocation,Query optimization,Integer programming

论文评审过程:Received 18 January 2008, Revised 4 February 2009, Accepted 5 February 2009, Available online 15 February 2009.

论文官网地址:https://doi.org/10.1016/j.datak.2009.02.001