View selection using randomized search

Authors:

Highlights:

Abstract

An important issue in data warehouse development is the selection of a set of views to materialize in order to accelerate on-line analytical processing (OLAP) queries, given space and maintenance-time constraints. Existing methods provide good results, but their high execution cost limits their applicability to large problems. In this paper, we explore the application of randomized local search algorithms to the view selection problem. The efficiency of the proposed techniques is evaluated using synthetic datasets that cover a wide range of data and query distributions. The results show that randomized search methods provide near-optimal solutions in limited time and are robust to data and query skew. Furthermore, they can be easily adapted to various versions of the problem, including the simultaneous existence of size and time constraints, and view selection in dynamic environments. The proposed heuristics scale well with the problem size and are therefore particularly useful for real-life warehouses, which need to be analyzed from numerous business perspectives.
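
To make the idea of randomized local search for view selection concrete, the following is a minimal sketch under stated assumptions, not the algorithms evaluated in the paper (the abstract does not specify them). It assumes a toy cost model in which each query reads the smallest materialized view able to answer it, falling back to the base table, and it handles only a space constraint. The function names, parameters (`restarts`, `max_fails`), and the toggle-one-view move are all hypothetical choices made for illustration.

```python
import random


def query_cost(selected, queries, view_size, base_size):
    """Total cost under the assumed toy model: each query reads the
    smallest selected view that can answer it, else the base table."""
    total = 0
    for answerable_by in queries:
        candidates = [view_size[v] for v in answerable_by if v in selected]
        total += min(candidates, default=base_size)
    return total


def iterative_improvement(view_size, queries, base_size, space_limit,
                          restarts=20, max_fails=50, seed=0):
    """Randomized iterative improvement with restarts (illustrative only).

    A state is a set of materialized views. A move toggles one randomly
    chosen view; it is accepted only if the new state fits in
    space_limit and strictly lowers the total query cost.
    """
    rng = random.Random(seed)
    views = sorted(view_size)
    best = frozenset()
    best_cost = query_cost(best, queries, view_size, base_size)

    for _ in range(restarts):
        # Build a random feasible starting state.
        state, used = set(), 0
        for v in rng.sample(views, len(views)):
            if rng.random() < 0.5 and used + view_size[v] <= space_limit:
                state.add(v)
                used += view_size[v]
        cost = query_cost(state, queries, view_size, base_size)

        # Walk downhill until max_fails consecutive moves are rejected.
        fails = 0
        while fails < max_fails:
            neighbor = state ^ {rng.choice(views)}
            n_used = sum(view_size[x] for x in neighbor)
            n_cost = query_cost(neighbor, queries, view_size, base_size)
            if n_used <= space_limit and n_cost < cost:
                state, cost, fails = neighbor, n_cost, 0
            else:
                fails += 1

        if cost < best_cost:
            best, best_cost = frozenset(state), cost

    return best, best_cost
```

For example, with hypothetical sizes `{"A": 50, "B": 30, "C": 20}`, queries answerable by `{"A", "B"}`, `{"C"}`, and `{"A"}`, a base table of size 100, and a space limit of 60, the call `iterative_improvement(sizes, queries, 100, 60)` returns a feasible set of views and its cost. Restarts matter here because a toggle-only neighborhood can get stuck: from `{"A"}` no single toggle is both feasible and cheaper, even though `{"B", "C"}` has a lower total cost.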

Keywords: View selection, On-line analytical processing, Data warehouse

Review history: Received 9 October 2000, Revised 8 June 2001, Accepted 10 October 2001, Available online 22 March 2002.

Official URL: https://doi.org/10.1016/S0169-023X(02)00045-9