Review
Discovering and Exploiting Statistical Properties for Query Optimization in Relational Databases: A Survey
Article first published online: 16 JAN 2009
DOI: 10.1002/sam.10016
Copyright © 2009 Wiley Periodicals, Inc., A Wiley Company
Additional Information
How to Cite
Haas, P. J., Ilyas, I. F., Lohman, G. M. and Markl, V. (2009), Discovering and Exploiting Statistical Properties for Query Optimization in Relational Databases: A Survey. Statistical Analysis and Data Mining, 1: 223–250. doi: 10.1002/sam.10016
Publication History
- Issue published online: 25 FEB 2009
- Article first published online: 16 JAN 2009
- Manuscript Accepted: 5 AUG 2008
- Manuscript Revised: 28 JUL 2008
- Manuscript Received: 19 FEB 2008
- Abstract
- Article
- References
- Cited By
Keywords:
- relational database;
- query optimization;
- data mining;
- query feedback;
- sampling;
- statistical structure
Abstract
Discovering and exploiting statistical features in relational datasets is key to query optimization in a relational database management system (RDBMS ), and is also needed for database design, cleaning, and integration. This paper surveys a variety of methods for automatically discovering important statistical features such as correlations, functional dependencies, keys, and algebraic constraints. We discuss proactive approaches in which the data is scanned or sampled (periodically, at optimization time or at query time), or in which exploratory queries are executed. Also discussed are reactive approaches that monitor the results of the query processing. Finally, we discuss methods for dealing with the practical challenges of maintaining statistical information in the face of heavy system utilization, and of dealing with inconsistencies that arise from incomplete cardinality models, use of multiple discovery methods, or changes in the underlying data over time. © 2009 Wiley Periodicals, Inc. Statistical Analysis and Data Mining 1: 000-000, 2008

1932-1872/asset/SAM_left.gif?v=1&s=95cbbe8824e1aaad351b489e115d8595305bcb3f)
1932-1872/asset/cover.gif?v=1&s=0f380272270594bb30722e7b51bbc3f9b45039b2)