ADAPTIVE QUERY PROCESSING CSE 8330 Conventional Query Processing

Conventional Query Processing Statistics generation Query optimization Query execution

Motivations Unreliable cardinality estimates Complex queries Changes in the runtime environment Query interactivity Multi-query

Adaptive Query Processing Measureme nt Analysis Planning Actuation

Adaptive Query Processing Selection Ordering A-Greedy Eddies Join Processing Pipeline execution Non-pipeline execution

Adaptive Selection Ordering A-Greedy Eddies

Adaptive Join Processing Pipelined Execution History independent � M-Joins � Eddies with Ste. Ms

Adaptive Join Processing Eddies with M-Joins Eddies with Ste. Ms

Adaptive Optimization Techniques Plan partitioning Horizontal Partitioning Tuple routing Deferring plan decision until runtime

Challenges Effective Handling of Correlation Resource Sharing Developing Optimal Policies Developing Adaptation Metrics

• R. Avnur and J. M. Hellerstein, “Eddies: continuously adaptive query processing, ”

• P. J. Haas and J. M. Hellerstein, “Ripple joins for online aggregation,

Slides: 13

Download presentation

ADAPTIVE QUERY PROCESSING CSE 8330

Conventional Query Processing Statistics generation Query optimization Query execution

Motivations Unreliable cardinality estimates Complex queries Changes in the runtime environment Query interactivity Multi-query optimization

Adaptive Query Processing Measureme nt Analysis Planning Actuation

Adaptive Query Processing Selection Ordering A-Greedy Eddies Join Processing Pipeline execution Non-pipeline execution

Adaptive Selection Ordering A-Greedy Eddies

Adaptive Join Processing Pipelined Execution History independent � M-Joins � Eddies with Ste. Ms � A-Caching Non-Pipeline Execution History dependent � Corrective Query Processing � Eddies with STAIRs Plan Staging Mid-Query Reoptimization Query Scrambling

Adaptive Join Processing Eddies with M-Joins Eddies with Ste. Ms

Corrective Query Processing

Adaptive Optimization Techniques Plan partitioning Horizontal Partitioning Tuple routing Deferring plan decision until runtime Smart operators

Challenges Effective Handling of Correlation Resource Sharing Developing Optimal Policies Developing Adaptation Metrics

• R. Avnur and J. M. Hellerstein, “Eddies: continuously adaptive query processing, ” in SIGMOD ’ 00: Proceedings of the 2000 ACM SIGMOD international conference on Management of data, (New York, NY, USA), pp. 261– 272, ACM Press, 2000. • S. Babu, R. Motwani, K. Munagala, I. Nishizawa, and J. Widom, “Adaptive ordering of pipelined stream filters, ” in SIGMOD ’ 04: Proceedings of the 2004 ACM SIGMOD international conference on Management of data, (New York, NY, USA), pp. 407– 418, ACM Press, 2004. • S. Babu and J. Widom, “Strea. Mon: an adaptive engine for stream query processing, ” in SIGMOD ’ 04: Proceedings of the 2004 ACM SIGMOD international conference on Management of data, (New York, NY, USA), pp. 931– 932, ACM Press, 2004. • P. Bizarro and D. De. Witt, “Adaptive and robust query processing with SHARP, ” Tech. Rep. 1562, University of Wisconsin – Madison, CS Dept. , 2006. • C. M. Chen and N. Roussopoulos, “Adaptive selectivity estimation using query feedback, ” in SIGMOD ’ 94: Proceedings of the 1994 ACM SIGMOD international conference on Management of data, (New York, NY, USA), pp. 161– 172, ACM Press 1994. • A. Condon, A. Deshpande, L. Hellerstein, and N. Wu, “Flow algorithms for two pipelined filter ordering problems, ” in PODS ’ 06: Proceedings of the twenty-fifth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, (New York, NY, USA), pp. 193– 202, ACM Press, 2006. • A. Deshpande and J. M. Hellerstein, “Lifting the burden of history from adaptive query processing, ” in VLDB ’ 04: Proceedings of the 30 th International Conference on Very Large Data Bases, Toronto, Canada, August 29– September 3 2004. • A. Deshpande, Z. Ives and V. Raman, “Adaptive Query Processing” in Foundations and Trends in Databases, 2007. • S. Ewen, H. Kache, V. Markl, and V. Raman, “Progressive query optimization for federated queries, ” in EDBT ’ 06: Proceedings of the 10 th International Conference on Extending Database Technology, pp. 847– 864, 2006. • G. Graefe, R. Bunker, and S. Cooper, “Hash joins and hash teams in Microsoft SQL Server, ” in VLDB ’ 98: Proceedings of 24 th International Conference on Very Large Data Bases, pp. 86– 97, Morgan Kaufman, August 24– 27 1998. Bibliography CSE 8330

• P. J. Haas and J. M. Hellerstein, “Ripple joins for online aggregation, ” in SIGMOD’ 99: Proceedings of the 1999 ACM SIGMOD international conference on Management of data, (New York, NY, USA), pp. 287– 298, ACM Press, 1999. • J. -H. Hwang, M. Balazinska, A. Rasin, U. Cetintemel, M. Stonebraker, and S. Zdonik, “High-availability algorithms for distributed stream processing, ” in ICDE ’ 05: Proceedings of the 21 st International Conference on Data Engineering (ICDE’ 05), (Washington, DC, USA), pp. 779– 790, IEEE Computer Society, 2005. • N. Kabra and D. J. De. Witt, “Efficient mid-query re-optimization of suboptimal query execution plans, ” in SIGMOD ’ 98: Proceedings of the 1998 ACM SIGMOD international conference on Management of data, (New York, NY, USA), pp. 106– 117, ACM Press, 1998. • V. Markl, V. Raman, D. Simmen, G. Lohman, H. Pirahesh, and M. Cilimdzic, “Robust query processing through progressive optimization, ” in SIGMOD ’ 04: Proceedings of the 2004 ACM SIGMOD international conference on Management of data, (New York, NY, USA), pp. 659– 670, ACM Press, 2004. • N. Polyzotis, “Selectivity-based partitioning: A divide-and-union paradigm for effective query optimization, ” in CIKM ’ 05: Proceedings of the 14 th ACM International Conference on Information and knowledge management, pp. 720– 727, New York, NY: ACM Press, 2005. • V. Raman, A. Deshpande, and J. M. Hellerstein, “Using state modules for adaptive query processing. , ” in ICDE ’ 03: Proceedings of the 19 th International Conference on Data Engineering, Bangalore, India, pp. 353– 364, 2003. • E. A. Rundensteiner, L. Ding, T. M. Sutherland, Y. Zhu, B. Pielech, and N. Mehta, “CAPE: Continuous query engine with heterogeneous-grained adaptivity, ” in VLDB ’ 04: Proceedings of the Thirtieth International Conference on Very Large Data Bases, Toronto, Canada, pp. 1353– 1356, 2004. • M. Stillger, G. Lohman, V. Markl, and M. Kandil, “LEO – DB 2’s Learning Optimizer, ” in VLDB ’ 01: Proceedings of 27 th International Conference on Very Large Data Bases, Morgan Kaufmann, September 11– 14 2001. • F. Tian and D. J. De. Witt, “Tuple routing strategies for distributed eddies, ” in VLDB ’ 03: Proceedings of 29 th International Conference on Very Large Data Bases, pp. 333– 344, Berlin, Germany: Morgan Kaufmann, September 9– 12 2003. • D. Zhang, J. Li, K. Kimeli, and W. Wang, “Sliding window based multi-join algorithms over distributed data streams, ” in ICDE ’ 06: Proceedings of the 22 nd International Conference on Data Engineering (ICDE’ 06), (Washington, DC, USA), p. 139, IEEE Computer Society, 2006. Bibliography CSE 8330