Flow Algorithms for Parallel Query Optimization

TitleFlow Algorithms for Parallel Query Optimization
Publication TypeConference Papers
Year of Publication2008
AuthorsDeshpande A, Hellerstein L
Conference NameIEEE 24th International Conference on Data Engineering, 2008. ICDE 2008
Date Published2008/04/07/12
PublisherIEEE
ISBN Number978-1-4244-1836-7
KeywordsCasting, computational complexity, Cost function, Databases, Delay, distributed environment, Educational institutions, flow maximization algorithm, Interleaved codes, interoperator parallelism, minimisation, multiway join query response time minimization problem, parallel database, Parallel databases, parallel query optimization, Partitioning algorithms, pipeline processing, pipelined parallelism, polynomial-time algorithm, query planning problem, Query processing, Web service, Web services
Abstract

We address the problem of minimizing the response time of a multi-way join query using pipelined (inter-operator) parallelism, in a parallel or a distributed environment. We observe that in order to fully exploit the parallelism in the system, we must consider a new class of ";interleaving"; plans, where multiple query plans are used simultaneously to minimize the response time of a query (or to maximize the tuple-throughput of the system). We cast the query planning problem in this environment as a ";flow maximization problem";, and present polynomial-time algorithms that (statically) find the optimal set of plans to use for a given query, for a large class of multi-way join queries. Our proposed algorithms also naturally extend to query optimization over web services. Finally we present an extensive experimental evaluation that demonstrates both the need to consider such plans in parallel query processing and the effectiveness of our algorithms.

DOI10.1109/ICDE.2008.4497484