Efficient Execution of Multi-Query Data Analysis Batches Using Compiler Optimization Strategies

TitleEfficient Execution of Multi-Query Data Analysis Batches Using Compiler Optimization Strategies
Publication TypeJournal Articles
Year of Publication2004
AuthorsAndrade H, Aryangat S, Kurc T, Saltz J, Sussman A
JournalLanguages and Compilers for Parallel Computing
Date Published2004///
Abstract

This work investigates the leverage that can be obtained from compiler optimization techniques for efficient execution of multi-query workloads in data analysis applications. Our approach is to address multi-query optimization at the algorithmic level, by transforming a declarative specification of scientific data analysis queries into a high-level imperative program that can be made more efficient by applying compiler optimization techniques. These techniques – including loop fusion, common subexpression elimination and dead code elimination – are employed to allow data and computation reuse across queries. We describe a preliminary experimental analysis on a real remote sensing application that analyzes very large quantities of satellite data. The results show our techniques achieve sizable reductions in the amount of computation and I/O necessary for executing query batches and in average execution times for the individual queries in a given batch.

DOI10.1007/978-3-540-24644-2_33