Time and space optimization for processing groups of multi-dimensional scientific queries

TitleTime and space optimization for processing groups of multi-dimensional scientific queries
Publication TypeConference Papers
Year of Publication2004
AuthorsAryangat S, Andrade H, Sussman A
Conference NameProceedings of the 18th annual international conference on Supercomputing
Date Published2004///
PublisherACM
Conference LocationNew York, NY, USA
ISBN Number1-58113-839-3
Keywordsmulti-query optimization, Scientific databases
Abstract

Data analysis applications in areas as diverse as remote sensing and telepathology require operating on and processing very large datasets. For such applications to execute efficiently, careful attention must be paid to the storage, retrieval, and manipulation of the datasets. This paper addresses the optimizations performed by a high performance database system that processes groups of data analysis requests for these applications, which we call queries. The system performs end-to-end processing of the requests, formulated as PostgreSQL declarative queries. The queries are converted into imperative descriptions, multiple imperative descriptions are merged into a single execution plan, the plan is optimized to decrease execution time via common compiler optimization techniques, and, finally, the plan is optimized to decrease memory consumption. The last two steps are experimentally shown to effectively reduc the amount of time required while conserving memory space as a group of queries is processed by the database.

URLhttp://doi.acm.org/10.1145/1006209.1006224
DOI10.1145/1006209.1006224