URI | http://purl.tuc.gr/dl/dias/D90992AF-1B30-4B4C-B517-2FF9075D0ED5 | - |
Identifier | http://www.sciencedirect.com/science/article/pii/S0306437908000598 | - |
Identifier | https://doi.org/10.1016/j.is.2008.06.002 | - |
Language | en | - |
Extent | 22 pages | en |
Title | Multi-query optimization for sketch-based estimation | en |
Creator | Dobra Alin | en |
Creator | Garofalakis Minos | en |
Creator | Γαροφαλακης Μινως | el |
Creator | Gehrke Johannes | en |
Creator | Rastogi Rajeev | en |
Publisher | Elsevier | en |
Abstract | Randomized techniques, based on computing small “sketch” synopses for each stream, have recently been shown to be a very effective tool for approximating the result of a single SQL query over streaming data tuples. In this paper, we investigate the problems arising when data-stream sketches are used to process multiple such queries concurrently. We demonstrate that, in the presence of multiple query expressions, intelligently sharing sketches among concurrent query evaluations can result in substantial improvements in the utilization of the available sketching space and the quality of the resulting approximation error guarantees. We provide necessary and sufficient conditions for multi-query sketch sharing that guarantee the correctness of the result-estimation process. We also investigate the difficult optimization problem of determining sketch-sharing configurations that are optimal (e.g., under a certain error metric for a given amount of space). We prove that optimal sketch sharing typically gives rise to NP-hard questions, and we propose novel heuristic algorithms for finding good sketch-sharing configurations in practice. Results from our experimental study with queries from the TPC-H benchmark verify the effectiveness of our approach, clearly demonstrating the benefits of our sketch-sharing methodology. | en |
Type | Peer-Reviewed Journal Publication | en |
Type | Δημοσίευση σε Περιοδικό με Κριτές | el |
License | http://creativecommons.org/licenses/by/4.0/ | en |
Date | 2015-10-29 | - |
Publication Date | 2009 | - |
Subject | Data streaming | en |
Subject | Sketches | en |
Subject | Approximate query processing | en |
Subject | Multi-query optimization | en |
Bibliographic Citation | A. Dobra, M. Garofalakis, J. Gehrke and R. Rastogi, "Multi-query optimization for sketch-based estimation", Inform. Syst., vol. 34, no. 2, pp. 209-230, Apr. 2009. doi:10.1016/j.is.2008.06.002 | en |