Efficient Query Processing on Distributed Stream Processing Engine

Information

Title Efficient Query Processing on Distributed Stream Processing Engine
Authors Manhui Han, Jonghem Youn, Sang-goo Lee
Year 2017 / 1
Keywords query optimization, distributed stream processing, cloud computing
Acknowledgement ITRC
Publication Type International Conference
Publication Proceedings of the 11th International Conference on Ubiquitous Information Management and Communication (IMCOM 2017)
Link doi

Abstract

Distributed stream processing engines, such as Storm and Samza, have been developed to process large-scale stream data. These engines scale out horizontally with a shared-nothing architecture, but they do not provide a high-level query language like SQL. Supporting a query language for flexible analysis has therefore become an important issue. In this paper, we provide efficient continuous relational query processing on a distributed stream processing engine. We propose a methodology for transforming queries into a form executable on the engine, together with an optimization technique for query processing. Our experimental results show that our methodology processes queries over data streams efficiently.