Elastic scaling for data stream processing

Gedik, B.; Schneider S.; Hirzel M.; Wu, Kun-Lung

Elastic scaling for data stream processing

Files

Elastic scaling for data stream processing.pdf (1.68 MB)

Date

2014

Authors

Gedik, B.

Schneider S.

Hirzel M.

Wu, Kun-Lung

BUIR Usage Stats

3
views

18
downloads

Citation Stats

Abstract

This article addresses the profitability problem associated with auto-parallelization of general-purpose distributed data stream processing applications. Auto-parallelization involves locating regions in the application's data flow graph that can be replicated at run-time to apply data partitioning, in order to achieve scale. In order to make auto-parallelization effective in practice, the profitability question needs to be answered: How many parallel channels provide the best throughput? The answer to this question changes depending on the workload dynamics and resource availability at run-time. In this article, we propose an elastic auto-parallelization solution that can dynamically adjust the number of channels used to achieve high throughput without unnecessarily wasting resources. Most importantly, our solution can handle partitioned stateful operators via run-time state migration, which is fully transparent to the application developers. We provide an implementation and evaluation of the system on an industrial-strength data stream processing platform to validate our solution. © 1990-2012 IEEE.

Source Title

IEEE Transactions on Parallel and Distributed Systems

Publisher

IEEE Computer Society

Keywords

Data stream processing, Elasticity, Parallelization, Data flow analysis, Data flow graphs, Profitability, Application developers, Auto-parallelization, Data partitioning, Distributed data stream processing, Parallel channel, Parallelizations, Resource availability, Data communication systems

Permalink

http://hdl.handle.net/11693/26675

Published Version (Please cite this version)

http://dx.doi.org/10.1109/TPDS.2013.295

Collections

Scholarly Publications - Computer Engineering

Language

English

Type

Article

Full item page

Elastic scaling for data stream processing

Files

Date

Authors

Editor(s)

Advisor

Supervisor

Co-Advisor

Co-Supervisor

Instructor

BUIR Usage Stats

Citation Stats

Series

Abstract

Source Title

Publisher

Course

Other identifiers

Book Title

Keywords

Degree Discipline

Degree Level

Degree Name

Citation

Permalink

Published Version (Please cite this version)

Collections

Language

Type

Elastic scaling for data stream processing

Files

Date

Authors

Editor(s)

Advisor

Supervisor

Co-Advisor

Co-Supervisor

Instructor

BUIR Usage Stats

Citation Stats

Share

Series

Abstract

Source Title

Publisher

Course

Other identifiers

Book Title

Keywords

Degree Discipline

Degree Level

Degree Name

Citation

Permalink

Published Version (Please cite this version)

Collections

Language

Type