Elastic scaling for data stream processing
dc.citation.epage | 1463 | en_US |
dc.citation.issueNumber | 6 | en_US |
dc.citation.spage | 1447 | en_US |
dc.citation.volumeNumber | 25 | en_US |
dc.contributor.author | Gedik, B. | en_US |
dc.contributor.author | Schneider S. | en_US |
dc.contributor.author | Hirzel M. | en_US |
dc.contributor.author | Wu, Kun-Lung | en_US |
dc.date.accessioned | 2016-02-08T11:03:15Z | |
dc.date.available | 2016-02-08T11:03:15Z | |
dc.date.issued | 2014 | en_US |
dc.department | Department of Computer Engineering | en_US |
dc.description.abstract | This article addresses the profitability problem associated with auto-parallelization of general-purpose distributed data stream processing applications. Auto-parallelization involves locating regions in the application's data flow graph that can be replicated at run-time to apply data partitioning, in order to achieve scale. In order to make auto-parallelization effective in practice, the profitability question needs to be answered: How many parallel channels provide the best throughput? The answer to this question changes depending on the workload dynamics and resource availability at run-time. In this article, we propose an elastic auto-parallelization solution that can dynamically adjust the number of channels used to achieve high throughput without unnecessarily wasting resources. Most importantly, our solution can handle partitioned stateful operators via run-time state migration, which is fully transparent to the application developers. We provide an implementation and evaluation of the system on an industrial-strength data stream processing platform to validate our solution. © 1990-2012 IEEE. | en_US |
dc.description.provenance | Made available in DSpace on 2016-02-08T11:03:15Z (GMT). No. of bitstreams: 1 bilkent-research-paper.pdf: 70227 bytes, checksum: 26e812c6f5156f83f0e77b261a471b5a (MD5) Previous issue date: 2014 | en |
dc.identifier.doi | 10.1109/TPDS.2013.295 | en_US |
dc.identifier.issn | 1045-9219 | en_US |
dc.identifier.uri | http://hdl.handle.net/11693/26675 | en_US |
dc.language.iso | English | en_US |
dc.publisher | IEEE Computer Society | en_US |
dc.relation.isversionof | http://dx.doi.org/10.1109/TPDS.2013.295 | en_US |
dc.source.title | IEEE Transactions on Parallel and Distributed Systems | en_US |
dc.subject | Data stream processing | en_US |
dc.subject | Elasticity | en_US |
dc.subject | Parallelization | en_US |
dc.subject | Data flow analysis | en_US |
dc.subject | Data flow graphs | en_US |
dc.subject | Profitability | en_US |
dc.subject | Application developers | en_US |
dc.subject | Auto-parallelization | en_US |
dc.subject | Data partitioning | en_US |
dc.subject | Distributed data stream processing | en_US |
dc.subject | Parallel channel | en_US |
dc.subject | Parallelizations | en_US |
dc.subject | Resource availability | en_US |
dc.subject | Data communication systems | en_US |
dc.title | Elastic scaling for data stream processing | en_US |
dc.type | Article | en_US |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Elastic scaling for data stream processing.pdf
- Size:
- 1.68 MB
- Format:
- Adobe Portable Document Format
- Description:
- Full printable version