SPL: an extensible language for distributed stream processing
ACM Transactions on Programming Languages and Systems
Association for Computing Machinery
Item Usage Stats
MetadataShow full item record
Big data is revolutionizing how all sectors of our economy do business, including telecommunication, transportation, medical, and finance. Big data comes in two flavors: data at rest and data in motion. Processing data in motion is stream processing. Stream processing for big data analytics often requires scale that can only be delivered by a distributed system, exploiting parallelism on many hosts and many cores. One such distributed stream processing system is IBM Streams. Early customer experience with IBM Streams uncovered that another core requirement is extensibility, since customers want to build high-performance domain-specific operators for use in their streaming applications. Based on these two core requirements of distribution and extensibility, we designed and implemented the Streams Processing Language (SPL). This article describes SPL with an emphasis on the language design, distributed runtime, and extensibility mechanism. SPL is now the gateway for the IBM Streams platform, used by our customers for stream processing in a broad range of application domains. © 2017 ACM.
Computer hardware description languages
Distributed parameter control systems
Distributed stream processing
Published Version (Please cite this version)http://dx.doi.org/10.1145/3039207
Showing items related by title, author, creator and subject.
Akar, N. (Taylor and Francis Inc., 2015)A novel algorithmic method is proposed to fit matrix geometric distributions of desired order to empirical data or arbitrary discrete distributions. The proposed method effectively combines two existing approaches from two ...
Quantifying input uncertainty in an assemble-to-order system simulation with correlated input variables of mixed types Akçay, Alp; Biller, B. (IEEE, 2014)We consider an assemble-to-order production system where the product demands and the time since the last customer arrival are not independent. The simulation of this system requires a multivariate input model that generates ...
Kosar, T.; Akturk I.; Balman, M.; Wang X. (2011)Modern collaborative science has placed increasing burden on data management infrastructure to handle the increasingly large data archives generated. Beside functionality, reliability and availability are also key factors ...