Şahin, Muhammed Yağmur
2016-12-05
2016-12-05
2016-10
2016-10
2016-12-01
http://hdl.handle.net/11693/32564
Cataloged from PDF version of article.
Thesis (M.S.): Bilkent University, Department of Computer Engineering, İhsan Doğramacı Bilkent University, 2016.
Includes bibliographical references (leaves 46-48).
Today, many applications deal with high-volume data streams. These distributed stream processing applications process data on the fly and provide real-time distributed computing for big data. Because of the volume of data they process, some of these applications use data-parallel nodes. Managing state across distributed nodes is an important task in these applications, because it underlies use cases such as handling node failures, checkpointing, data enrichment, and re-partitioning. Distributed stream processing applications therefore need a state management mechanism. In this thesis, we present a transparent, locality-aware data partitioning and state management mechanism for distributed stream processing applications. The mechanism partitions data while preserving locality and transfers state among nodes transparently in order to adapt to changes in the partitioning. In addition, it provides operators with a high-performance state management facility that supports checkpointing scenarios. The idea is implemented as a pluggable library for the open-source distributed stream processing engine Apache Storm.
x, 48 leaves : charts (some color)
English
info:eu-repo/semantics/openAccess
Locality-aware state partitioning
Consistent Hash
Apache Storm
Locality-aware distributed state partitioning for stream processing systems
Veri katarı işleme sistemleri için veri yerelliği farkındalığı olan dağıtık durum bölümlendirmesi
Thesis
B154854