Kankaya, Hüseyin Emre2016-01-082016-01-082009http://hdl.handle.net/11693/14777Ankara : The Department of Electrical and Electronics Engineering and the Institute of Engineering and Sciences of Bilkent University, 2009.Thesis (Ph.D.) -- Bilkent University, 2009.Includes bibliographical references leaves 140-147.Today’s state-of-the-art search engines utilize the inverted index data structure for fast text retrieval on large document collections. To parallelize the retrieval process, the inverted index should be distributed among multiple index servers. Generally the distribution of the inverted index is done in either a term-based or a document-based fashion. The performances of both schemes depend on the total number of disk accesses and the total volume of communication in the system. The classical approach for both distributions is to use the Central Broker Query Evaluation Scheme (CB) for parallel text retrieval. It is known that in this approach the central broker is heavily loaded and becomes a bottleneck. Recently, an alternative query evaluation technique, named Pipelined Query Evaluation Scheme (PPL), has been proposed to alleviate this problem by performing the merge operation on the index servers. In this study, we analyze the scalability and relative performances of the CB and PPL under various query loads to report the benefits and drawbacks of each method.xv, 147 leaves, graphicsEnglishinfo:eu-repo/semantics/openAccessparallel text retrievaldocument-based distrributionterm-based distributionpipelined query evaluationcentral broker query evaluationTK5101 .K35 2009Telecommunication systems.Computer networks.Queuing theory.Markov processing.Performance evaluation of telecommunication networks based on multi-regime fluid queuesThesis