Baykan, İzzet Çağrı
2016-01-08
2016-01-08
2008
http://hdl.handle.net/11693/14779
Ankara : The Department of Computer Engineering and the Institute of Engineering and Science of Bilkent University, 2008.
Thesis (Master's) -- Bilkent University, 2008.
Includes bibliographical references (leaves 43-46).

Compression of inverted indexes has received great attention in recent years. An inverted index consists of lists of document identifiers, also referred to as posting lists, for each term. Compressing an inverted index reduces the size of the index, which also improves query performance by reducing disk access times. Recent studies have shown that reassigning document identifiers has a great effect on the compression of an inverted index. In this work, we propose a novel technique that reassigns both the term and the document identifiers of an inverted index by transforming the matrix representation of the index into a block-diagonal form, which improves the compression ratio dramatically. We adapted the row-net hypergraph-partitioning model for the transformation into block-diagonal form, which improves the compression ratio by as much as 50%. To the best of our knowledge, this method performs more effectively than previous inverted index compression techniques.

ix, 46 leaves, graphs
English
info:eu-repo/semantics/openAccess
Inverted index
Inverted index compression
Block-diagonal form
Document identifier reassignment
Hypergraph partitioning
QA76.9.T48 B39 2008
Text processing (Computer science)
Information storage and retrieval systems.
Information retrieval.
Inverted index compression based on term and document identifier reassignment
Thesis
BILKUTUPB109724
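
The effect of document identifier reassignment described in the abstract can be illustrated with a minimal sketch. It is not the thesis's method: it assumes posting lists are stored as gaps between consecutive identifiers (d-gaps) and encoded with variable-byte coding, neither of which the abstract specifies, and the new identifier mapping is picked by hand rather than computed by hypergraph partitioning. The toy index, the mapping, and all function names are hypothetical. Only document identifiers are reassigned here; the thesis also reassigns term identifiers.

```python
def d_gaps(postings):
    """Convert a sorted posting list into gaps between consecutive IDs."""
    gaps = []
    prev = 0
    for doc_id in postings:
        gaps.append(doc_id - prev)
        prev = doc_id
    return gaps


def vbyte_len(n):
    """Bytes needed for a positive integer at 7 payload bits per byte."""
    length = 1
    while n >= 128:
        n >>= 7
        length += 1
    return length


def index_size(index):
    """Total encoded size in bytes of all posting lists (assumed encoding)."""
    return sum(
        vbyte_len(gap)
        for postings in index.values()
        for gap in d_gaps(sorted(postings))
    )


def reassign(index, mapping):
    """Rewrite every posting list using new document identifiers."""
    return {
        term: sorted(mapping[doc_id] for doc_id in postings)
        for term, postings in index.items()
    }


# Hypothetical toy index: documents 1, 300, 600 share the fruit terms,
# documents 2, 301, 601 share the vehicle term.
index = {
    "apple": [1, 300, 600],
    "pear": [1, 300, 600],
    "car": [2, 301, 601],
}

# Hand-picked mapping that gives documents with shared terms adjacent IDs,
# which is the kind of clustering a block-diagonal ordering aims for.
mapping = {1: 1, 300: 2, 600: 3, 2: 4, 301: 5, 601: 6}

before = index_size(index)
after = index_size(reassign(index, mapping))
print(before, after)
```

Under these assumptions the gaps shrink once documents that share terms receive neighboring identifiers, so each gap fits in fewer bytes. The savings on this toy input say nothing about the ratios reported in the thesis.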