Parallel text retrieval on PC clusters

buir.advisorAykanat, Cevdet
dc.contributor.authorÇatal, Aytül
dc.date.accessioned2016-07-01T10:58:57Z
dc.date.available2016-07-01T10:58:57Z
dc.date.issued2003
dc.descriptionCataloged from PDF version of article.en_US
dc.description.abstractThe inverted index partitioning problem is investigated for parallel text retrieval systems. The objective is to perform efficient query processing on an inverted index distributed across a PC cluster. Alternative strategies are considered and evaluated for inverted index partitioning, where index entries are distributed according to their document-ids or term-ids. The performance of both partitioning schemes depend on the total number of disk accesses and the total volume of communication in the system. In document-id partitioning, the total volume of communication is naturally minimum, whereas the total number of disk accesses may be larger compared to term-id partitioning. On the other hand, in term-id partitioning the total number of disk accesses is already equivalent to the lower bound achieved by the sequential algorithm, albeit the total communication volume may be quite large. The studies done so far perform these partitioning schemes in a round-robin fashion and compare the performance of them by simulation. In this work, a parallel text retrieval system is designed and implemented on a PC cluster. We adopted hypergraph-theoretical partitioning models and carried out performance comparison of round-robin and hypergraph-theoretical partitioning schemes on our parallel text retrieval system. We also designed and implemented a query interface and a user interface of our system.en_US
dc.description.provenanceMade available in DSpace on 2016-07-01T10:58:57Z (GMT). No. of bitstreams: 1 0002397.pdf: 559615 bytes, checksum: cf31075e54bdd2a82b4caaea36212692 (MD5) Previous issue date: 2003en
dc.description.statementofresponsibilityÇatal, Aytülen_US
dc.format.extentxi, 56 leaves, tables, graphics, 30 cmen_US
dc.identifier.itemidBILKUTUPB072124
dc.identifier.urihttp://hdl.handle.net/11693/29391
dc.language.isoEnglishen_US
dc.rightsinfo:eu-repo/semantics/openAccessen_US
dc.subjectParallel text retrievalen_US
dc.subjectsystem performanceen_US
dc.subjectinverted index partitioningen_US
dc.subjectparallel query processingen_US
dc.subjectinverted indexen_US
dc.subject.lccQA76.5 .C38 2003en_US
dc.subject.lcshParallel processing (Electronic computers).en_US
dc.titleParallel text retrieval on PC clustersen_US
dc.typeThesisen_US
thesis.degree.disciplineComputer Engineering
thesis.degree.grantorBilkent University
thesis.degree.levelMaster's
thesis.degree.nameMS (Master of Science)

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
0002397.pdf
Size:
546.5 KB
Format:
Adobe Portable Document Format
Description:
Full printable version