Web-site-based partitioning techniques for efficient parallelization of the PageRank computation
buir.advisor | Aykanat, Cevdet | |
dc.contributor.author | Cevahir, Ali | |
dc.date.accessioned | 2016-07-01T11:08:17Z | |
dc.date.available | 2016-07-01T11:08:17Z | |
dc.date.issued | 2006 | |
dc.description | Cataloged from PDF version of article. | en_US |
dc.description.abstract | Web search engines use ranking techniques to order Web pages in query results. PageRank is an important technique, which orders Web pages according to the linkage structure of the Web. The efficiency of the PageRank computation is important since the constantly evolving nature of the Web requires this computation to be repeated many times. PageRank computation includes repeated iterative sparse matrix-vector multiplications. Due to the enormous size of the Web matrix to be multiplied, PageRank computations are usually carried out on parallel systems. However, efficiently parallelizing PageRank is not an easy task, because of the irregular sparsity pattern of the Web matrix. Graph and hypergraphpartitioning-based techniques are widely used for efficiently parallelizing matrixvector multiplications. Recently, a hypergraph-partitioning-based decomposition technique for fast parallel computation of PageRank is proposed. This technique aims to minimize the communication overhead of the parallel matrix-vector multiplication. However, the proposed technique has a high prepropocessing time, which makes the technique impractical. In this work, we propose 1D (rowwise and columnwise) and 2D (fine-grain and checkerboard) decomposition models using web-site-based graph and hypergraph-partitioning techniques. Proposed models minimize the communication overhead of the parallel PageRank computations with a reasonable preprocessing time. The models encapsulate not only the matrix-vector multiplication, but the overall iterative algorithm. Conducted experiments show that the proposed models achieve fast PageRank computation with low preprocessing time, compared with those in the literature. | en_US |
dc.description.provenance | Made available in DSpace on 2016-07-01T11:08:17Z (GMT). No. of bitstreams: 1 0003187.pdf: 562898 bytes, checksum: 7c80e5f66f17a068360f6a8d8abeab76 (MD5) Previous issue date: 2006 | en |
dc.description.statementofresponsibility | Cevahir, Ali | en_US |
dc.format.extent | xii, 78 leaves, graphics | en_US |
dc.identifier.itemid | BILKUTUPB100118 | |
dc.identifier.uri | http://hdl.handle.net/11693/29894 | |
dc.language.iso | English | en_US |
dc.rights | info:eu-repo/semantics/openAccess | en_US |
dc.subject | PageRank | en_US |
dc.subject | Parallel Sparse-Matrix Vector Multiplication | en_US |
dc.subject | Graph and Hypergraph Partitioning | en_US |
dc.subject.lcc | QA188 .C49 2006 | en_US |
dc.subject.lcsh | Sparse matrices Data processing. | en_US |
dc.title | Web-site-based partitioning techniques for efficient parallelization of the PageRank computation | en_US |
dc.type | Thesis | en_US |
thesis.degree.discipline | Computer Engineering | |
thesis.degree.grantor | Bilkent University | |
thesis.degree.level | Master's | |
thesis.degree.name | MS (Master of Science) |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- 0003187.pdf
- Size:
- 549.71 KB
- Format:
- Adobe Portable Document Format
- Description:
- Full printable version