Browsing by Keywords "Parallel computing"
Now showing items 118 of 18

Efficient overlapped FFT algorithms for hypercubeconnected multicomputers
(1994)In this work, we propose parallel FFT algorithms, for mediumtocoarse grain hypercubeconnected multicomputers, which are more elegant and efficient than the existing ones. The proposed algorithms achieve perfect loadbalance ... 
Encapsulating multiple communicationcost metrics in partitioning sparse rectangular matrices for parallel matrixvector multiplies
(SIAM, 2004)This paper addresses the problem of onedimensional partitioning of structurally unsymmetric square and rectangular sparse matrices for parallel matrixvector and matrixtransposevector multiplies. The objective is to ... 
Hypergraph models for parallel sparse matrixmatrix multiplication
(Bilkent University, 201509)Multiplication of two sparse matrices (i.e., sparse matrixmatrix multiplication, which is abbreviated as SpGEMM) is a widely used kernel in many applications such as molecular dynamics simulations, graph operations, and ... 
Largescale solutions of electromagnetics problems using the multilevel fast multipole algorithm and physical optics
(Bilkent University, 201504)Integral equations provide fullwave (accurate) solutions of Helmholtztype electromagnetics problems. The multilevel fast multipole algorithm (MLFMA) discretizes the equations and solves them numerically with O(NLogN) ... 
Latencycentric models and methods for scaling sparse operations
(Bilkent University, 201608)Parallelization of sparse kernels and operations on largescale distributed memory systems remains as a major challenge due to everincreasing scale of modern high performance computing systems and multiple con icting ... 
Matrix factorization with stochastic gradient descent for recommender systems
(Bilkent University, 201902)Matrix factorization is an efficient technique used for disclosing latent features of realworld data. It finds its application in areas such as text mining, image analysis, social network and more recently and popularly ... 
Modeldriven transformations for mapping parallel algorithms on parallel computing platforms
(CEURWS, 2013)One of the important problems in parallel computing is the mapping of the parallel algorithm to the parallel computing platform. Hereby, for each parallel node the corresponding code for the parallel nodes must be implemented. ... 
Onedimensional partitioning for heterogeneous systems: theory and practice
(Academic Press, 200811)We study the problem of onedimensional partitioning of nonuniform workload arrays, with optimal load balancing for heterogeneous systems. We look at two cases: chainonchain partitioning, where the order of the processors ... 
A parallel boundary element formulation for tracking multiple particle trajectories in Stoke's flow for microfluidic applications
(Tech Science Press, 2015)A new formulation for tracking multiple particles in slow viscous flow for microfluidic applications is presented. The method employs the manipulation of the boundary element matrices so that finally a system of equations ... 
Parallel image restoration using surrogate constraint methods
(Academic Press, 2007)When formulated as a system of linear inequalities, the image restoration problem yields huge, unstructured, sparse matrices even for images of small size. To solve the image restoration problem, we use the surrogate ... 
Parpatoh : A 2Dparallel hypergraph partitioning tool
(Bilkent University, 2006)Hypergraph partitioning is a process that is being used to find solutions for optimization problems in various areas, including parallel volume rendering, parallel information retrieval and VLSI circuit design. While the ... 
Partitioning sparse matrices for parallel preconditioned iterative methods
(Society for Industrial and Applied Mathematics, 2007)This paper addresses the parallelization of the preconditioned iterative methods that use explicit preconditioners such as approximate inverses. Parallelizing a full step of these methods requires the coefficient and ... 
Progressive refinement radiosity on ringconnected multicomputers
(Publ by ACM, New York, NY, United States, 1993)The progressive refinement method is investigated for parallelization on ringconnected multicomputers. A synchronous scheme, based on static task assignment, is proposed, in order to achieve better coherence during the ... 
A recursive graph bipartitioning algorithm by vertex separators with fixed vertices for permuting sparse matrices into block diagonal form with overlap
(Bilkent University, 2011)Solving sparse system of linear equations Ax=b using preconditioners can be effi ciently parallelized using graph partitioning tools. In this thesis, we investigate the problem of permuting a sparse matrix into a block ... 
Reducing communication volume overhead in largescale parallel SpGEMM
(Bilkent University, 201612)Sparse matrixmatrix multiplication of the form of C = A x B, C = A x A and C = A x AT is a key operation in various domains and is characterized with high complexity and runtime overhead. There exist models for parallelizing ... 
Revisiting hypergraph models for sparse matrix partitioning
(Society for Industrial and Applied Mathematics, 2007)We provide an exposition of hypergraph models for parallelizing sparse matrixvector multiplies. Our aim is to emphasize the expressive power of hypergraph models. First, we set forth an elementary hypergraph model for the ... 
Simulation of a flowing snow avalanche using molecular dynamics
(Bilkent University, 2010)This thesis presents an approach for modeling and simulation of a flowing snow avalanche, which is formed of dry and liquefied snow that slides down a slope, by using molecular dynamics and discrete element method. A ... 
Simultaneous input and output matrix partitioning for outerproductparallel sparse matrixmatrix multiplication
(Society for Industrial and Applied Mathematics, 20141023)FFor outerproductparallel sparse matrixmatrix multiplication (SpGEMM) of the form C=A×B, we propose three hypergraph models that achieve simultaneous partitioning of input and output matrices without any replication of ...