Browsing by Keywords "Parallel computing"
Now showing items 1-20 of 22

Coloring for distributed-memory-parallel Gauss-Seidel algorithm
(Bilkent University, 2019-09) Gauss-Seidel is a well-known iterative method for solving linear system of equations. The computations performed on Gauss-Seidel sweeps are sequential in nature since each component of new iterations depends on previously ...
Domain specific language for deployment of parallel applications on parallel computing platforms
(Association for Computing Machinery, 2014-08) To increase the computing performance the current trend is towards applying parallel computing in which parallel tasks are executed on multiple nodes. The deployment of tasks on the computing platform usually impacts the ...
Efficient overlapped FFT algorithms for hypercube-connected multicomputers
(1994) In this work, we propose parallel FFT algorithms, for medium-to-coarse grain hypercube-connected multicomputers, which are more elegant and efficient than the existing ones. The proposed algorithms achieve perfect load-balance ...
Encapsulating multiple communication-cost metrics in partitioning sparse rectangular matrices for parallel matrix-vector multiplies
(SIAM, 2004) This paper addresses the problem of one-dimensional partitioning of structurally unsymmetric square and rectangular sparse matrices for parallel matrix-vector and matrix-transpose-vector multiplies. The objective is to ...
Hypergraph models for parallel sparse matrix-matrix multiplication
(Bilkent University, 2015-09) Multiplication of two sparse matrices (i.e., sparse matrix-matrix multiplication, which is abbreviated as SpGEMM) is a widely used kernel in many applications such as molecular dynamics simulations, graph operations, and ...
Large-scale solutions of electromagnetics problems using the multilevel fast multipole algorithm and physical optics
(Bilkent University, 2015-04) Integral equations provide full-wave (accurate) solutions of Helmholtz-type electromagnetics problems. The multilevel fast multipole algorithm (MLFMA) discretizes the equations and solves them numerically with O(NLogN) ...
Latency-centric models and methods for scaling sparse operations
(Bilkent University, 2016-08) Parallelization of sparse kernels and operations on large-scale distributed memory systems remains as a major challenge due to ever-increasing scale of modern high performance computing systems and multiple conflicting ...
Matrix factorization with stochastic gradient descent for recommender systems
(Bilkent University, 2019-02) Matrix factorization is an efficient technique used for disclosing latent features of real-world data. It finds its application in areas such as text mining, image analysis, social network and more recently and popularly ...
Model-driven transformations for mapping parallel algorithms on parallel computing platforms
(MDHPCL, 2013) One of the important problems in parallel computing is the mapping of the parallel algorithm to the parallel computing platform. Hereby, for each parallel node the corresponding code for the parallel nodes must be implemented. ...
One-dimensional partitioning for heterogeneous systems: theory and practice
(Academic Press, 2008-11) We study the problem of one-dimensional partitioning of nonuniform workload arrays, with optimal load balancing for heterogeneous systems. We look at two cases: chain-on-chain partitioning, where the order of the processors ...
A parallel boundary element formulation for tracking multiple particle trajectories in Stoke's flow for microfluidic applications
(Tech Science Press, 2015) A new formulation for tracking multiple particles in slow viscous flow for microfluidic applications is presented. The method employs the manipulation of the boundary element matrices so that finally a system of equations ...
Parallel image restoration using surrogate constraint methods
(Academic Press, 2007) When formulated as a system of linear inequalities, the image restoration problem yields huge, unstructured, sparse matrices even for images of small size. To solve the image restoration problem, we use the surrogate ...
Parallelization of Sparse Matrix Kernels for big data applications
(Springer, 2016) Analysis of big data on large-scale distributed systems often necessitates efficient parallel graph algorithms that are used to explore the relationships between individual components. Graph algorithms use the basic adjacency ...
Parpatoh : A 2D-parallel hypergraph partitioning tool
(Bilkent University, 2006) Hypergraph partitioning is a process that is being used to find solutions for optimization problems in various areas, including parallel volume rendering, parallel information retrieval and VLSI circuit design. While the ...
Partitioning sparse matrices for parallel preconditioned iterative methods
(Society for Industrial and Applied Mathematics, 2007) This paper addresses the parallelization of the preconditioned iterative methods that use explicit preconditioners such as approximate inverses. Parallelizing a full step of these methods requires the coefficient and ...
Progressive refinement radiosity on ring-connected multicomputers
(ACM, 1993-10) The progressive refinement method is investigated for parallelization on ring-connected multicomputers. A synchronous scheme, based on static task assignment, is proposed, in order to achieve better coherence during the ...
A recursive graph bipartitioning algorithm by vertex separators with fixed vertices for permuting sparse matrices into block diagonal form with overlap
(Bilkent University, 2011) Solving sparse system of linear equations Ax=b using preconditioners can be efficiently parallelized using graph partitioning tools. In this thesis, we investigate the problem of permuting a sparse matrix into a block ...
Reducing communication overhead in sparse matrix and tensor computations
(Bilkent University, 2020-08) Encapsulating multiple communication cost metrics, i.e., bandwidth and latency, is proven to be important in reducing communication overhead in the parallelization of sparse and irregular applications. Communication ...
Reducing communication volume overhead in large-scale parallel SpGEMM
(Bilkent University, 2016-12) Sparse matrix-matrix multiplication of the form of C = A x B, C = A x A and C = A x A^T is a key operation in various domains and is characterized with high complexity and runtime overhead. There exist models for parallelizing ...
Revisiting hypergraph models for sparse matrix partitioning
(Society for Industrial and Applied Mathematics, 2007) We provide an exposition of hypergraph models for parallelizing sparse matrix-vector multiplies. Our aim is to emphasize the expressive power of hypergraph models. First, we set forth an elementary hypergraph model for the ...