Auto-tuning similarity search algorithms on multi-core architectures

Gedik, B.

dc.citation.epage	620	en_US
dc.citation.issueNumber	5	en_US
dc.citation.spage	595	en_US
dc.citation.volumeNumber	41	en_US
dc.contributor.author	Gedik, B.	en_US
dc.date.accessioned	2016-02-08T09:34:59Z
dc.date.available	2016-02-08T09:34:59Z
dc.date.issued	2013	en_US
dc.department	Department of Computer Engineering	en_US
dc.description.abstract	In recent times, large high-dimensional datasets have become ubiquitous. Video and image repositories, financial data, and sensor data are just a few examples of such datasets in practice. Many applications that use such datasets require the retrieval of data items similar to a given query item, or the nearest neighbors (NN or k-NN) of a given item. Another common query is the retrieval of multiple sets of nearest neighbors, i.e., multi k-NN, for different query items on the same data. With commodity multi-core CPUs becoming more and more widespread at lower costs, developing parallel algorithms for these search problems has become increasingly important. While the core nearest neighbor search problem is relatively easy to parallelize, it is challenging to tune it for optimality. This is because the various performance-specific algorithmic parameters, or "tuning knobs", are inter-related and also depend on the data and query workloads. In this paper, we present (1) a detailed study of the various tuning knobs and their contributions to increasing the query throughput for parallelized versions of the two most common classes of high-dimensional multi-NN search algorithms: linear scan and tree traversal, and (2) an offline auto-tuner for setting these knobs by iteratively measuring actual query execution times for a given workload and dataset. We show experimentally that our auto-tuner reaches near-optimal performance and significantly outperforms un-tuned versions of parallel multi-NN algorithms for real video repository data on a variety of multi-core platforms. © 2013 Springer Science+Business Media New York.	en_US
dc.identifier.doi	10.1007/s10766-013-0239-8	en_US
dc.identifier.issn	0885-7458	en_US
dc.identifier.uri	http://hdl.handle.net/11693/20774	en_US
dc.language.iso	English	en_US
dc.relation.isversionof	http://dx.doi.org/10.1007/s10766-013-0239-8	en_US
dc.source.title	International Journal of Parallel Programming	en_US
dc.subject	Auto-tuning	en_US
dc.subject	Nearest neighbor search	en_US
dc.subject	Parallelization	en_US
dc.subject	Algorithmic parameters	en_US
dc.subject	Autotuning	en_US
dc.subject	Multi-core platforms	en_US
dc.subject	Multicore architectures	en_US
dc.subject	Near-optimal performance	en_US
dc.subject	Parallelized version	en_US
dc.subject	Iterative methods	en_US
dc.subject	Knobs	en_US
dc.subject	Learning algorithms	en_US
dc.subject	Optimization	en_US
dc.subject	Program processors	en_US
dc.subject	Tuners	en_US
dc.subject	Computer architecture	en_US
dc.title	Auto-tuning similarity search algorithms on multi-core architectures	en_US
dc.type	Article	en_US
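
The abstract describes an offline auto-tuner that sets tuning knobs by measuring actual query execution times on a given workload and dataset. The Python sketch below illustrates that general idea for a multi k-NN linear scan. It is not the paper's implementation: the two knobs (`workers`, `batch_size`), the exhaustive grid search, and all function names are assumptions made for illustration, and the record does not say which knobs or search strategy the paper uses.

```python
import heapq
import itertools
import time
from concurrent.futures import ThreadPoolExecutor


def knn_linear_scan(data, query, k):
    """Return indices of the k points in data closest to query (squared L2)."""
    def dist(i):
        return sum((a - b) ** 2 for a, b in zip(data[i], query))
    return heapq.nsmallest(k, range(len(data)), key=dist)


def multi_knn(data, queries, k, workers, batch_size):
    """Answer every query, splitting the queries into batches run on a thread pool."""
    batches = [queries[i:i + batch_size] for i in range(0, len(queries), batch_size)]

    def run_batch(batch):
        return [knn_linear_scan(data, q, k) for q in batch]

    with ThreadPoolExecutor(max_workers=workers) as pool:
        per_batch = pool.map(run_batch, batches)  # map preserves batch order
        return [result for batch in per_batch for result in batch]


def timed_run(data, queries, k, workers, batch_size):
    """Measure the wall-clock time of one full multi k-NN run."""
    start = time.perf_counter()
    multi_knn(data, queries, k, workers, batch_size)
    return time.perf_counter() - start


def autotune(data, queries, k, worker_options, batch_options, repeats=2):
    """Hypothetical tuner: time every knob combination, keep the fastest one."""
    best_cfg, best_time = None, float("inf")
    for workers, batch_size in itertools.product(worker_options, batch_options):
        # Take the minimum over a few repeats to reduce timing noise.
        elapsed = min(
            timed_run(data, queries, k, workers, batch_size) for _ in range(repeats)
        )
        if elapsed < best_time:
            best_cfg, best_time = (workers, batch_size), elapsed
    return best_cfg, best_time
```

A call such as `autotune(data, queries, 10, [1, 2, 4], [8, 64])` returns the fastest measured `(workers, batch_size)` pair together with its time. In CPython the global interpreter lock limits the speedup that threads give for this pure-Python scan, so the sketch shows the measure-and-select loop rather than realistic multi-core throughput.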

Collections

Scholarly Publications - Computer Engineering
