Efficient discovery of join plans in schemaless data

Acar, Aybar C.; Motro, A.

dc.citation.epage	11	en_US
dc.citation.spage	1	en_US
dc.contributor.author	Acar, Aybar C.	en_US
dc.contributor.author	Motro, A.	en_US
dc.coverage.spatial	Cetraro - Calabria, Italy
dc.date.accessioned	2016-02-08T12:27:22Z
dc.date.available	2016-02-08T12:27:22Z
dc.date.issued	2009-09	en_US
dc.department	Department of Computer Engineering	en_US
dc.description	Date of Conference: 16-18 September, 2009
dc.description	Conference name: IDEAS '09 Proceedings of the 2009 International Database Engineering & Applications Symposium
dc.description.abstract	We describe a method of inferring join plans for a set of relation instances, in the absence of any metadata, such as attribute domains, attribute names, or constraints (e.g., keys or foreign keys). Our method enumerates the possible join plans in order of likelihood, based on the compatibility of a pair of columns and their suitability as join attributes (i.e., their appropriateness as keys). We outline two variants of the approach. The first variant is accurate but potentially time-consuming, especially for large relations that do not fit in memory. The second variant is an approximation of the first and hence less accurate, but is considerably more efficient, allowing the method to be used online, even for large relations. We provide experimental results showing how both variants scale in terms of performance as the number of candidate join attributes and the size of the relations increase. We also characterize the accuracy of the approximate variant with respect to the exact variant. Copyright ©2009 ACM.	en_US
dc.identifier.doi	10.1145/1620432.1620434	en_US
dc.identifier.uri	http://hdl.handle.net/11693/28695	en_US
dc.language.iso	English	en_US
dc.publisher	ACM	en_US
dc.relation.isversionof	http://dx.doi.org/10.1145/1620432.1620434	en_US
dc.source.title	IDEAS '09 Proceedings of the 2009 International Database Engineering & Applications Symposium	en_US
dc.subject	Dependency inference	en_US
dc.subject	Join inference	en_US
dc.subject	Schema matching	en_US
dc.subject	Foreign keys	en_US
dc.subject	Metadata	en_US
dc.subject	Keys (for locks)	en_US
dc.title	Efficient discovery of join plans in schemaless data	en_US
dc.type	Conference Paper	en_US

Files

Original bundle

Name: Efficient discovery of join plans in schemaless data.pdf
Size: 830.26 KB
Format: Adobe Portable Document Format
Description: Full printable version

Collections

Scholarly Publications - Computer Engineering