Optimization of signature file parameters for databases with varying record lengths

dc.citation.epage23en_US
dc.citation.issueNumber1en_US
dc.citation.spage11en_US
dc.citation.volumeNumber42en_US
dc.contributor.authorKocberber, S.en_US
dc.contributor.authorCan, F.en_US
dc.contributor.authorPatton, J. M.en_US
dc.date.accessioned2016-02-08T10:42:24Z
dc.date.available2016-02-08T10:42:24Zen_US
dc.date.issued1999en_US
dc.departmentDepartment of Computer Engineeringen_US
dc.description.abstractFor signature files we propose a new false drop estimation method for databases with varying record lengths. Our approach provides more accurate estimation of the number of false drops by considering the lengths of individual records instead of using the average number of terms per record. In signature file processing, accurate estimation of the number of false drops is essential to obtain a more accurate signature file and therefore to obtain a better (query) response time. With a formal proof we show that under certain conditions the number of false drops estimated by considering the average record length is less than or equal to the precise 'expected' estimation which is based on the individual record lengths. The experiments with real data show that the proposed method accurately estimates the number of false drops and the actual response time. Depending on the space overhead, our approach obtains up to 33% and 20% response time improvements for the conventional sequential and new efficient multiframe signature file methods, respectively.en_US
dc.description.provenanceMade available in DSpace on 2016-02-08T10:42:24Z (GMT). No. of bitstreams: 1 bilkent-research-paper.pdf: 70227 bytes, checksum: 26e812c6f5156f83f0e77b261a471b5a (MD5) Previous issue date: 1999en_US
dc.identifier.doi10.1093/comjnl/42.1.11en_US
dc.identifier.eissn1460-2067
dc.identifier.issn0010-4620
dc.identifier.urihttp://hdl.handle.net/11693/25293en_US
dc.language.isoEnglishen_US
dc.publisherOxford University Pressen_US
dc.relation.isversionofhttps://doi.org/10.1093/comjnl/42.1.11en_US
dc.source.titleComputer Journalen_US
dc.subjectData Storage Equipmenten_US
dc.subjectFile Organizationen_US
dc.subjectOptimizationen_US
dc.subjectQuery Languagesen_US
dc.subjectRecord Lengthsen_US
dc.subjectSignature Fileen_US
dc.subjectDatabase Systemsen_US
dc.titleOptimization of signature file parameters for databases with varying record lengthsen_US
dc.typeArticleen_US

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Optimization of signature file parameters for databases with varying record lengths.pdf
Size:
394.74 KB
Format:
Adobe Portable Document Format
Description:
Full printable version