Differential privacy under dependent tuples—the case of genomic privacy

buir.contributor.authorAlmadhoun, Nour
buir.contributor.authorAyday, Erman
buir.contributor.authorUlusoy, Özgür
dc.citation.epage1703en_US
dc.citation.issueNumber6en_US
dc.citation.spage1696en_US
dc.citation.volumeNumber36en_US
dc.contributor.authorAlmadhoun, Nouren_US
dc.contributor.authorAyday, Ermanen_US
dc.contributor.authorUlusoy, Özgüren_US
dc.date.accessioned2021-02-26T06:09:41Z
dc.date.available2021-02-26T06:09:41Z
dc.date.issued2020-03
dc.departmentDepartment of Computer Engineeringen_US
dc.description.abstractMotivation: The rapid progress in genome sequencing has led to high availability of genomic data. Studying these data can greatly help answer the key questions about disease associations and our evolution. However, due to growing privacy concerns about the sensitive information of participants, accessing key results and data of genomic studies (such as genome-wide association studies) is restricted to only trusted individuals. On the other hand, paving the way to biomedical breakthroughs and discoveries requires granting open access to genomic datasets. Privacy-preserving mechanisms can be a solution for granting wider access to such data while protecting their owners. In particular, there has been growing interest in applying the concept of differential privacy (DP) while sharing summary statistics about genomic data. DP provides a mathematically rigorous approach to prevent the risk of membership inference while sharing statistical information about a dataset. However, DP does not consider the dependence between tuples in the dataset, which may degrade the privacy guarantees offered by the DP. Results: In this work, focusing on genomic datasets, we show this drawback of the DP and we propose techniques to mitigate it. First, using a real-world genomic dataset, we demonstrate the feasibility of an inference attack on differentially private query results by utilizing the correlations between the entries in the dataset. The results show the scale of vulnerability when we have dependent tuples in the dataset. We show that the adversary can infer sensitive genomic data about a user from the differentially private results of a query by exploiting the correlations between the genomes of family members. Second, we propose a mechanism for privacy-preserving sharing of statistics from genomic datasets to attain privacy guarantees while taking into consideration the dependence between tuples. By evaluating our mechanism on different genomic datasets, we empirically demonstrate that our proposed mechanism can achieve up to 50% better privacy than traditional DP-based solutions. Availability and implementation: https://github.com/nourmadhoun/Differential-privacy-genomic-inference-attack.en_US
dc.description.provenanceSubmitted by Evrim Ergin (eergin@bilkent.edu.tr) on 2021-02-26T06:09:41Z No. of bitstreams: 1 Differential_privacy_under_dependent_tuples—the_case_of_genomic_privacy.pdf: 419359 bytes, checksum: fb375905cda37665ab5821567aeb4bf1 (MD5)en
dc.description.provenanceMade available in DSpace on 2021-02-26T06:09:41Z (GMT). No. of bitstreams: 1 Differential_privacy_under_dependent_tuples—the_case_of_genomic_privacy.pdf: 419359 bytes, checksum: fb375905cda37665ab5821567aeb4bf1 (MD5) Previous issue date: 2020-03en
dc.identifier.doi10.1093/bioinformatics/btz837en_US
dc.identifier.issn1367-4803
dc.identifier.urihttp://hdl.handle.net/11693/75604
dc.language.isoEnglishen_US
dc.publisherOxford University Pressen_US
dc.relation.isversionofhttps://dx.doi.org/10.1093/bioinformatics/btz837en_US
dc.source.titleBioinformaticsen_US
dc.subjectGenome analysisen_US
dc.titleDifferential privacy under dependent tuples—the case of genomic privacyen_US
dc.typeArticleen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Differential_privacy_under_dependent_tuples—the_case_of_genomic_privacy.pdf
Size:
409.53 KB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: