Huang, Z.Ayday, ErmanFellay, JacquesHubaux, J-P.Juels, A.2016-02-082016-02-082015-05http://hdl.handle.net/11693/28585Conference name: IEEE Symposium on Security and Privacy, 2015Date of Conference: 17-21 May 2015Secure storage of genomic data is of great and increasing importance. The scientific community's improving ability to interpret individuals' genetic materials and the growing size of genetic database populations have been aggravating the potential consequences of data breaches. The prevalent use of passwords to generate encryption keys thus poses an especially serious problem when applied to genetic data. Weak passwords can jeopardize genetic data in the short term, but given the multi-decade lifespan of genetic data, even the use of strong passwords with conventional encryption can lead to compromise. We present a tool, called Geno Guard, for providing strong protection for genomic data both today and in the long term. Geno Guard incorporates a new theoretical framework for encryption called honey encryption (HE): it can provide information-theoretic confidentiality guarantees for encrypted data. Previously proposed HE schemes, however, can be applied to messages from, unfortunately, a very restricted set of probability distributions. Therefore, Geno Guard addresses the open problem of applying HE techniques to the highly non-uniform probability distributions that characterize sequences of genetic data. In Geno Guard, a potential adversary can attempt exhaustively to guess keys or passwords and decrypt via a brute-force attack. We prove that decryption under any key will yield a plausible genome sequence, and that Geno Guard offers an information-theoretic security guarantee against message-recovery attacks. We also explore attacks that use side information. Finally, we present an efficient and parallelized software implementation of Geno Guard. © 2015 IEEE.EnglishBrute-force attackDistribution-transforming encoderGenomic privacyHoney encryptionAuthenticationCryptographyDigital storageFood productsGenesInformation theoryPopulation statisticsBrute-force attackDistribution-transforming encoderGenetic materialsGenome sequencesInformation- theoretic securitiesScientific communitySoftware implementationTheoretical frameworkProbability distributionsGenoGuard: protecting genomic data against brute-force attacksConference Paper10.1109/SP.2015.34