GenoGuard: protecting genomic data against brute-force attacks

Source Title
Proceedings - IEEE Symposium on Security and Privacy, 2015
Print ISSN
Electronic ISSN
447 - 462
Conference Paper
Journal Title
Journal ISSN
Volume Title

Secure storage of genomic data is of great and increasing importance. The scientific community's improving ability to interpret individuals' genetic materials and the growing size of genetic database populations have been aggravating the potential consequences of data breaches. The prevalent use of passwords to generate encryption keys thus poses an especially serious problem when applied to genetic data. Weak passwords can jeopardize genetic data in the short term, but given the multi-decade lifespan of genetic data, even the use of strong passwords with conventional encryption can lead to compromise. We present a tool, called Geno Guard, for providing strong protection for genomic data both today and in the long term. Geno Guard incorporates a new theoretical framework for encryption called honey encryption (HE): it can provide information-theoretic confidentiality guarantees for encrypted data. Previously proposed HE schemes, however, can be applied to messages from, unfortunately, a very restricted set of probability distributions. Therefore, Geno Guard addresses the open problem of applying HE techniques to the highly non-uniform probability distributions that characterize sequences of genetic data. In Geno Guard, a potential adversary can attempt exhaustively to guess keys or passwords and decrypt via a brute-force attack. We prove that decryption under any key will yield a plausible genome sequence, and that Geno Guard offers an information-theoretic security guarantee against message-recovery attacks. We also explore attacks that use side information. Finally, we present an efficient and parallelized software implementation of Geno Guard. © 2015 IEEE.

Other identifiers
Book Title
Brute-force attack, Distribution-transforming encoder, Genomic privacy, Honey encryption, Authentication, Cryptography, Digital storage, Food products, Genes, Information theory, Population statistics, Brute-force attack, Distribution-transforming encoder, Genetic materials, Genome sequences, Information- theoretic securities, Scientific community, Software implementation, Theoretical framework, Probability distributions
Published Version (Please cite this version)