Scalable privacy-preserving data sharing methodology for genome-wide association studies
Yu F, Fienberg S, Slaković A, Uhler C. 2014. Scalable privacy-preserving data sharing methodology for genome-wide association studies. Journal of Biomedical Informatics. 50, 133–141.
Download (ext.)
          
        
            
            
            Journal Article
            
            
            
            | Published
            
            
              |              English
              
            
          
        Scopus indexed
Author
        
      Yu, Fei;
      Fienberg, Stephen;
      Slaković, Alexandra;
      Uhler, CarolineISTA 

Department
    Abstract
    The protection of privacy of individual-level information in genome-wide association study (GWAS) databases has been a major concern of researchers following the publication of “an attack” on GWAS data by Homer et al. (2008). Traditional statistical methods for confidentiality and privacy protection of statistical databases do not scale well to deal with GWAS data, especially in terms of guarantees regarding protection from linkage to external information. The more recent concept of differential privacy, introduced by the cryptographic community, is an approach that provides a rigorous definition of privacy with meaningful privacy guarantees in the presence of arbitrary external information, although the guarantees may come at a serious price in terms of data utility. Building on such notions, Uhler et al. (2013) proposed new methods to release aggregate GWAS data without compromising an individual’s privacy. We extend the methods developed in Uhler et al. (2013) for releasing differentially-private χ2χ2-statistics by allowing for arbitrary number of cases and controls, and for releasing differentially-private allelic test statistics. We also provide a new interpretation by assuming the controls’ data are known, which is a realistic assumption because some GWAS use publicly available data as controls. We assess the performance of the proposed methods through a risk-utility analysis on a real data set consisting of DNA samples collected by the Wellcome Trust Case Control Consortium and compare the methods with the differentially-private release mechanism proposed by Johnson and Shmatikov (2013).
    
  Publishing Year
    
  Date Published
    2014-08-01
  Journal Title
    Journal of Biomedical Informatics
  Publisher
    Elsevier
  Acknowledgement
    This research was partially supported by NSF Awards EMSW21-RTG and BCS-0941518 to the Department of Statistics at Carnegie Mellon University, and by NSF Grant BCS-0941553 to the Department of Statistics at Pennsylvania State University. This work was also supported in part by the National Center for Research Resources, Grant UL1 RR033184, and is now at the National Center for Advancing Translational Sciences, Grant UL1 TR000127 to Pennsylvania State University. The content is solely the responsibility of the authors and does not necessarily represent the official views of the NSF and NIH.
  Volume
      50
    Page
      133 - 141
    IST-REx-ID
    
  Cite this
Yu F, Fienberg S, Slaković A, Uhler C. Scalable privacy-preserving data sharing methodology for genome-wide association studies. Journal of Biomedical Informatics. 2014;50:133-141. doi:10.1016/j.jbi.2014.01.008
    Yu, F., Fienberg, S., Slaković, A., & Uhler, C. (2014). Scalable privacy-preserving data sharing methodology for genome-wide association studies. Journal of Biomedical Informatics. Elsevier. https://doi.org/10.1016/j.jbi.2014.01.008
    Yu, Fei, Stephen Fienberg, Alexandra Slaković, and Caroline Uhler. “Scalable Privacy-Preserving Data Sharing Methodology for Genome-Wide Association Studies.” Journal of Biomedical Informatics. Elsevier, 2014. https://doi.org/10.1016/j.jbi.2014.01.008.
    F. Yu, S. Fienberg, A. Slaković, and C. Uhler, “Scalable privacy-preserving data sharing methodology for genome-wide association studies,” Journal of Biomedical Informatics, vol. 50. Elsevier, pp. 133–141, 2014.
    Yu F, Fienberg S, Slaković A, Uhler C. 2014. Scalable privacy-preserving data sharing methodology for genome-wide association studies. Journal of Biomedical Informatics. 50, 133–141.
    Yu, Fei, et al. “Scalable Privacy-Preserving Data Sharing Methodology for Genome-Wide Association Studies.” Journal of Biomedical Informatics, vol. 50, Elsevier, 2014, pp. 133–41, doi:10.1016/j.jbi.2014.01.008.
  
      All files available under the following license(s):
      
      
        
          
        
          
          
      
      
    
  
            Copyright Statement:
          
        
            This Item is protected by copyright and/or related rights. [...]
          
        
      Link(s) to Main File(s)
    
  Access Level
     Open Access
 Open Access
    Export
Marked PublicationsOpen Data ISTA Research Explorer
Web of Science
View record in Web of Science®Sources
 arXiv 1401.5193
arXiv 1401.5193

 Google Scholar
Google Scholar