Accelerating Genomic Data Generation and Facilitating Genomic Data Access Using Decentralization, Privacy-Preserving Technologies and Equitable Compensation

Dennis Grishin ,
Dennis Grishin
Kamal Obbad ,
Kamal Obbad
Preston Estep ,
Preston Estep
Kevin Quinn ,
Kevin Quinn
Sarah Wait Zaranek ,
Sarah Wait Zaranek
Alexander Wait Zaranek ,
Alexander Wait Zaranek
Ward Vandewege ,
Ward Vandewege
Tom Clegg ,
Tom Clegg
Nico César ,
Nico César
Mirza Cifric ,
Mirza Cifric
George Church
George Church

Published: 23.03.2018.

Biochemistry

Volume 1, Issue 1 (2018)

pp. 1-23;

https://doi.org/10.30953/bhty.v1.34

Abstract

In the years since the first human genome was sequenced at a cost of over $3 billion, technological advancements have driven the price below $1,000, making personal genome sequencing affordable to many people. Personal genome sequencing has the potential to enable better disease prevention, more accurate diagnoses, and personalized therapies. Furthermore, sharing genomic data with researchers promises identification of the causes of many diseases and the development of new therapies. However, sequencing costs, data privacy concerns, regulatory restrictions, and technical challenges impede the growth of genomic data and hinder data sharing. In this article, we propose that these challenges can be addressed by combining decentralized system design, privacy-preserving technologies, and an equitable compensation model in a platform that vests control over data with individual owners; ensures transparency and privacy; facilitates regulatory compliance; minimizes expensive data transfers; and shifts the sequencing costs from consumers, patients, and biobanks to researchers in industry and academia. We exemplify this by describing the implementation of Nebula, a distributed genomic data generation, sharing, and analysis platform.

Keywords

References

1.
Finishing the euchromatic sequence of the human genome. Nature. 2004;431(7011):931–45.
2.
Maurano MT, Humbert R, Rynes E, Thurman RE, Haugen E, Wang H, et al. Systematic Localization of Common Disease-Associated Variation in Regulatory DNA. Science. 2012;337(6099):1190–5.
3.
Lindor NM, Thibodeau SN, Burke W. Whole-Genome Sequencing in Healthy People. Mayo Clinic Proceedings. 2017;92(1):159–72.
4.
Berg JS, Adams M, Nassar N, Bizon C, Lee K, Schmitt CP, et al. An informatics approach to analyzing the incidentalome. Genetics in Medicine. 2013;15(1):36–44.
5.
Yang A, Palmer AA, de Wit H. Genetics of caffeine consumption and responses to caffeine. Psychopharmacology. 2010;211(3):245–57.
6.
Mattar R, Mazo, Carrilho. Lactose intolerance: diagnosis, genetic, and clinical factors. Clinical and Experimental Gastroenterology. :113.
7.
Freeman HJ. Risk factors in familial forms of celiac disease. World Journal of Gastroenterology. 2010;16(15):1828.

Citation

Copyright

This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. 

Article metrics

Google scholar: See link

The statements, opinions and data contained in the journal are solely those of the individual authors and contributors and not of the publisher and the editor(s). We stay neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Most read articles