Show simple item record

dc.contributor.authorHill, A.C.
dc.contributor.authorGuo, C.
dc.contributor.authorLitkowski, E.M.
dc.contributor.authorManichaikul, A.W.
dc.contributor.authorYu, B.
dc.contributor.authorKonigsberg, I.R.
dc.contributor.authorGorbet, B.A.
dc.contributor.authorLange, L.A.
dc.contributor.authorPratte, K.A.
dc.contributor.authorKechris, K.J.
dc.contributor.authorDeCamp, M.
dc.contributor.authorCoors, M.
dc.contributor.authorOrtega, V.E.
dc.contributor.authorRich, S.S.
dc.contributor.authorRotter, J.I.
dc.contributor.authorGerzsten, R.E.
dc.contributor.authorClish, C.B.
dc.contributor.authorCurtis, J.L.
dc.contributor.authorHu, X.
dc.contributor.authorObeidat, M.-E.
dc.contributor.authorMorris, M.
dc.contributor.authorLoureiro, J.
dc.contributor.authorNgo, D.
dc.contributor.authorO’Neal, W.K.
dc.contributor.authorMeyers, D.A.
dc.contributor.authorBleecker, E.R.
dc.contributor.authorHobbs, B.D.
dc.contributor.authorCho, M.H.
dc.contributor.authorBanaei-Kashani, F.
dc.contributor.authorBowler, R.P.
dc.date.accessioned2024-08-06T03:49:52Z
dc.date.available2024-08-06T03:49:52Z
dc.date.issued2023-06-07
dc.identifier.citationHill, A.C., Guo, C., Litkowski, E.M. et al. Large scale proteomic studies create novel privacy considerations. Sci Rep 13, 9254 (2023). https://doi.org/10.1038/s41598-023-34866-6
dc.identifier.issn2045-2322
dc.identifier.pmid37286633
dc.identifier.doi10.1038/s41598-023-34866-6
dc.identifier.urihttp://hdl.handle.net/10150/673841
dc.description.abstractPrivacy protection is a core principle of genomic but not proteomic research. We identified independent single nucleotide polymorphism (SNP) quantitative trait loci (pQTL) from COPDGene and Jackson Heart Study (JHS), calculated continuous protein level genotype probabilities, and then applied a naïve Bayesian approach to link SomaScan 1.3K proteomes to genomes for 2812 independent subjects from COPDGene, JHS, SubPopulations and InteRmediate Outcome Measures In COPD Study (SPIROMICS) and Multi-Ethnic Study of Atherosclerosis (MESA). We correctly linked 90–95% of proteomes to their correct genome and for 95–99% we identify the 1% most likely links. The linking accuracy in subjects with African ancestry was lower (~ 60%) unless training included diverse subjects. With larger profiling (SomaScan 5K) in the Atherosclerosis Risk Communities (ARIC) correct identification was > 99% even in mixed ancestry populations. We also linked proteomes-to-proteomes and used the proteome only to determine features such as sex, ancestry, and first-degree relatives. When serial proteomes are available, the linking algorithm can be used to identify and correct mislabeled samples. This work also demonstrates the importance of including diverse populations in omics research and that large proteomic datasets (> 1000 proteins) can be accurately linked to a specific genome through pQTL knowledge and should not be considered unidentifiable. © 2023, The Author(s).
dc.language.isoen
dc.publisherNature Research
dc.rights© The Author(s) 2023. This article is licensed under a Creative Commons Attribution 4.0 International License.
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.titleLarge scale proteomic studies create novel privacy considerations
dc.typeArticle
dc.typetext
dc.contributor.departmentUniversity of Arizona
dc.identifier.journalScientific Reports
dc.description.noteOpen access journal
dc.description.collectioninformationThis item from the UA Faculty Publications collection is made available by the University of Arizona with support from the University of Arizona Libraries. If you have questions, please contact us at repository@u.library.arizona.edu.
dc.eprint.versionFinal Published Version
dc.source.journaltitleScientific Reports
refterms.dateFOA2024-08-06T03:49:52Z


Files in this item

Thumbnail
Name:
s41598-023-34866-6.pdf
Size:
2.333Mb
Format:
PDF
Description:
Final Published Version

This item appears in the following Collection(s)

Show simple item record

© The Author(s) 2023. This article is licensed under a Creative Commons Attribution 4.0 International License.
Except where otherwise noted, this item's license is described as © The Author(s) 2023. This article is licensed under a Creative Commons Attribution 4.0 International License.