BATCAVE: calling somatic mutations with a tumor- And site-specific prior
Affiliation
Mel and Enid Zuckerman College of Public Health, University of ArizonaDepartment of Molecular and Cellular Biology, University of Arizona
Issue Date
2020
Metadata
Show full item recordPublisher
Oxford University PressCitation
Mannakee, B. K., & Gutenkunst, R. N. (2020). BATCAVE: calling somatic mutations with a tumor- And site-specific prior. NAR Genomics and Bioinformatics.Journal
NAR Genomics and BioinformaticsRights
Copyright © The Author(s) 2020. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License.Collection Information
This item from the UA Faculty Publications collection is made available by the University of Arizona with support from the University of Arizona Libraries. If you have questions, please contact us at repository@u.library.arizona.edu.Abstract
Detecting somatic mutations withins tumors is key to understanding treatment resistance, patient prognosis and tumor evolution. Mutations at low allelic frequency, those present in only a small portion of tumor cells, are particularly difficult to detect. Many algorithms have been developed to detect such mutations, but none models a key aspect of tumor biology. Namely, every tumor has its own profile of mutation types that it tends to generate. We present BATCAVE (Bayesian Analysis Tools for Context-Aware Variant Evaluation), an algorithm that first learns the individual tumor mutational profile and mutation rate then uses them in a prior for evaluating potential mutations. We also present an R implementation of the algorithm, built on the popular caller MuTect. Using simulations, we show that adding the BATCAVE algorithm to MuTect improves variant detection. It also improves the calibration of posterior probabilities, enabling more principled tradeoff between precision and recall. We also show that BATCAVE performs well on real data. Our implementation is computationally inexpensive and straightforward to incorporate into existing MuTect pipelines. More broadly, the algorithm can be added to other variant callers, and it can be extended to include additional biological features that affect mutation generation. © The Author(s) 2020. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics.Note
Open access journalISSN
2631-9268Version
Final published versionae974a485f413a2113503eed53cd6c53
10.1093/nargab/lqaa004
Scopus Count
Collections
Except where otherwise noted, this item's license is described as Copyright © The Author(s) 2020. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License.

