Quikr: Rapid Bacterial Community Reconstruction Via Compressive Sensing
Update: 2014-03-31
Description
Co-authors: Simon Foucart (University of Georgia), Gail Rosen (Drexel University)
Many metagenomic studies compare hundreds to thousands of environmental and health-related samples by extracting and sequencing their DNA. However, one of the first steps - to determine what bacteria are actually in the sample - can be a computationally time-consuming task since most methods rely on computing the classification of each individual read out of tens to hundreds of thousands of reads. We introduce Quikr: a QUadratic, K-mer based, Iterative, Reconstruction method which computes a vector of taxonomic assignments and their proportions in the sample using an optimization technique motivated from the mathematical theory of compressive sensing. On both simulated and actual biological data, we demonstrate that Quikr is typically more accurate as well as typically orders of magnitude faster than the most commonly utilized taxonomic assignment techniques for both whole genome techniques (Metaphyler, Metaphlan) and 16S rRNA techniques (the Ribosomal Database Project's Naive Bayesian Classifier). We also show that in general nonnegative L1 minimization can be reduced to a simple nonnegative least squares problem.
Related Links: http://www.math.uga.edu/~foucart/WGSQuikr_final.pdf - WGSQuikr preprint
http://www.math.uga.edu/~foucart/QuikrRevisionPrePrint.pdf - Quikr preprint
http://www.math.uga.edu/~foucart/SRasNNLS_final.pdf - Sparse recovery by means of nonnegative least squares
Many metagenomic studies compare hundreds to thousands of environmental and health-related samples by extracting and sequencing their DNA. However, one of the first steps - to determine what bacteria are actually in the sample - can be a computationally time-consuming task since most methods rely on computing the classification of each individual read out of tens to hundreds of thousands of reads. We introduce Quikr: a QUadratic, K-mer based, Iterative, Reconstruction method which computes a vector of taxonomic assignments and their proportions in the sample using an optimization technique motivated from the mathematical theory of compressive sensing. On both simulated and actual biological data, we demonstrate that Quikr is typically more accurate as well as typically orders of magnitude faster than the most commonly utilized taxonomic assignment techniques for both whole genome techniques (Metaphyler, Metaphlan) and 16S rRNA techniques (the Ribosomal Database Project's Naive Bayesian Classifier). We also show that in general nonnegative L1 minimization can be reduced to a simple nonnegative least squares problem.
Related Links: http://www.math.uga.edu/~foucart/WGSQuikr_final.pdf - WGSQuikr preprint
http://www.math.uga.edu/~foucart/QuikrRevisionPrePrint.pdf - Quikr preprint
http://www.math.uga.edu/~foucart/SRasNNLS_final.pdf - Sparse recovery by means of nonnegative least squares
Comments
Top Podcasts
The Best New Comedy Podcast Right Now – June 2024The Best News Podcast Right Now – June 2024The Best New Business Podcast Right Now – June 2024The Best New Sports Podcast Right Now – June 2024The Best New True Crime Podcast Right Now – June 2024The Best New Joe Rogan Experience Podcast Right Now – June 20The Best New Dan Bongino Show Podcast Right Now – June 20The Best New Mark Levin Podcast – June 2024
In Channel