120 Scaling Metagenomic Search with Sourmash - Conversations with Titus Brown
Update: 2024-01-18
Description
In this final episode with Titus Brown, the conversation focuses on his work scaling metagenomic search with Sourmash:
An overview of what Sourmash does - sketching and comparing large k-mer datasets
How the sampling approach enables analyses like containment estimation
Exciting capabilities of the Branchwater tool for multi-threaded real-time SRA search
Scaling to search across millions of metagenomes in seconds with WebAssembly
Potential public health applications for tracking and sourcing pathogens
Important caveats around resolution limits and need for follow-up analyses
Ongoing work to characterize the technique's specificity and sensitivity
Overall, this episode highlights the massive scaling Sourmash enables for metagenomic search, and the potential use cases in public health, while acknowledging current limitations and uncertainties. Titus emphasizes the need to precisely convey what bioinformatic tools can and cannot do as research continues.
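For readers who want a concrete picture of the sketching and containment ideas mentioned above, here is a minimal illustration in Python. It is not Sourmash's code or API. It assumes a fractional sampling scheme that keeps only k-mers whose hash falls below a threshold set by a "scaled" parameter. It uses hashlib's BLAKE2b as a stand-in hash function and ignores reverse complements. All function names and the toy random sequences are hypothetical.

```python
import hashlib
import random

MAX_HASH = 2**64  # hash values are 64-bit integers in this sketch


def kmers(seq, k):
    """Yield every overlapping substring of length k."""
    for i in range(len(seq) - k + 1):
        yield seq[i:i + k]


def hash_kmer(kmer):
    """Map a k-mer to a deterministic 64-bit integer (stand-in hash)."""
    digest = hashlib.blake2b(kmer.encode("ascii"), digest_size=8).digest()
    return int.from_bytes(digest, "big")


def frac_sketch(seq, k=21, scaled=10):
    """Keep roughly 1/scaled of the distinct k-mer hashes.

    Because the same threshold is applied to every dataset, a k-mer
    retained in one sketch is retained in any other sketch that
    contains it, which is what makes containment estimable.
    """
    threshold = MAX_HASH // scaled
    return {h for h in map(hash_kmer, kmers(seq, k)) if h < threshold}


def containment(query, target):
    """Estimate the fraction of the query's k-mers present in the target."""
    if not query:
        return 0.0
    return len(query & target) / len(query)


# Toy data: a random "genome" embedded intact in a larger "metagenome".
rng = random.Random(42)
genome = "".join(rng.choice("ACGT") for _ in range(20000))
other = "".join(rng.choice("ACGT") for _ in range(20000))
metagenome = other[:10000] + genome + other[10000:]

genome_sketch = frac_sketch(genome)
print("genome in metagenome:", containment(genome_sketch, frac_sketch(metagenome)))
print("genome in unrelated: ", containment(genome_sketch, frac_sketch(other)))
```

Since the genome is embedded whole, every sampled k-mer from it also appears in the metagenome sketch, so the first estimate is complete containment, while the unrelated sequence shares essentially nothing. Real data is noisier, which is where the episode's caveats about resolution and follow-up analysis apply.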
Papers:
Spacegraphcats - https://genomebiology.biomedcentral.com/articles/10.1186/s13059-020-02066-4
Sourmash - https://www.biorxiv.org/content/10.1101/2022.01.11.475838v2
IBD exploration - https://dib-lab.github.io/2021-paper-ibd/