Genome resolved metagenomics

Slides:



Advertisements
Similar presentations
Metabarcoding 16S RNA targeted sequencing
Advertisements

9 Genomics and Beyond Brief Chapter Outline
Bioinformatics for Whole-Genome Shotgun Sequencing of Microbial Communities By Kevin Chen, Lior Pachter PLoS Computational Biology, 2005 David Kelley.
Utilizing Fuzzy Logic for Gene Sequence Construction from Sub Sequences and Characteristic Genome Derivation and Assembly.
The Sorcerer II Global ocean sampling expedition Katrine Lekang Global Ocean Sampling project (GOS) Global Ocean Sampling project (GOS) CAMERA CAMERA METAREP.
Presentation on genome sequencing. Genome: the complete set of gene of an organism Genome annotation: the process by which the genes, control sequences.
Mouse Genome Sequencing
Todd J. Treangen, Steven L. Salzberg
H = -Σp i log 2 p i. SCOPI Each one of the many microbial communities has its own structure and ecosystem, depending on the body environment it exists.
Transcriptome analysis With a reference – Challenging due to size and complexity of datasets – Many tools available, driven by biomedical research – GATK.
Probes can be designed in an evolutionary hierarchy.
Chapter 21 Eukaryotic Genome Sequences
Current Challenges in Metagenomics: an Overview Chandan Pal 17 th December, GoBiG Meeting.
The metagenomics sequencing service CD Genomics. Metagenomics: Metagenomics is the study of metagenomes, genetic material recovered directly from environmental.
Genome annotation and search for homologs. Genome of the week Discuss the diversity and features of selected microbial genomes. Link to the paper describing.
454 Genome Sequence Assembly and Analysis HC70AL S Brandon Le & Min Chen.
OPERA highthroughput paired-end sequences Reconstructing optimal genomic scaffolds with.
Genome sequencing and annotation Week 2 reading assignment - pages 63-78, 93-98, Boxes 2.1 and don’t worry about details of similarity scoring.
Quantitative Phylogenetic Assessment of Microbial Communities in Diverse Environments Xinjun Zhang.
Metagenomic Species Diversity.
Lesson: Sequence processing
SNP Detection Congtam Pham 2/24/04 Dr. Marth’s Class.
Human Genome Project.
LearnZillion Notes: --This is your hook. Start with a question to draw the student in. We want that student saying, “huh, how do you do X?” Try to be specific.
Dr. Christoph W. Sensen und Dr. Jung Soh Trieste Course 2017
Introduction to Parsing (adapted from CS 164 at Berkeley)
Seminar on :- Constructing Contigs Sequencing
Research in Computational Molecular Biology , Vol (2008)
Sequence comparison: Local alignment
Toward Next Generation Biodiversity Research
LearnZillion Notes: --This is your hook. Start with a question to draw the student in. We want that student saying, “huh, how do you do X?” Try to be specific.
 The human genome contains approximately genes.  At any given moment, each of our cells has some combination of these genes turned on & others.
LearnZillion Notes: --This is your hook. Start with a question to draw the student in. We want that student saying, “huh, how do you do X?” Try to be specific.
LearnZillion Notes: --This is your hook. Start with a question to draw the student in. We want that student saying, “huh, how do you do X?” Try to be specific.
LearnZillion Notes: --This is your hook. Start with a question to draw the student in. We want that student saying, “huh, how do you do X?” Try to be specific.
LearnZillion Notes: --This is your hook. Start with a question to draw the student in. We want that student saying, “huh, how do you do X?” Try to be specific.
LearnZillion Notes: --This is your hook. Start with a question to draw the student in. We want that student saying, “huh, how do you do X?” Try to be specific.
7 x 2 5 LearnZillion Notes: --This is your hook. Start with a question to draw the student in. We want that student saying, “huh, how do you do X?” Try.
For example, what would the value of this numerical expression be?
LearnZillion Notes: --This is your hook. Start with a question to draw the student in. We want that student saying, “huh, how do you do X?” Try to be specific.
Gene Transfer, Genetic Engineering, and Genomics
LearnZillion Notes: --This is your hook. Start with a question to draw the student in. We want that student saying, “huh, how do you do X?” Try to be specific.
LearnZillion Notes: --This is your hook. Start with a question to draw the student in. We want that student saying, “huh, how do you do X?” Try to be specific.
LearnZillion Notes: --This is your hook. Start with a question to draw the student in. We want that student saying, “huh, how do you do X?” Try to be specific.
LearnZillion Notes: --This is your hook. Start with a question to draw the student in. We want that student saying, “huh, how do you do X?” Try to be specific.
H = -Σpi log2 pi.
LearnZillion Notes: --This is your hook. Start with a question to draw the student in. We want that student saying, “huh, how do you do X?” Try to be specific.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
LearnZillion Notes: --This is your hook. Start with a question to draw the student in. We want that student saying, “huh, how do you do X?” Try to be specific.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
For example, if you have 3 4 of a giant candy bar and decide to eat 1 6 of it, how much of the candy bar will you be eating? LearnZillion Notes: --This.
LearnZillion Notes: --This is your hook. Start with a question to draw the student in. We want that student saying, “huh, how do you do X?” Try to be specific.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
LearnZillion Notes: --This is your hook. Start with a question to draw the student in. We want that student saying, “huh, how do you do X?” Try to be specific.
The Future of Genetic Research
LearnZillion Notes: --This is your hook. Start with a question to draw the student in. We want that student saying, “huh, how do you do X?” Try to be specific.
0.7 = ? ? LearnZillion Notes: --This is your hook. Start with a question to draw the student in. We want that student saying, “huh, how do you do X?”
0.73 = ? ? LearnZillion Notes: --This is your hook. Start with a question to draw the student in. We want that student saying, “huh, how do you do X?”
CSCI 1810 Computational Molecular Biology 2018
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
LearnZillion Notes: --This is your hook. Start with a question to draw the student in. We want that student saying, “huh, how do you do X?” Try to be specific.
The power of metagenomic read recruitment
For example: How do you show an increase of 12% over the original cost, if the original cost is $x? LearnZillion Notes: --This is your hook. Start with.
I wonder…2 _ 1 2 ? LearnZillion Notes:
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Feeling Worried – your experience
Toward Accurate and Quantitative Comparative Metagenomics
General overview of the bioinformatic pipelines for the 16S rRNA gene microbial profiling and shotgun metagenomics. General overview of the bioinformatic.
9-4 Short Tandem Repeats.
Curriculum Embedded Reading Assessment
Presentation transcript:

Genome resolved metagenomics http://merenlab.org/momics

Let’s say you have an environment filled with microbial cells.

Shotgun sequencing allows you to sequence the entire DNA content of this environment

In the form of ‘short sequencing reads’. Q: Why are they short?

We call these reads often metagenomics short reads. This type of data can be used in multiple ways to understand the microbial make up of an environment at the DNA level. Q: What can we do with these reads? We can align them to databases to estimate functions and taxonomy, and crate summary matrices to compare different environments, for instance. Q: Can you list some of the caveats of doing that? Alternatively we can use metagenomic reads to estimate the ‘occurrence’ of a genome in an environment. Which is an extremely powerful approach to probe naturally occurring microbes.

Let’s say we have a reference genome.

As we discussed before, we can use metagenomic read recruitment to understand whether there is a population that occurs in this environment (‘metagenomic read recruitment’ is also called ‘mapping’ pretty frequently, but I like read recruitment better because it associates this operation with an ‘intention’: recruiting reads with a reference sequence to probe its occurrence and abundance in a given metagenome).

One could use more genomes, indeed. Let’s say we used these two.

What is left behind are the reads that didn’t map either of the genomes we have used. Q: There are still purple and blue reads in there, why? Even if there were hundreds of genomes in our genomic database, there would still be some that would remain. Q: What could be an alternative strategy to characterize an environment with a larger reference context?

We could generate those reference context ourselves! Metagenomic assembly is a computational strategy to generate longer sequences by stitching partially overlapping short reads together.

We call these longer sequences ‘contigs’ (as in ‘contiguous DNA’) We call these longer sequences ‘contigs’ (as in ‘contiguous DNA’). Just like the short reads, contigs also can be aligned to databases for to study functions of genes they describe. For that purpose contigs would work better than short reads. Q: Do you agree? Why?

But contigs can also be used for something more: the holy grail of microbiology. Reconstructing genomes of microorganisms we have not yet cultivated directly from the environmental samples. We will talk about the details of this step next week.

But essentially, when done right, metagenomic binning would give us genomes, which we call ‘metagenome-assembled genomes’. Q: Why are there multiple ‘contigs’ for a given genome rather than a single one? Q: Can we estimate the abundance of these MAGs? How?

The same strategy of mapping will also work with these sequences.

In this case, we have better characterized the environment compared to the previous approach. Q: Why are there still blue reads? Q: What’s up with yellow? Which stage of the analysis may have failed? Q: What else can we do to get the genome of yellow?