Probes can be designed in an evolutionary hierarchy.

Slides:



Advertisements
Similar presentations
Bassem M. Shoucri Bio Sci D145 January 8, Why do we want to map genomes? Clinical applications! – Identify genes causing diseases – Treatment? Compare.
Advertisements

Bioinformatics for Whole-Genome Shotgun Sequencing of Microbial Communities By Kevin Chen, Lior Pachter PLoS Computational Biology, 2005 David Kelley.
Community structure and metabolism through reconstruction of microbial genomes from the environment articles Venter et al.
Mining SNPs from EST Databases Picoult-Newberg et al. (1999)
Central Dogma Information storage in biological molecules DNA RNA Protein transcription translation replication.
Application of Molecular Biotechnologies to Remediation
Human Genome Project. Basic Strategy How to determine the sequence of the roughly 3 billion base pairs of the human genome. Started in Various side.
Molecular Microbial Ecology Lecture 1 Professor Ralph Kirby Faculty of Life Sciences Extension 5511 Room B322.
The Sorcerer II Global ocean sampling expedition Katrine Lekang Global Ocean Sampling project (GOS) Global Ocean Sampling project (GOS) CAMERA CAMERA METAREP.
Lecture 19, Chapter 11 Analysis of transgenic plants part II Neal Stewart.
Analysis of Transgenic Plants. 1.Regeneration on Selective Medium Selectable Marker Gene.
BioInformatics (2). Physical Mapping - I Low resolution  Megabase-scale High resolution  Kilobase-scale or better Methods for low resolution mapping.
Environmental Genome Shotgun Sequencing of the Sargasso Sea
20.1 – 1 Look at the illustration of “Cloning a Human Gene in a Bacterial Plasmid” (Figure 20.4 in the orange book). If the medium used for plating cells.
Analysis of Microbial Community Structure
20.1 – 1 Look at the illustration of “Cloning a Human Gene in a Bacterial Plasmid” (Figure 20.4 in the orange book). If the medium used for plating cells.
Molecular Microbial Ecology
Advancing Science with DNA Sequence Natalia Ivanova MGM Workshop September 12, 2012 Metagenome analysis: use case.
What is comparative genomics? Analyzing & comparing genetic material from different species to study evolution, gene function, and inherited disease Understand.
CO 10.
How do you identify and clone a gene of interest? Shotgun approach? Is there a better way?
Cottrell, M. T., L. A. Waldner, L. Yu, and D. L. Kirchman Bacterial diversity of metagenomic and PCR libraries from the Delaware River. Environmental.
The Sargasso Sea “Metagenome”
Library screening Heterologous and homologous gene probes Differential screening Expression library screening.
Accurate estimation of microbial communities using 16S tags Julien Tremblay, PhD
Work by Antonio Izzo Based on 36 soil cores from a total of 9 plots contained within a 2.5 hectare region.
20.1 Structural Genomics Determines the DNA Sequences of Entire Genomes The ultimate goal of genomic research: determining the ordered nucleotide sequences.
Bradeen Lab – University of Minnesota Lab Goal: Development of “allelic mining” techniques and strategies for R genes, enabling multi-genotype isolation.
Analysis of Complex Proteomic Datasets Using Scaffold Free Scaffold Viewer can be downloaded at:
Microbial genomics Genomics: study of entire genomes Logical next step after genetics: study of genes Genomics: 1) “Structural genomics” * Determine and.
Stratton Nature 45: 719, 2009 Evolution of DNA sequencing technologies to present day DNA SEQUENCING & ASSEMBLY.
Big Picture Of ≈1.7 million species classified so far, roughly 6000 are microbes True number of microbes is obviously larger than 6000 “Imagine if our.
Advancing Science with DNA Sequence Natalia Ivanova MGM Workshop September 29, 2011 Metagenome analysis: use case.
Lecture 6. Functional Genomics: DNA microarrays and re-sequencing individual genomes by hybridization.
Human Genome.
Neanderthals Noonan, et al. Sequencing and Analysis of Neanderthal Genomic DNA Green, et al. Analysis of one million base pairs of Neanderthal DNA Kristine.
Microarray (Gene Expression) DNA microarrays is a technology that can be used to measure changes in expression levels or to detect SNiPs Microarrays differ.
1 From Mendel to Genomics Historically –Identify or create mutations, follow inheritance –Determine linkage, create maps Now: Genomics –Not just a gene,
Accurate estimation of microbial communities using 16S tags
How many genes are there?
Comparison of Synechococcus and Prochlorococcus photosynthetic pigments and cell size Characteristic Prochlorococcus Synechococcus Primary photosynthetic.
Environmental Genome Shotgun Sequencing of the Sargasso Sea Venter et. al (2004) Presented by Ken Vittayarukskul Steven S. White.
Radiation hybrid map of the zebrafish genome
Tools for microbial community analysis. What I am not going to talk  Culture dependent analysis  Isolate all possible colonies  Infer community  Test.
Virginia Commonwealth University
Rob Edwards San Diego State University
Metagenomic Species Diversity.
Metagenomics: From Bench to Data Analysis 19-23rd September S rRNA-based surveys for Community Analysis: How Quantitative are they? Dr.
Ch 15 DNA Technology/ Genetic Engineering
SNP Detection Congtam Pham 2/24/04 Dr. Marth’s Class.
Human Genome Project.
Microbial Genomics Workshop Elizabeth Dinsdale Rio October
Genomics A Systematic Study of the Locations, Functions and Interactions of Many Genes at Once.
Genomic Data Manipulation Thinking about data visually
Very important to know the difference between the trees!
Environmental Genome Shotgun Sequencing of the Sargasso Sea
Workshop on the analysis of microbial sequence data using ARB
Try not to leave any behind – I dare you!
Life on Earth Thought to be ~3.8 billion years old
AMPLIFYING AND ANALYZING DNA.
DNA Sequencing The DNA from the genome is chopped into bits- whole chromosomes are too large to deal with, so the DNA is broken into manageably-sized overlapping.
Today… Review a few items from last class
Discovery tools for human genetic variations
From Mendel to Genomics
Introduction to Sequencing
A Comprehensive Analysis of Recently Integrated Human Ta L1 Elements
Genome resolved metagenomics
RealTime-PCR.
Toward Accurate and Quantitative Comparative Metagenomics
Presentation transcript:

Probes can be designed in an evolutionary hierarchy

Probes can be designed to be highly redundant to increase the certainly of identification

The match between clone counts and hybridization intensity

Genomics terminology Shotgun cloning: undirected cloning effort where the entire sample is cloned and sequenced Contig - assembled continuous sequence derived from sequence reads from a single clone Scaffold: assembled sequence reads derived from multiple overlapping clones nX coverage: mean number of times a region was sequenced from independent clones Mini-scaffold: scaffold assembled only by paired ends of overlapping contigs (approx. 1X coverage)

“Environmental Genomics” Ed Delong Shotgun cloning of Megabase fragments from marine environments Probe for those with rRNA gene of interest Sequence and use bioinformatics to infer function Used to connect diverse psbA (photosystem II) genes to known 16S sequence groups

Shotgun cloning and assembly of enviromental genomes Tyson et al 2004, Nature 428:37-43 Venter et al 2004, Science 304:66-74.

Tyson et al Iron Mt. study Pink biofilm growing at pH 0.87, which was known to be composed of 6 rRNA types 103,462 sequence reads from shotgun clones provided 10X coverage for two species (Leptospirillum III and Ferroplasma II), and 3X coverage for Leptospirillum II. Very low polymorphism in Leptospirillum III interpreted as evidence for a single strain Higher polymorphism (2.2%) in Ferroplasma II interpreted as evidence for 3 strains which show evidence of past recombination A single nitrogen fixer was found (Leptospirillum III

An aside about rRNA 16S rRNA sequences of Fer I isolate and assembled Fer II strains differ by less than 1%. The assembled genomes of Fer I and FerII differ by more than 22% even though gene order and content appear to be conserved.

Tyson et al. FISH image of biofilm Yellow (red + green) Leptospirillum Green (Eubacterial) Blue (Archaea) predominantly Ferroplasma

Some numbers from the Venter et al. study From 200L of filtered sea water, 1.66 million sequence reads were derived. 246 mbp were assembled into 64,398 scaffolds ranging from 826 bp to 2.6 Mbp 170 mbp of miniscaffolds and unpaired reads 1.2 million protein-coding genes (10X more than previously in protein database) 69,901 conserved open reading frames with no assignable function 60,000 16S sequences, 148 of which are at least 3% different from previously known sequence

Summary of genes found in the Sargasso sea survey

Assembly problems Most abundant genomes are over- represented Assembled genomes are composites of different individuals (particularly genomes with lower coverage)

Estimates of species diversity At least 300 species/sample assuming homologous sequences that are greater than >6% are from different species Using models based on a poisson distribution and 3 different coverage models, estimates of species for the whole study range from 1800 to 47,000. A minimum of 12X greater sequence effort would be needed to sample 95% of the unique sequence

Population level findings Scaffolds with 14X coverage contain about 1 SNP/10,000 bases, and also contain inserted phage sequences SAR 11- like (a previously characterize 16S type) sequences are abundant but are very polymorphic

Venter et al evidence the the composite genome represents a population

Other interesting findings Species distributions are patchy even in the ocean (e.g. Burkholderia and Sewanella abunance in sample 1 but not 2) Clear copy bias in rRNA gene sequences in favor of beta and gamma proteobacteria, which typically have two or more gene copies Huge diversity of rhodopsin sequences, possible non- chlorophyll light harvesting?

Estimates of abundance of major groups based on different gene families

Rhodopsin tree showing the novelty of the Sargasso sea samples (SAR)

Advantages and disadvantages of environmental genomics Avoids PCR and all the inherent biases Not dependent on rRNA Lots of new genes and new information about who has them Currently way too expensive for mere mortals Not very efficient if structure and activity are the main questions

Hybridization to rRNA for identification and quantification Extract total RNA from sample (no amplification or cloning!) Spot it on a filter Probe it with oligonucliotide probes

From MacKay et al. 2002

Landeweert et al 2003 Wanted to measure competition between two mycorrhizal fungi: Suillus & Paxillus Setup pot inoculations with pine trees and either fungus or both fungi together Measured: total mycelium, PLFAs Amplified ITS region with basidiomycete specific primers Compared via DGGE, clone counts, real-time PCR quantification

Mycelium (white) of Suillus in co- culture with pine

DGGE gel of amplified basidiomycete ITS from soil

Conclusions DGGE, Clone counts, and real-time quantification agree that Suillus ITS increases as Paxillus decreases What have we gained by real-time? Quantification of the template, rather than the amplicons