Environmental Genome Shotgun Sequencing of the Sargasso Sea

Slides:



Advertisements
Similar presentations
Cyber Metagenomics; Challenge to See The Unseen Majority in The Ocean
Advertisements

DNA Technology & Gene Mapping Biotechnology has led to many advances in science and medicine including the creation of DNA clones via recombinant clones,
Tucson High School Biotechnology Course Spring 2010.
SEQUENCING-related topics 1. chain-termination sequencing 2. the polymerase chain reaction (PCR) 3. cycle sequencing 4. large scale sequencing stefanie.hartmann.
Bioinformatics for Whole-Genome Shotgun Sequencing of Microbial Communities By Kevin Chen, Lior Pachter PLoS Computational Biology, 2005 David Kelley.
DNA Technology. Biotechnology The use or alteration of cells or biological molecules for specific applications Transgenics Transgenic “changed genes”
CHAPTER 15 Microbial Genomics Genomic Cloning Techniques Vectors for Genomic Cloning and Sequencing MS2, RNA virus nt sequenced in 1976 X17, ssDNA.
Central Dogma Information storage in biological molecules DNA RNA Protein transcription translation replication.
Goals of the Human Genome Project determine the entire sequence of human DNA identify all the genes in human DNA store this information in databases improve.
Molecular Microbial Ecology Lecture 1 Professor Ralph Kirby Faculty of Life Sciences Extension 5511 Room B322.
The Sorcerer II Global ocean sampling expedition Katrine Lekang Global Ocean Sampling project (GOS) Global Ocean Sampling project (GOS) CAMERA CAMERA METAREP.
Genome sequencing. Vocabulary Bac: Bacterial Artificial Chromosome: cloning vector for yeast Pac, cosmid, fosmid, plasmid: cloning vectors for E. coli.
Manipulating the Genome: DNA Cloning and Analysis 20.1 – 20.3 Lesson 4.8.
Human Genome Project Seminal achievement. Scientific milestone. Scientific implications. Social implications.
Zachary Bendiks. Jonathan Eisen  UC Davis Genome Center  Lab focus: “Our work focuses on genomic basis for the origin of novelty in microorganisms (how.
Reading the Blueprint of Life
Environmental Genome Shotgun Sequencing of the Sargasso Sea
DNA Technology and Genomics
20.1 – 1 Look at the illustration of “Cloning a Human Gene in a Bacterial Plasmid” (Figure 20.4 in the orange book). If the medium used for plating cells.
DNA Technology- Cloning, Libraries, and PCR 17 November, 2003 Text Chapter 20.
From Haystacks to Needles AP Biology Fall Isolating Genes  Gene library: a collection of bacteria that house different cloned DNA fragments, one.
AP Biology Ch. 20 Biotechnology.
20.1 – 1 Look at the illustration of “Cloning a Human Gene in a Bacterial Plasmid” (Figure 20.4 in the orange book). If the medium used for plating cells.
Molecular Microbial Ecology
Genomic walking (1) To start, you need: -the DNA sequence of a small region of the chromosome -An adaptor: a small piece of DNA, nucleotides long.
How do you identify and clone a gene of interest? Shotgun approach? Is there a better way?
Probes can be designed in an evolutionary hierarchy.
Today: Biotechnology. Over 600 recent transposon insertions were identified by examining DNA from 36 genetically diverse humans. Tbl 1 Which transposable.
The Sargasso Sea “Metagenome”
Microbial genomics Genomics: study of entire genomes Logical next step after genetics: study of genes Genomics: 1) “Structural genomics” * Determine and.
Big Picture Of ≈1.7 million species classified so far, roughly 6000 are microbes True number of microbes is obviously larger than 6000 “Imagine if our.
15.2, slides with notes to write down
GENE SEQUENCING. INTRODUCTION CELL The cells contain the nucleus. The chromosomes are present within the nucleus.
Human Genomics. Writing in RED indicates the SQA outcomes. Writing in BLACK explains these outcomes in depth.
1 From Mendel to Genomics Historically –Identify or create mutations, follow inheritance –Determine linkage, create maps Now: Genomics –Not just a gene,
15 March 2016 Today’s Title: CW: Introduction to genetic engineering Learning Question: what is genetic engineering?
Environmental Genome Shotgun Sequencing of the Sargasso Sea Venter et. al (2004) Presented by Ken Vittayarukskul Steven S. White.
Title: Studying whole genomes Homework: learning package 14 for Thursday 21 June 2016.
Metagenomics The study of metagenomes, genetic material recovered directly from environmental samples. Term: Coined in 1998 to refer to the idea that a.
Radiation hybrid map of the zebrafish genome
Assay I HLA-DQ Alpha (A1) Haplotype. Purpose To determine which one of several known alleles is present at the HLA DQ α locus on each of an individual’s.
Virginia Commonwealth University
Quantitative Phylogenetic Assessment of Microbial Communities in Diverse Environments Xinjun Zhang.
Rob Edwards San Diego State University
Microbial genomics.
Biotechnology.
Metagenomic Species Diversity.
Environmental Genome Shotgun Sequencing of the Sargasso Sea
Seminar in Bioinformatics (236818)
DNA Technologies (Introduction)
15.2, slides with notes to write down
Genomes and Their Evolution
Section 3: Gene Technologies in Detail
Very important to know the difference between the trees!
Workshop on the analysis of microbial sequence data using ARB
Human Cells Human genomics
Cloning Overview DNA can be cloned into bacterial plasmids for research or commercial applications. The recombinant plasmids can be used as a source of.
Life on Earth Thought to be ~3.8 billion years old
Chapter 14 Bioinformatics—the study of a genome
Genomic Data Manipulation
Gene Transfer, Genetic Engineering, and Genomics
Today… Review a few items from last class
Genomes and Their Evolution
DNA and the Genome Key Area 8a Genomic Sequencing.
From Mendel to Genomics
Introduction to Sequencing
Human Genome Project Seminal achievement. Scientific milestone.
Volume 27, Issue 9, Pages (May 2017)
Figure Genetic characterization of the novel GYG1 gene mutation (A) GYG1_cDNA sequence and position of primers used. Genetic characterization of the novel.
Toward Accurate and Quantitative Comparative Metagenomics
Presentation transcript:

Environmental Genome Shotgun Sequencing of the Sargasso Sea J. Craig Venter, Karin Remington, John F. Heidelberg, Aaron L. Halpern, Doug Rusch, Jonathan A. Eisen, Dongying Wu, Ian Paulsen, Karen E. Nelson, William Nelson, Derrick E. Fouts, Samuel Levy, Anthony H. Knap, Michael W. Lomas, Ken Nealson, Owen White, Jeremy Peterson, Jeff Hoffman, Rachel Parsons, Holly Baden-Tillson, Cynthia Pfannkoch, Yu-Hui Rogers, Hamilton O. Smith http://science.sciencemag.org/content/304/5667/66/tab-figures-data By: Madona Masoud and Daria Nahidipour

Background In the 90’s microbial diversity was studied using PCR amplification of DNA 1981 - Messing came up with the first description 1983 - Current shotgun sequencing approach developed by Sanger Lab This method of genome sequencing was used by Venter to sequence the human genome. He was NOT affiliated with the Human Genome Project. 2003 - The first time whole-genome shotgun sequencing was used to study ocean microbial diversity at such a large-scale

What’s in the deep blue Sargasso Sea? Located in the middle of the North-Atlantic Ocean No land boundaries and is over 1000 miles wide and 3000 miles long One of the most well-studied region of the global ocean Nutrient-limited FIg 1. Sampling sites are shown in this figure.

Hypothesis There would be a limited amount of genetic diversity of oceanic microorganisms in the Sargasso Sea due to fact that it is a nutrient limited region.

Collection The fleet: RV Weatherbird II and SV Sorcerer II A total of ~1500L of surface seawater Filters were sized to isolate microbes and to exclude eukaryotic DNA (too large) and viruses (too small) Genomic DNA was extracted from the filters and prepared for sequencing SV Sorcerer II: http://www.nshof.org/index.php?option=com_content&view=article&id=791:sorcerer-ii-100-foot-research-sailboat Photo: http://www.jcvi.org/cms/research/projects/gos/photos/

Whole-Genome Shotgun [WGS] Sequencing https://www.youtube.com/watch?v=23iCH3mmifU

http://what-when-how.com/genomics/genome-sequencing-of-microbial-species-genomics/ https://www.sciencedirect.com/topics/neuroscience/shotgun-sequencing ADD IN: Genomic Libraries were constructed (inserts 2-6 kb) In fragmentation, double stranded DNA is used which gives the advantage of paired-end sequencing [sequencing of the fragment from the ends of the clone] and providing sequence information. DNA is isolated from the species, them fragmented. The DNA fragments are then incorporated into cloning vectors Each cloning vector obtain a random sequence The DNA fragments are assembled into contigs which, when are collected in large quantities and supported, can be turned into the complete sequence How the fragments come together which is through having multiple assemblies that complement each other The complete sequence is annotated Final sequence

Summary of WGS Sequencing Advantages: Improved speed and cost Labor saving because sequencing reactions and assembly fully automated Enhanced coverage of the genome Detection of large numbers of DNA polymorphisms Approximation of the abundance Disadvantages Possibility of DNA sequences mislocation due to linking DNA fragments incorrectly. Repetitive sequences Gaps in the DNA assembly that are caused by the repetitive sequences repetitive sequences since the size of these sequences is unknown due to not having a map prior enhanced coverage of the genome taking into account the abundance of each species. The size of repetitive sequences is not known unless there is a map made prior to the one being constructed.

https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4083584/ After the clone, a cycle begins in which the De Novo assembly of DNA fragments happen, then the fragments are sequenced which take the coverage of the data into account. Lastly, the cycle ends with the a new genetic map being constructed or having the new genes integrated into an already existing map through the assembly of contigs together.

Celera Assembler Celera Assembler - de novo whole-genome shotgun (WGS) DNA sequence assembler Reconstructs long sequences of DNA from short fragmented data produced by WGS sequencing w/o use of a reference genome De novo sequence assemblers are a type of program that assembles short nucleotide sequences into longer ones without the use of a reference genome. These are most commonly used in bioinformatic studies to assemble genomes More Info: http://wgs- assembler.sourceforge.net/wiki/index.php?title=Main_Page

Old technique - PCR based rRNA Sequencing Some rRNA genes are not amplified by their designated primers Inaccurate estimates of abundance, diversity, and taxonomic relations based on the rRNA gene numbers https://www.sciencedirect.com/science/article/pii/S0022202X16003481 Example of PCR based rRNA sequencing from a study that was concerned with skin microbes. The microbes were obtained then their DNA was isolated and amplified using 16S rRNA genes. The amplified genes obtained were then assembled. https://www.slideshare.net/aaronsaunders/amplicon-sequencing-introduction sample -> PCR amplification -> sequence read A lot of phylogenetic studies had been done using PCR based rRNA Sequencing, however, there are problems that follow the use of this method mainly bias due to primers inability to amplify certain rRNA genes which did not identify some rRNA whilist using shotgun sequencing identified new, distinct small subunit rRNA genes. In addition, 97% and 99% sequence similarity [not completely accurate] showed novel phylotypes [similarities] between organisms that they later compared with the ones from the RDP II database. A second issue with using PCR based sequencing is that the number of rRNA genes may not vary greatly as it is necessary for every cell type

Gene Conservation of Prochlorococcus Outermost circle is completed sequence of Prochlorococcus marinus genome Inner circles are fragments obtained from environmental sequencing. Black = not conserved Different color = chromosomal rearrangement Takeaway: There are even differences between closely related organisms Genomic differences are present even among closely related organisms in the same genus. Outermost circle is the completed sequence of the Prochlorococcus marinus genome. This species is one of the most abundant photosynthetic organism on the planet and is responsible for a significant fraction of photosynthesis in the oceans. The inner circles are the fragments obtained from environmental sequencing. The black regions not conserved. Different color regions indicate chromosomal rearrangements took place. More info: https://genome.jgi.doe.gov/portal/prom9/prom9.home.html

Comparing sequences Comparison of scaffolds from WGS sequencing of Sargasso Sea to a known sequence Used tBLASTx to perform the comparison tblastx: for identifying nucleotide sequences similar to the query based on their coding potential They are comparing Sargasso Sea scaffolds to clone 4B7, which came from the North Pacific, far away from and totally different environment. Those being compared are individual scaffolds obtained from the WGS sequencing of the Sargasso sea .The red dots represent the similarity of the scaffold to the known sequence. As you can see there is lots of similarity to the known sequence as indicated by mostly red.

9 megaplasmids Genes encoded in the forward and reverse direction paired-end reads of the prepared plasmid clones Different colors show the gene function - magenta = fatty + phospholipid metabolism - white= genes with no known homology - grey = genes of unknown function

Sequence diversity of rhodopsin-like genes Phylogenetic tree of rhodopsin-like genes proteorhodopsin—membrane protein found in marine bacteria, provide a novel mechanism to harness light energy. Additional sequences ID in this study represent a great increase in sequence diversity of genes compared to previous findings Red indicated uncultured species collected from the Sargasso Sea This is a phylogenetic tree of rhodopsinlike genes in Sargasso sea along with homologs of the genes. Phylogenetic anlysis of these genes reveals that additional sequences identified in this study represent a great increase in sequence diversity of these genes in comparison to what has been seen previously. The red indicates sequences from uncultured species, meaning it cannot grow on a Petri dish in a laboratory because not enough is known about its biology.

Results Discovered 148 novel bacterial phylotypes (97% sequence similarity) Identified over 1.2 million unknown genes - 782 new rhodopsin-like photoreceptors Lots of variation in the species present suggests high oceanic microbial diversity The genes they found are published online and available to the public

Conclusion The researchers found a lot more genetic diversity of organisms in this nutrient-poor region of the ocean WGS sequencing has a new use as a tool that can be used in environmental sequencing to find new species We may discover new microbes that can create alternative sources of energy

Limitations Abundant species have great coverage but less common species are only represented by a few sequences Sample collection - Filtering technique was not that effective because eukaryotic DNA was found (18S rRNA) Over-collapsing occurs when well conserved regions assemble across species and cause scaffolds to break → suboptimal assembly

Supplemental Readings 1. Rusch et al. 2007. The Sorcerer II Global Ocean Sampling Expedition: Northwest Atlantic through Eastern Tropical Pacific. PLOS Biology. 5:398-431. 2. Moran et al. Nature. Genome sequence of Silicibacter pomeroyi reveals adaptations to the marine environment. Nature. 432:910–913. 3. Jensen et al. 2016. Application of whole genome shotgun sequencing for detection and characterization of genetically modified organisms and derived products. Bioanal Chem. 408:4595-614. 4. McDermott et al. Whole-Genome Sequencing for Detecting Antimicrobial Resistance in Nontyphoidal Salmonella. Antimicrob Agents Chemother. 60:5515-20. http://journals.plos.org/plosbiology/article?id=10.1371/journal.pbio.0050077 https://www.nature.com/articles/nature03170 https://www.ncbi.nlm.nih.gov/pubmed/27100228 https://www.ncbi.nlm.nih.gov/pubmed/27381390