JGI Timeline 1997 JGI April 2003 Human Genome Program Officially Ended Human Genome Program Officially Launched 1990 Joint Genome Institute ………………….(JGI)

Slides:



Advertisements
Similar presentations
Cyber Metagenomics; Challenge to See The Unseen Majority in The Ocean
Advertisements

Cyberinfrastructure for Advanced Marine Microbial Ecology Research and Analysis (CAMERA) Invited Talk CONNECT Board Meeting La Jolla, CA April 26, 2006.
Perspectives from EPA’s Endocrine Disruptor Screening Program
Data and Information Opportunities
Peer Review Essential Questions & Vocabulary social cultural signs systems structures.
1 Developing EPA’s Peer Review Program Joint JIFSAN/SRA/RAC Symposium Dorothy E. Patton, Ph.D., J.D. September 30, 2003.
BIOREMEDIATION Jiří Mikeš.
Jeff Dangl, UNC Chapel Hill Phil Hugenholtz, Susannah Tringe, JGI Ruth Ley, Cornell Rhizosphere Grand Challenge Pilot Project Scott Clingenpeel Project.
Peer Assessment of 5-year Performance ARS National Program 301: Plant, Microbial and Insect Genetic Resources, Genomics and Genetic Improvement Summary.
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey What is Metagenomics?  Traditional microbial genomics 
Genetically Modified Organisms Interactions with Population Health and Safety Chelsea Kadish Tyler Vaughn Ashley Wright.
6/10/2015 ©T. C. Hazen #1 Center for Environmental Biotechnology Center for Environmental Biotechnology Rapid deduction of bacteria stress response pathways:
Office of Science Office of Biological and Environmental Research Susan K. Gregurick, Ph.D. Program Manager Computational Biology & Bioinformatics Biological.
A New Initiative on Earth System Research for Global Sustainability
Oxidizing Pyrite 1.FeS O 2 + H 2 O  Fe SO H + 2.FeS Fe H 2 O  15 Fe SO H Fe O 2.
BSSD Response to the Committee of Visitors June 2008 review report The COV was charged to assess the processes used to: –solicit, review, and recommend.
Genomes To Life User Facilities for 21 st Century Systems Biology A Joint Initiative of the Office of Advanced Scientific Computing Research and the Office.
National Science Foundation Directorate for Biological Sciences Susan J. Lolle, Program Director Developmental Systems Cluster, Division of Integrative.
GTL Facilities Characterization and Imaging of Molecular Machines Lee Makowski.
Overview of the National Science Foundation (NSF) and the Major Research Instrumentation (MRI) Program Office of Integrative Activities National Science.
Sequencing a genome (a) outline the steps involved in sequencing the genome of an organism; (b) outline how gene sequencing allows for genome-wide comparisons.
GTL User Facilities Facility II: Whole Proteome Analysis Michelle V. Buchanan.
Using the Drupal Content Management Software (CMS) as a framework for OMICS/Imaging-based collaboration.
Building a Community Cyberinfrastructure to Support Marine Microbial Ecology Metagenomics Center for Earth Observations and Applications Advisory Committee.
The BIO Directorate Microbial Biology Emphasis BIO Advisory Committee April, 2005.
Molecular Microbial Ecology
Institute of Systems Biology (INBIOSIS)/ School of Biosciences & Biotechnology (Faculty of Science & Technology), Bioinformatics Development in Malaysia.
Beyond the Human Genome Project Future goals and projects based on findings from the HGP.
GTL Facilities Computing Infrastructure for 21 st Century Systems Biology Ed Uberbacher ORNL & Mike Colvin LLNL.
DOE Resources & Facilities for Biological Discovery : Realizing the Potential Presentation to the BERAC 25 April 2002.
Genomics Eddy Rubin December 3, DOE Joint Genome Mission Supported by the DOE Office of Science, JGI unites the expertise of five national laboratories—Lawrence.
Faculty Grant Writing Institute University of California Merced / April 17, 2015 University of California at Merced 1.
Assembling the Tree of Life Diana Lipscomb Program Director, Systematic Biology Program National Science Foundation.
TAIR Workshop Model Organism Databases and Community Annotation Plant and Animal Genome XVI Conference, San Diego January 13, 2008.
Advancing Science with DNA Sequence GENEBOREE A Tool for Collaborative Gene Annotation DOE Joint Genome Institute Integrated Microbial Genomes Annette.
BERAC Subcommittee Report on DOE-JGI Review November 16 – 18, 2005 Walnut Creek, CA.
Microbial Diversity Scott Clingenpeel. Complete Genomes 2671 Prokaryotic genomes in GenBank 114 Eukaryotic genomes – 9 Plants – 52 Animals Do we really.
Advancing Science with DNA Sequence Undergraduate Genomics in a Research University Environment A Collaborative Effort between the JGI and UC Merced M.
GTL User Facilities Facility IV: Analysis and Modeling of Cellular Systems Jim K. Fredrickson.
Keith O. Hodgson SSRL Director Brief Update on the Linac Coherent Light Source - LCLS February 26, 2002 Basic Energy Sciences Advisory Committee Undulator.
ASCAC-BERAC Joint Panel on Accelerating Progress Toward GTL Goals Some concerns that were expressed by ASCAC members.
Current Challenges in Metagenomics: an Overview Chandan Pal 17 th December, GoBiG Meeting.
Genomics of Microbial Eukaryotes Igor Grigoriev Fungal Genomics Program Head US DOE Joint Genome Institute, Walnut Creek, CA.
IPG2P Working Group Update. iPG2P Final deliverable: – Procedure allowing an investigator to begin with trait of interest in species possessing limited.
Outline of Japanese Alliance for Climate Change Observation (JACCO) Shin MIYAZAKI ( Office for Coordination of Climate Change Observation (OCCCO), Japan.
Genomes To Life Biology for 21 st Century A Joint Initiative of the Office of Advanced Scientific Computing Research and Office of Biological and Environmental.
Overview of NSF and the Directorate for Biological Sciences (BIO) Overview of NSF and the Directorate for Biological Sciences (BIO) Tom Brady Division.
Advancing Science with DNA Sequence Response to BERAC Review Overview of JGI BERAC Recommendations –Managing multi-lab environment –Operation of user facility.
International Atomic Energy Agency Roles and responsibilities for development of disposal facilities Phil Metcalf Workshop on Strategy and Methodologies.
U.S. Department of Energy’s Office of Science Dr. Raymond L. Orbach Director, Office of Science April 29, 2004 PRESENTATION FOR THE BIOLOGICAL AND ENVIRONMENTAL.
es/by-sa/2.0/. Metagenomics Prof:Rui Alves Dept Ciencies Mediques Basiques, 1st Floor, Room.
Single-cell genome assembly of marine bacterial communities metabolising plastic waste Robert Sugar 2014.
November 18, 2003 SC’O3 Optimizing Genomic Data Storage for Wide Accessibility Joint Genome Institute (JGI) NERSC Center Computational Research Division.
Systems Microbiology Biology 475. Systems microbiology aims to integrate basic biological information with genomics, transcriptomics, metabolomics, glycomics,
Grants Management Update George O. Strawn NSF Chief Information Officer Advisory Committee for Business and Operations Spring 2006 Meeting.
The Genomics: GTL Program Environmental Remediation Sciences Program Spring Workshop April 3, 2006.
Laurie E. Locascio, Ph.D. Director, MML/NIST NIST/MML: Measurement Assurance for Biological Systems.
“Genomics: The CAMERA Project" Invited Talk 5 th Annual ON*VECTOR International Photonics Workshop UCSD February 28, 2006 Dr. Larry Smarr Director,
High throughput biology data management and data intensive computing drivers George Michaels.
Office of Research and Development Photo image area measures 2” H x 6.93” W and can be masked by a collage strip of one, two or three images. The photo.
Web-Based Builder of Digital Educational Resources Ksenia Klygina et al. NEC’ 2015.
“ Building an Information Infrastructure to Support Microbial Metagenomic Sciences " Presentation to the NBCR Research Advisory Committee UCSD La Jolla,
The National Science Foundation Independent Federal Agency Support for all fields of fundamental science and engineering.
Office of Animal Welfare Assurance (OAWA) Janna Barcelo, BA, CPIA
Joslynn Lee – Data Science Educator
Million Veteran Program Data Marts and Data Access
Production Genomics Facility
Development of Compliance Tools for Metals
Workshop Aims TAMU GO Workshop 17 May 2010.
Setting up an ERIC 11 May 2012 Richard Derksen
Presentation transcript:

JGI Timeline 1997 JGI April 2003 Human Genome Program Officially Ended Human Genome Program Officially Launched 1990 Joint Genome Institute ………………….(JGI) Non Traditional User Facility

US DOE Joint Genome Institute The JGI Post Human Genome Project Community Sequencing Program (CSP) Microbial Community Genomics

Overview The Community Sequencing Program (CSP) To provide the scientific community through a peer reviewed process access to high throughput sequencing at the JGI.

What types of projects will the JGI/CSP accept? A wide range of projects. Ultimately, the most important factor in determining if a project will be accepted is its scientific merit. User Guide > How to Propose a Project

JGI Director Users Proposal Study Panel Scientific Advisory Committee Sequence Allocation Designated Lab Director Proposals & Peer Review Process General Scientific Users Proposals

FAQ What can researchers get from the CSP program? The deliverables can range from raw sequence traces to well-annotated assembled genomes depending on the request in the proposal.

Users Scientific Support for Approved Projects Scientific Support Group SSG Production Sequencing Informatic Analysis Of Sequence Interactions of the JGI and Scientific Users with Approved Sequencing Proposals

DOE Gov Agencies Scientific Support for Approved Projects Scientific Support Group SSG Production Sequencing Informatic Analysis Of Sequence Interactions of the JGI and Scientific Users with Approved Sequencing Proposals (EPA,USDA, NSF) GTL, Microbe CSP

DOE Informatics JGI Science Programs Production Sequencing

DOE+CSP+Gov A Informatics JGI Science Programs Scientific Support Group Production Sequencing

Sequence Based Science at the JGI Gene Regulatory Vocabulary of Animals Studies of Body Plan Evolution Microbial Community Genomics

< 1% of microbes are culturable Many unculturables live in interdependent consortia of considerable diversity Aim: to recover genome-scale sequences and reveal metabolic capabilities What is the structure of natural microbial populations? What is a microbial species? Can we harness their metabolic capabilities

What Enviroments to Study? Ones with minimal microbial complexity

Iron Mountain Jill Banfield et al. UC Berkeley JillBanfieldGene Tyson Phil Hugenholtz UC Berkeley Geology

Iron Mountain Superfund site Discharging >1 ton of toxic metals/day (pH <1) FeS 2

“whole metagenome shotgun” dataset

Purify High Molecular Weight DNA Shotgun Library Construction DNA Sequencing Fosmid Library Construction Fosmid Insert End Sequencing Assembly Annotation ===== === == = ===== === == = = = = ===== === == = = ===== === == = Enviromental Sample

Purify High Molecular Weight DNA DNA Sequencing Fosmid Library Construction Fosmid Insert End Sequencing Assembly Annotation ===== === == = = = ===== === == = = = Shotgun Library Construction ===== === == = ===== === == = Shotgun Library Construction ===== === == = ===== === == = = When possible culture isolates = ? Enviromental Sample

Iron Mtn “whole metagenome shotgun” GC content separates into two components Forward read average G+C Reverse read average G+C archaea bacteria

Iron Mountain “whole metagenome shotgun” GC and depth distributions Read average G+C Read depth 310 Bacterial Lepto II Lepto III

Archaeal Fer 2 Fer 1 (cultured and sequenced ) G-plasma Read average G+C Bacterial Lepto II Lepto III 310 Read depth

Archaeal Fer 2 (3X) Fer 1 (1X) G-plasma (1X) Read average G+C Bacterial Lepto II (3X) Lepto III (1X) 310 Stoichiometry Read depth

Archaeal Fer 2 Fer 1 G-plasma Read average G+C Bacterial Lepto II Lepto III 310 Other sampled genomes at low depth (including eukaryotes) 15% of reads

Similarity to Fer1 (isolate) to Sequence in Community %id to cultivated Fer1 isolate Number of reads 64.9% 78.2% % Fer2 G plasma Fer1 Mixed Community Reads

Conclusions So Far The stochiometry of organisms encouraging for the assembly of individual genomes Assemblies support 16S studies suggesting limited diversity Isolated Fer1 genome sequences matches genome in environmental sample

How do we know that our assembly is correct?

How do you know you’ve done it right? Check pair ends against scaffold At the gross level: check pairs (expect few % due to failing/chimeric clones) Align all reads back against assembled scaffolds scaffolds end where there is no clone coverage in 3kb plasmids Identifies potentially repetitive areas and/or rearrangements How do we know that our assembly is correct?

Fer2 vs. fer1 shows local synteny Fer1 and Fer2 have avg. nt identity of 78% Fer1 gene on contig Fer2 gene on contig

What does it mean to assemble a community genome? Sample derived from millions of genomes. ? What is a “species” in the enviroment? Members of the same species a)significantly different (many lineages survive and diverge) b)highly similar (selective sweeps)

What does it mean to assemble a community genome? Lepto II : 1 nucleotide variation / 3,000 bp Fer II: 2.2 nucleotide variation / 100 bp

1 CONSENSUS gtttatattaaatccattgatttctaagcttccggttcttcttccgtataatggagattt XYG46314.b1 162 A C A A XYG44123.b1 673 A C A A XYG44918.b1 48 A C A XYG13291.g XYG40116.g G XYG3051.b G CONSENSUS atagcttaataattcatcctccatcatacttatgcttgaacctgataatattatgtatag XYG46314.b XYG44123.b XYG13291.g XYG40116.g A XYG3051.b A CONSENSUS ccttgtagtatccattaattcatcaaatattttctgcattatagatataataccatggtt XYG46314.b XYG44123.b XYG13291.g XYG40116.g1 72 T G....C A XYG3051.b2 516 T G....C A Reads of the Same Sequence from 5 Different Members of the Same Species (FerII)

1 CONSENSUS gtttatattaaatccattgatttctaagcttccggttcttcttccgtataatggagattt XYG46314.b1 162 A C A A XYG44123.b1 673 A C A A XYG44918.b1 48 A C A XYG13291.g XYG40116.g G XYG3051.b G CONSENSUS atagcttaataattcatcctccatcatacttatgcttgaacctgataatattatgtatag XYG46314.b XYG44123.b XYG13291.g XYG40116.g A XYG3051.b A CONSENSUS ccttgtagtatccattaattcatcaaatattttctgcattatagatataataccatggtt XYG46314.b XYG44123.b XYG13291.g XYG40116.g1 72 T G....C A XYG3051.b2 516 T G....C A Two Haplotypes Among the 5 Different Members of the Same Species (FerII)

CONSENSUS gtttatattaaatccattgatttctaagcttccggttcttcttccgtataatggagattt XYG46314.b1 162 A C A A XYG44123.b1 673 A C A A XYG44918.b1 48 A C A XYG13291.g XYG40116.g G XYG3051.b G CONSENSUS atagcttaataattcatcctccatcatacttatgcttgaacctgataatattatgtatag XYG46314.b XYG44123.b XYG13291.g XYG40116.g A XYG3051.b A CONSENSUS ccttgtagtatccattaattcatcaaatattttctgcattatagatataataccatggtt XYG46314.b XYG44123.b XYG13291.g XYG40116.g1 72 T G....C A XYG3051.b2 516 T G....C A Two haplotypes Among the 5 Different Members of the Same Species (Fer II)

Polymorphisms occur in blocks Long quiet regions separate highly variable segments Variation is found in blocks of 5-10 genes Local depth % polymorphic sites ORFs

Summary of Iron Mountain Biofilm Limited number of predominant species present in biofilm the majority have never been cultured Several lines of evidence suggest that we can assemble genomes of these organisms Simplicity of community suggests removal of most variants by natural selection Now studying the metabolic capabilities of microbes