Current Challenges in Metagenomics: an Overview Chandan Pal 17 th December, GoBiG Meeting.

Slides:



Advertisements
Similar presentations
CSU IDRC Next Generation Sequencing Core Genomic Sequencing Services.
Advertisements

Use of the genomic data o Reconstruction of metabolic properties o Nature’s Microbiome o NGS in Population Genetics.
Metabarcoding 16S RNA targeted sequencing
DESIGNING THE MICROBIAL RESEARCH COMMONS: AN INTERNATIONAL SYMPOSIUM NATIONAL ACADEMY OF SCIENCES, WASHINGTON, DC, 8-9 OCTOBER 2009 Paul Gilna, B.Sc.,
What Is Genomics? Genomics is the study of how the entire genome of a species functions as a unit and evolves over time. It is the study of life’s blueprint,
Bioinformatics caacaagccaaaactcgtacaaCgagatatctcttggaaaaactgctcacaatattgacgtacaaggttgttcatgaaactttcggtaAcaatcgttgacattgcgacctaatacagcccagcaagcagaat Managing.
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey What is Metagenomics?  Traditional microbial genomics 
Systems Biology Existing and future genome sequencing projects and the follow-on structural and functional analysis of complete genomes will produce an.
Bioinformatics for Whole-Genome Shotgun Sequencing of Microbial Communities By Kevin Chen, Lior Pachter PLoS Computational Biology, 2005 David Kelley.
Greg Phillips Veterinary Microbiology
We are developing a web database for plant comparative genomics, named Phytome, that, when complete, will integrate organismal phylogenies, genetic maps.
The Sorcerer II Global ocean sampling expedition Katrine Lekang Global Ocean Sampling project (GOS) Global Ocean Sampling project (GOS) CAMERA CAMERA METAREP.
High Throughput Sequencing
Subsystem Approach to Genome Annotation National Microbial Pathogen Data Resource Claudia Reich NCSA, University of Illinois, Urbana.
A Comprehensive Workflow for Microbial Genome Sequencing From Swab to Publication Madison I. Dunitz 1, David A. Coil 1, Jenna M. Lang 1, Guillaume Jospin.
NGS Data Generation Dr Laura Emery. Overview The NGS data explosion Sequencing technologies An example of a sequencing workflow Bioinformatics challenges.
Introduction to metagenomics Agnieszka S. Juncker Center for Biological Sequence Analysis Technical University of Denmark.
Bioinformatics Core Facility Ernesto Lowy February 2012.
Discussion on Metagenomic Data for ANGUS Course Adina Howe.
Molecular Microbial Ecology
Discovery of new biomarkers as indicators of watershed health and water quality Anamaria Crisan & Mike Peabody.
From Metagenomic Sample to Useful Visual Anna Shcherbina 01/10/ Anna Shcherbina Bioinformatics Challenge Day 02/02/2013 From Metagenomic Sample to.
Beyond the Human Genome Project Future goals and projects based on findings from the HGP.
H = -Σp i log 2 p i. SCOPI Each one of the many microbial communities has its own structure and ecosystem, depending on the body environment it exists.
Introduction to next generation sequencing Rolf Sommer Kaas.
Accurate estimation of microbial communities using 16S tags Julien Tremblay, PhD
IPlant cyberifrastructure to support ecological modeling Presented at the Species Distribution Modeling Group at the American Museum of Natural History.
P. Tang ( 鄧致剛 ); RRC. Gan ( 甘瑞麒 ); PJ Huang ( 黄栢榕 ) Bioinformatics Center, Chang Gung University. Genome Sequencing Genome Resequencing De novo Genome.
DAN LAWSON BRC 2011 – ANNUAL MEETING UT SOUTHWESTERN MEDICAL CENTER DALLAS, TX SEPTEMBER 2011 Challenges and opportunities of new sequencing technologies.
Biodiversity initiative: Integrating Taxonomy, Genomics and Biodiversity ++ = ????? Speaker: Benjamin Linard Alfried Vogler Team.
Advancing Science with DNA Sequence Metagenome definitions: a refresher course Natalia Ivanova MGM Workshop September 12, 2012.
Chapter 21 Eukaryotic Genome Sequences
cloud-based platform: application for genomic epidemiology studies Rodrigo Jardim & Alberto M. R. Dávila (Computational and Systems Biology.
NextGen Pipeline: Enabling the Plant Science Community Tom Brutnell (lead), Steve Rounsley (co-lead), Matt Vaughn (Engagement Lead) Ed Buckler, Justin.
Tsute (George) Chen Bioinformatics Core Department of Microbiology The Forsyth Institute March 24 th, 2015 HOMD A Tour to the Data and Tools.
Genomics Core Facility at UNH: High-Throughput Sequencing on the Illumina HiSeq 2500 Platform Project Consultation Sample Submission Library Creation Illumina.
UK NGS Sequencing Update July 2009 Dr Gerard Bishop - Division of Biology Dr Sarah Butcher – Centre for Bioinformatics.
Analyzing Time Course Data: How can we pick the disappearing needle across multiple haystacks? IEEE-HPEC Bioinformatics Challenge Day Dr. C. Nicole Rosenzweig.
Analysis and comparison of very large metagenomes with fast clustering and functional annotation Weizhong Li, BMC Bioinformatics 2009 Present by Chuan-Yih.
Genome annotation and search for homologs. Genome of the week Discuss the diversity and features of selected microbial genomes. Link to the paper describing.
Metagenomics at Second Genome
Anna Shcherbina Bioinformatics Challenge Day 01/10/2013 De novo assembly from clinical sample This work is sponsored by the Defense Threat Reduction Agency.
es/by-sa/2.0/. Metagenomics Prof:Rui Alves Dept Ciencies Mediques Basiques, 1st Floor, Room.
Accurate estimation of microbial communities using 16S tags
Habitat-Lite & EnvO Jin Mao Postdoc, School of Information, University of Arizona Nov. 20, 2015.
__________________________________________________________________________________________________ Fall 2015GCBA 815 __________________________________________________________________________________________________.
First of all: “Darnit Jim, I’m a doctor not a bioinformatician!”
Environmental Genome Shotgun Sequencing of the Sargasso Sea Venter et. al (2004) Presented by Ken Vittayarukskul Steven S. White.
Canadian Bioinformatics Workshops
Canadian Bioinformatics Workshops
Convenience Sample of 4 Adults and 6 Infants. Adults 4 visits over 2 weeks; infants 2 visits over 2 weeks Adult specimens: 1) plaque (by method, teeth,
Gene prediction in metagenomic fragments: A large scale machine learning approach Katharina J Hoff, Maike Tech, Thomas Lingner, Rolf Daniel, Burkhard Morgenstern.
From Reads to Results Exome-seq analysis at CCBR
Million Veteran Program: Industry Day Genomic Data Processing and Storage Saiju Pyarajan, PhD and Philip Tsao, PhD Million Veteran Program: Industry Day.
Metagenomic Species Diversity.
Quality Control & Preprocessing of Metagenomic Data
Seminar in Bioinformatics (236818)
Considerations for metagenomics data analysis and summary of workflows
Canadian Bioinformatics Workshops
Toward Next Generation Biodiversity Research
Workshop on the analysis of microbial sequence data using ARB
Metagenomics Image: Iverson et al. 2012, Science.
Introduction to G-OnRamp
Overview of Microbial Pathway and Genome Databases
Metagenomics Microbial community DNA extraction
Volume 21, Issue 8, Pages (August 2014)
Example usage of mockrobiota MC resource for marker gene and metagenome sequencing pipelines. Example usage of mockrobiota MC resource for marker gene.
Genome resolved metagenomics
Toward Accurate and Quantitative Comparative Metagenomics
General overview of the bioinformatic pipelines for the 16S rRNA gene microbial profiling and shotgun metagenomics. General overview of the bioinformatic.
Presentation transcript:

Current Challenges in Metagenomics: an Overview Chandan Pal 17 th December, GoBiG Meeting

Why Metagenomics? Study of microbial communities Unculturable or difficult to grow in lab Understanding microbial population, diversity

Metagenomic Workflow Environmental Sampling Library preparation Sequencing (NGS platforms) Quality controlling of raw reads (duplicated reads, low quality reads) Read alignment to reference databases Assembling the reads into longer contigs Predicting genes and assigning function to the genes

Challenges Sampling  bioinformatics analysis Sampling (biodiversity richness, evenness, Extreme environments, DNA extraction) Analysing replicates (soil samples, water samples etc.) Identification of Microbial communities - Based on knowledge of similar organisms - Variability among closely related species (e.g. E.coli K12 vs 0157:H7)

Challenges Large number of sequencing data, data storage, processing, lack of supercomputing resources, no comprehensive pipeline Sequence artifacts (sequence clustering: CD-Hit, SEED) (errors: SLP, pyronoise, Ampliconnoise)) Metagenomic Assembly (gene rearrangements) - highly conserved sequences from different species - assembling all reads into single sequence - lateral gene transfer - low coverage - Speed (GPU, Cloud computing) Gene prediction (frameshits at low coverage) & Functional annotation (homology based, depends on Public database)

Challenges Quality and Context of metadata (meaningful comparison between studies) Integrative approach Working on the Server or cloud computing Data submission (one Illumina run= 600 Gb = 4-5 times fastq raw files) Reverse metagenomics approach Biotechnological applications