INSTRUCTIONS This is the BIOL375 class of 2010-11. These are the students currently working with Dr. Scott on the Meiothermus ruber genome annotation.

Slides:



Advertisements
Similar presentations
1.1.3 MI.
Advertisements

 Preparing undergraduates to succeed in college and beyond in a bioinformatics-rich curriculum  Discussion of existing resources, opportunities, and.
Chapter 17 Table of Contents Section 1 Biodiversity
BIOINFORMATICS Ency Lee.
Computational Molecular Biology (Spring’03) Chitta Baral Professor of Computer Science & Engg.
Using Bioinformatics to Make the Bio- Math Connection The Confessions of a Biology Teacher.
Biological Databases Notes adapted from lecture notes of Dr. Larry Hunter at the University of Colorado.
Integration of Bioinformatics into Inquiry Based Learning by Kathleen Gabric.
Bioinformatics Student host Chris Johnston Speaker Dr Kate McCain.
BLOSUM Information Resources Algorithms in Computational Biology Spring 2006 Created by Itai Sharon.
The Sorcerer II Global ocean sampling expedition Katrine Lekang Global Ocean Sampling project (GOS) Global Ocean Sampling project (GOS) CAMERA CAMERA METAREP.
Genetic Research Using Bioinformatics: LESSON 6:
Subsystem Approach to Genome Annotation National Microbial Pathogen Data Resource Claudia Reich NCSA, University of Illinois, Urbana.
Pathways Database System: An Integrated System For Biological Pathways L. Krishnamurthy, J. Nadeau, G. Ozsoyoglu, M. Ozsoyoglu, G. Schaeffer, M. Tasan.
Zachary Bendiks. Jonathan Eisen  UC Davis Genome Center  Lab focus: “Our work focuses on genomic basis for the origin of novelty in microorganisms (how.
Laboratory Training for Field Epidemiologists Typing May 2007 Sequencing and Phylogeny.
Unit 1: The Language of Science  communicate and apply scientific information extracted from various sources (3.B)  evaluate models according to their.
Lesson 10 Bioinformatics
What a Great Time to Teach & Do Research with Undergrads!! So much data! *** So much for me & my students to do!!!*** So many questions! So many great.
Wellcome Trust Workshop Working with Pathogen Genomes Module 3 Sequence and Protein Analysis (Using web-based tools)
Basic Introduction of BLAST Jundi Wang School of Computing CSC691 09/08/2013.
Judith Kandel, CSU Fullerton Scott Cooper, UW-La Crosse Web-based Problem-Solving Exercises for the Life Sciences.
Sequence Databases What are they and why do we need them.
Compare and contrast prokaryotic and eukaryotic cells.[BIO.4A] October 2014Secondary Science - Biology.
Overview. What is Annotation? Annotation is the process of determining the location and function of all identifiable genes in a genome. Annotation is.
Greengenes: A Tutorial
Identify gene markers for different taxonomic groups in Archaea and Bacteria Genomes Dongying Wu 1,2, Jonathan A. Eisen 1,2 1. DOE Joint Genome Institute,
Copyright OpenHelix. No use or reproduction without express written consent 2 Overview of Genome Browsers Materials prepared by Warren C. Lathe, Ph.D.
Advancing Science with DNA Sequence Undergraduate Genomics in a Research University Environment A Collaborative Effort between the JGI and UC Merced M.
Molecular Biology Primer. Starting 19 th century… Cellular biology: Cell as a fundamental building block 1850s+: ``DNA’’ was discovered by Friedrich Miescher.
Web Apollo and the VectorBase user community Gloria I. Giraldo-Calderón March 31, 2015.
Copyright © 2010 Pearson Education Inc. Lecture 01 – Genetics & Genomics: An Introduction Based on Chapter 1 – Genetics: An introduction.
What is Genetic Research?. Genetic Research Deals with Inherited Traits DNA Isolation Use bioinformatics to Research differences in DNA Genetic researchers.
Organizing information in the post-genomic era The rise of bioinformatics.
Biological Databases Biology outside the lab. Why do we need Bioinfomatics? Over the past few decades, major advances in the field of molecular biology,
Construction of Substitution Matrices
Function preserves sequences Christophe Roos - MediCel ltd Similarity is a tool in understanding the information in a sequence.
ARE THESE ALL BEARS? WHICH ONES ARE MORE CLOSELY RELATED?
Condor: BLAST Monday, July 19 th, 3:15pm Alain Roy OSG Software Coordinator University of Wisconsin-Madison.
Copyright © by Holt, Rinehart and Winston. All rights reserved. ResourcesChapter menu To View the presentation as a slideshow with effects select “View”
Abstract Our current understanding of the taxonomic and phylogenetic diversity of cellular organisms, especially the bacteria and archaea, is mostly based.
Condor: BLAST Rob Quick Open Science Grid Indiana University.
WMU CS 6260 Parallel Computations II Spring 2013 Presentation #1 about Semester Project Feb/18/2013 Professor: Dr. de Doncker Name: Sandino Vargas Xuanyu.
Final Project Bioinformatics for Biologists. Alternative A Alternative B.
Bioinformatics and Computational Biology
Western New York Genetics in Research Partnership Expanding Exposure, Career Exploration and Interactive Projects in Basic Genome Analysis and Bioinformatics.
Condor: BLAST Monday, 3:30pm Alain Roy OSG Software Coordinator University of Wisconsin-Madison.
1 From Mendel to Genomics Historically –Identify or create mutations, follow inheritance –Determine linkage, create maps Now: Genomics –Not just a gene,
Construction of Substitution matrices
Integration of Bioinformatics into Inquiry Based Learning by Kathleen Gabric.
An Introduction to NCBI & BLAST National Center for Biotechnology Information Richard Johnston Pasadena City College.
PROTEIN INTERACTION NETWORK – INFERENCE TOOL DIVYA RAO CANDIDATE FOR MASTER OF SCIENCE IN BIOINFORMATICS ADVISOR: Dr. FILIPPO MENCZER CAPSTONE PROJECT.
Protein Evolution Introducing the use of Biology Workbench as a Bioinformatics Tool.
Taxonomy & Phylogeny. B-5.6 Summarize ways that scientists use data from a variety of sources to investigate and critically analyze aspects of evolutionary.
Bioinformatics What is a genome? How are databases used? What is a phylogentic tree?
Using BLAST to Identify Species from Proteins
How to Use This Presentation
Biological Databases By: Komal Arora.
Sequence based searches:
Pipelines for Computational Analysis (Bioinformatics)
Using BLAST to Identify Species from Proteins
Genomic Data Manipulation
Predict Protein Sequence by Fuzzy-Association Rules
Bioinformatics and BLAST
Overview Bioinformatics: Analyzing biological data using statistics, math modeling, and computer science BLAST = Basic Local Alignment Search Tool Input.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Applying principles of computer science in a biological context
Human Genome Project Seminal achievement. Scientific milestone.
Using BLAST to Identify Species from Proteins
Presentation transcript:

INSTRUCTIONS This is the BIOL375 class of These are the students currently working with Dr. Scott on the Meiothermus ruber genome annotation project. This presentation was created by students in this course. You will need speakers or a headset to hear the narration attached to this presentation. On most pages, you will see a speaker icon like this one. Click on the icon to hear the narration. When finished with one slide, click “enter” to advance to the next slide.

MEIOTHERMUS RUBER Mitch Anliker, Mohammed Hussain, Heather Smith, Melissa Reller, Jose Candelario Orozco

Introduction Background Information on Meiothermus ruber Explain what it means to "annotate" For what Purpose? Click on the speaker Icon to learn more about the Meiothermus ruber project and annotations

BACKGROUND What is Meiothermus ruber? Procaryote: Eubacteria domain Physical characteristics: Thermophile - prefers °F Isolated from hot springs “Pest” in paper mills Non-pathogenic Genome: 3,000,000 base pairs *3,100 protein-coding genes predicted

BACKGROUND CONTINUED Phylum: Thermi Class: Thermi Order: Thermales Family: Thermaceae Genus: Meiothermus Species: ruber Pure science reasons Most thermophiles belong to the Archaea domain DOE’s GEBA project Undergraduate research

W HY STUDY M EIOTHERMUS RUBER ? Practical reasons Contaminant of paper mills Contains an enzyme that digests feathers

W HAT IS A G ENOME A NNOTATION ? A Genome Annotation is a process of attaching biological information to DNA sequences

W HY A NNOTATE M ORE G ENOMES ? Archaea Bacteria

GEBA Genomes *T.P. Curtis, W.T. Sloan, and J.W. Scannell Estimating prokaryotic diversity and its limits. Proc Natl Acad Sci USA 99: Genomic Encyclopedia of Bacteria & Archaea (GEBA) is a massive JGI genome sequencing effort to fill in many of the missing or under-sampled branches of the Bacteria & Archaea trees.

* D. Wu, P. Hugenholtz, K. Mavromatis, et al., A phylogeny-driven genomic encyclopedia of Bacteria and Archaea. Nature 462: First 56 GEBA genomes* filled in several missing or under-sampled branches of the Bacteria trees & showed that there is a lot of genomic diversity out there to be discovered. GEBA continued…

MEIOTHERMUS RUBER GENOME ANNOTATION PROJECT Genome annotation - the process of attaching biological information to DNA sequences o It consists of two main steps:  identifying elements on the genome, a process called Gene Calling, and  attaching biological information to these elements o Technology is called Bioinformatics – using computer programs to analyze sequence information and make predictions Functional genomics – benchtop research o Gene cloning to isolate the gene of interest from the genome o Mutational studies to confirm biological function predictions

M. RUBER G ENOME P ROJECT Is there evidence to support the predictions related to my gene? Large gaps in the types of bacterial genomes studied Learn the tools to analyze your gene prediction Use the tools to collect evidence to support/refute the prediction Form your argument

IMG-ACT Phobius NCBI T-Coffee BLAST Web Logo KEGG PSORT SignalP TIRGfam Phylogeny.fr TMHMM

W HY A NNOTATE WITH S TUDENTS ? Most automated genome annotations - 35% are wrong Automated annotations miss things! Learning new and valuable information is key o Previous knowledge can help you!

A NNOTATION G OALS Develop and strengthen genome annotation skills such as: o Using computer programs to analyze sequence data o Gathering and evaluating information from Web-based community- accessible sequence databases o Evaluating automated gene calls Produce quality annotations for incorporation into the Integrated Microbial Genomes Database Build conceptual understanding of: o Evolutionary relationships among genomes o Genome organization o Power and limitations of bioinformatics o Protein structure and function o Transcriptional and translational signals Develop basic scientific research skills such as: o Reading and evaluating primary literature o Developing hypotheses and interpreting data o Drawing conclusions from a collection of evidence o Working collaboratively o Working with real data

IMG-ACT M ODULAR A NNOTATION Streamline annotation Emphasizes biological root of bioinformatics More easily compatible with education Emphasizes complementarity of tools Allows addition and removal of modules to match student level

ANNOTATION Module TitleDescription Mod 1: Basic InformationDNA coordinates & base sequence, amino acid sequence, pI Mod 2: Sequence-based Similarity Data Sequence alignment, conserved protein domains and protein families Mod 3: Cellular Localization Data Signal peptide sequence, transmembrane domains

M ODULE C ONCEPTS Basic Information

IMG-ACT (JGI): Cheryl Kerfeld Seth Axen Microbial Genome Annotation Network: (NSF RCN-UBE) Lori Scott, PI mgan.jgi-psf.org Acknowledgements