Doug Brutlag 2011 Bioinformatics Genomics, Bioinformatics.

Slides:



Advertisements
Similar presentations
Martin John Bishop UK HGMP Resource Centre Hinxton Cambridge CB10 1 SB
Advertisements

Doug Brutlag 2011 Genomics Bioinformatics & Medicine Biochem 158/258 and HumBio 158G Doug.
©CMBI 2005 Exploring Protein Sequences - Part 2 Part 1: Patterns and Motifs Profiles Hydropathy Plots Transmembrane helices Antigenic Prediction Signal.
Bioinformatics for biomedicine Summary and conclusions. Further analysis of a favorite gene Lecture 8, Per Kraulis
© Wiley Publishing All Rights Reserved. Analyzing Protein Sequences.
Systems Biology Existing and future genome sequencing projects and the follow-on structural and functional analysis of complete genomes will produce an.
Biology 224 Dr. Tom Peavy Sept 27 & 29 Protein Structure & Analysis.
Protein databases Morten Nielsen. Background- Nucleotide databases GenBank, National Center for Biotechnology Information.
Introduction to Bioinformatics Spring 2008 Yana Kortsarts, Computer Science Department Bob Morris, Biology Department.
Protein RNA DNA Predicting Protein Function. Biochemical function (molecular function) What does it do? Kinase??? Ligase??? Page 245.
Bioinformatics: a Multidisciplinary Challenge Ron Y. Pinter Dept. of Computer Science Technion March 12, 2003.
Introduction to Genomics, Bioinformatics & Proteomics Brian Rybarczyk, PhD PMABS Department of Biology University of North Carolina Chapel Hill.
The Cell, Central Dogma and Human Genome Project.
Tutorial 5 Motif discovery.
Protein analysis and proteomics Friday, 27 January 2006 Introduction to Bioinformatics DA McClellan
What’s next ?? Today 3.3 Protein function 10.3 Protein secondary structure prediction 17.3 Protein tertiary structure prediction 24.3Gene expression &
Introduction to Bioinformatics - Tutorial no. 5 MEME – Discovering motifs in sequences MAST – Searching for motifs in databanks TRANSFAC – The Transcription.
Genomics and Bioinformatics Doug Brutlag Professor Emeritus Biochemistry & Medicine (by courtesy) Computational Molecular Biology Biochem 218 – BioMedical.
Signaling Pathways and Summary June 30, 2005 Signaling lecture Course summary Tomorrow Next Week Friday, 7/8/05 Morning presentation of writing assignments.
DNA Motif and protein domain discovery
Ayesha Masrur Khan Spring Course Outline Introduction to Bioinformatics Definition of Bioinformatics and Related Fields Earliest Bioinformatics.
ExPASy - Expert Protein Analysis System The bioinformatics resource portal and other resources An Overview.
Predicting Function (& location & post-tln modifications) from Protein Sequences June 15, 2015.
Doug Brutlag Professor Emeritus Biochemistry & Medicine (by courtesy) Protein Sequence Databases Computational Molecular Biology Biochem 218 – BioMedical.
Doug Brutlag 2011 Genome Databases Doug Brutlag Professor Emeritus of Biochemistry & Medicine Stanford University School of Medicine Genomics, Bioinformatics.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Computational Molecular Biology Biochem 218 – BioMedical Informatics Gene Regulatory.
BTN323: INTRODUCTION TO BIOLOGICAL DATABASES Day2: Specialized Databases Lecturer: Junaid Gamieldien, PhD
Doug Brutlag Professor Emeritus Biochemistry & Medicine (by courtesy) Genome Databases Computational Molecular Biology Biochem 218 – BioMedical Informatics.
Doug Brutlag 2011 Next Generation Sequencing and Human Genome Databases Doug Brutlag Professor Emeritus of Biochemistry & Medicine Stanford University.
Bioinformatics.
PROTEIN SEQUENCE ANALYSIS. Need good protein sequence analysis tools because: As number of sequences increases, so gap between seq data and experimental.
A brief Introduction to Bioinformatics Y. SINGH NELSON R. MANDELA SCHOOL OF MEDICINE DEPARTMENT OF TELEHEALTH Content licensed under.
Gary Stormo by Andrew Bardee. History Born 1950 in South Dakota Undergraduate in Biology from Caltech PhD in Molecular Biology from University of Colorado.
Biology 224 Instructor: Tom Peavy Feb 21 & 26, Protein Structure & Analysis.
Master’s Degrees in Bioinformatics in Switzerland: Past, present and near future Patricia M. Palagi Swiss Institute of Bioinformatics.
Doug Brutlag 2011 Genomics & Medicine Doug Brutlag Professor Emeritus of Biochemistry &
Introduction to Bioinformatics Spring 2002 Adapted from Irit Orr Course at WIS.
Biological Databases Biology outside the lab. Why do we need Bioinfomatics? Over the past few decades, major advances in the field of molecular biology,
Construction of Substitution Matrices
Monday, November 8, 2:30:07 PM  Ontology is the philosophical study of the nature of being, existence or reality as such, as well as the basic categories.
Harbin Institute of Technology Computer Science and Bioinformatics Wang Yadong Second US-China Computer Science Leadership Summit.
Biological Signal Detection for Protein Function Prediction Investigators: Yang Dai Prime Grant Support: NSF Problem Statement and Motivation Technical.
Protein and RNA Families
Mining Biological Data. Protein Enzymatic ProteinsTransport ProteinsRegulatory Proteins Storage ProteinsHormonal ProteinsReceptor Proteins.
Bioinformatics The Prediction of Life Tony C Smith Department of Computer Science University of Waikato
Breakthrough Technology to Improve the Range and Accuracy of Cas9 Target.
Finding Patterns Gopalan Vivek Lee Teck Kwong Bernett.
EB3233 Bioinformatics Introduction to Bioinformatics.
An overview of Bioinformatics. Cell and Central Dogma.
PROTEIN PATTERN DATABASES. PROTEIN SEQUENCES SUPERFAMILY FAMILY DOMAIN MOTIF SITE RESIDUE.
COMPUTATIONAL ENGINEERING OF BIONANOSTRUCTURES
Sequence Based Analysis Tutorial March 26, 2004 NIH Proteomics Workshop Lai-Su L. Yeh, Ph.D. Protein Science Team Lead Protein Information Resource at.
Bioinformatics Dipl. Ing. (FH) Patrick Grossmann
Intro to Probabilistic Models PSSMs Computational Genomics, Lecture 6b Partially based on slides by Metsada Pasmanik-Chor.
Introduction to Bioinformatics - Tutorial no. 5 MEME – Discovering motifs in sequences MAST – Searching for motifs in databanks TRANSFAC – the Transcription.
COMPUTATIONAL ENGINEERING OF BIONANOSTRUCTURES RAM SAMUDRALA ASSOCIATE PROFESSOR UNIVERSITY OF WASHINGTON How can we design peptides and proteins capable.
 What is MSA (Multiple Sequence Alignment)? What is it good for? How do I use it?  Software and algorithms The programs How they work? Which to use?
Doug Brutlag Professor Emeritus Biochemistry & Medicine (by courtesy) Sequence Alignment Computational Molecular Biology Biochem 218 – BioMedical Informatics.
Protein families, domains and motifs in functional prediction May 31, 2016.
Bioinformatics Overview
생물정보학 Bioinformatics.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
The Future of Genetic Research
Different Genes ~ Protein Primary Structure
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Introduction to Bioinformatics
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Presentation transcript:

Doug Brutlag 2011 Bioinformatics Genomics, Bioinformatics & Medicine Doug Brutlag Professor Emeritus of Biochemistry & Medicine Stanford University School of Medicine

Doug Brutlag 2011 Human Biology 40 th Birthday Friday, October 21, 2011 Friday, October 21, 2011

Doug Brutlag 2011 What is Bioinformatics? RNAProtein DNAPhenotype SelectionEvolution Individuals Populations Biological Information

Doug Brutlag 2011 Computational Goals of Bioinformatics Learn & Generalize: Discover conserved patterns (models) of sequences, structures, metabolism & chemistries from well-studied examples. Prediction: Infer function or structure of newly sequenced genes, genomes, proteomes or proteins from these generalizations. Organize & Integrate: Develop a systematic and genomic approach to molecular interactions, metabolism, cell signaling, gene expression… Basis of systems biology Simulate: Model gene expression, gene regulation, protein folding, protein-protein interaction, protein-ligand binding, catalytic function, metabolism… Goal of systems biology. Engineer: Construct novel organisms or novel functions or novel regulation of genes and proteins. Basis of synthetic biology. Target: Mutations, RNAi to specific genes and transcripts or drugs to specific protein targets. Practical biological and medical use of bioinformatics.

Doug Brutlag 2011 Central Paradigm of Molecular Biology DNARNAProtein Phenotype

Doug Brutlag 2011 MVHLTPEEKT AVNALWGKVN VDAVGGEALG RLLVVYPWTQ RFFESFGDLS SPDAVMGNPK VKAHGKKVLG AFSDGLAHLD NLKGTFSQLS ELHCDKLHVD PENFRLLGNV LVCVLARNFG KEFTPQMQAA YQKVVAGVAN ALAHKYH Genetic Information Central Paradigm of Bioinformatics Phenotype (Symptoms) Biochemical Function Molecular Structure

Doug Brutlag 2011 Central Paradigm of Bioinformatics Molecular Structure Phenotype (Symptoms) Biochemical Function Genetic Information MVHLTPEEKT AVNALWGKVN VDAVGGEALG RLLVVYPWTQ RFFESFGDLS SPDAVMGNPK VKAHGKKVLG AFSDGLAHLD NLKGTFSQLS ELHCDKLHVD PENFRLLGNV LVCVLARNFG KEFTPQMQAA YQKVVAGVAN ALAHKYH

Doug Brutlag 2011 Challenges Understanding Genetic Information Genetic Information Molecular Structure Biochemical Function Phenotype Genetic information is redundant Structural information is redundant

Doug Brutlag 2011 Soybean Leghemoglobin and Sperm Whale Myoglobin Soybean LeghemoglobinSperm Whale Myoglobin

Doug Brutlag 2011 Challenges Understanding Genetic Information Genetic Information Molecular Structure Biochemical Function Phenotype Genetic information is redundant Structural information is redundant Genes and proteins are meta-stable

Doug Brutlag 2011 Challenges Understanding Genetic Information Genetic Information Molecular Structure Biochemical Function Phenotype Genetic information is redundant Structural information is redundant Genes and proteins are meta-stable Genes and proteins are one dimensional but their function depends on three-dimensional structure

Doug Brutlag 2011 Discovering Function from Protein Sequence Sequence Similarity VLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHF------DLSHGS |:| :|: | |:|||| | |:||| |: : :|:| :| | |: | 2 HLTPEEKSAVTALWGKV--NVDEVGGEALGRLLVVYPWTQRFFESFGDLSTPDAVMGN Sequences of Common Structure or Function

Doug Brutlag 2011 Dayhoff’s PAM 250 Amino Acid Replacement Matrix (1978)

Doug Brutlag 2011 Discovering Function from Protein Sequence Consensus Sequences or Sequence Motifs Zinc Finger (C2H2 type) C X{2,4} C X{12} H X{3,5} H Sequence Similarity VLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHF------DLSHGS |:| :|: | |:|||| | |:||| |: : :|:| :| | |: | 2 HLTPEEKSAVTALWGKV--NVDEVGGEALGRLLVVYPWTQRFFESFGDLSTPDAVMGN Sequences of Common Structure or Function

Doug Brutlag 2011 A Typical Motif: Zinc Finger DNA Binding Motif C..C H....H

Doug Brutlag 2011Position A R N D C Q E G H I L K M F P S T W Y V BLOCKs, PRINTs, PSSMS or Weight Matrices Discovering Function from Protein Sequence Consensus Sequences or Sequence Motifs Zinc Finger (C2H2 type) C X{2,4} C X{12} H X{3,5} H Sequence Similarity VLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHF------DLSHGS |:| :|: | |:|||| | |:||| |: : :|:| :| | |: | 2 HLTPEEKSAVTALWGKV--NVDEVGGEALGRLLVVYPWTQRFFESFGDLSTPDAVMGN Sequences of Common Structure or Function

Doug Brutlag 2011 Position-Specific Scoring Matrix for Prokaryotic Helix-Turn-Helix Motifs SequenceHelix TurnHelix RCRO_LAMBDF G Q T K T A K D L G V Y Q S A I N K A I H RCRO_BP434M T Q T E L A T K A G V K Q Q S I Q L I E A RCRO_BPP22G T Q R A V A K A L G I S D A A V S Q W K E RPC1_LAMBDL S Q E S V A D K M G M G Q S G V G A L F N RPC1_BP434L N Q A E L A Q K V G T T Q Q S I E Q L E N RPC1_BPP22I R Q A A L G K M V G V S N V A I S Q W E R RPC2_LAMBDL G T E K T A E A V G V D K S Q I S R W K R LACR_ECOLIV T L Y D V A E Y A G V S Y Q T V S R V V N CRP_ECOLII T Q Q E I G Q I V G C S R E T V G R I L K TRPR_ECOLIM S Q R E L K N E L G A G I A T I T R G S N RPC1_CPP22R G Q R K V A D A L G I N E S Q I S R W K G GALR_ECOLIA T I K D V A R L A G V S V A T V S R V I N Y77_BPT7L S H R S L G E L Y G V S Q S T I T R I L Q TER3_ECOLIL T T R K L A Q K L G V E Q P T L Y W H V K VIVB_BPT7D Y Q A I F A Q Q L G G T Q S A A S Q I D E DEOR_ECOLIL H L K D A A A L L G V S E M T I R R D L N RP32_BACSUR T L E E V G K V F G V T R E R I R Q I E A Y28_BPT7E S N V S L A R T Y G V S Q Q T I C D I R K IMMRE_BPPHS T L E A V A G A L G I Q V S A I V G E E T

Doug Brutlag 2011 Profiles, PSI-BLAST Hidden Markov Models AA1AA2AA3AA4AA5AA6 I 1 I 2 I 3 I 4 I 5 D 2 D 3 D 4 D 5 Discovering Function from Protein Sequence Position A R N D C Q E G H I L K M F P S T W Y V BLOCKs, PRINTs, PSSMS or Weight Matrices Consensus Sequences or Sequence Motifs Zinc Finger (C2H2 type) C X{2,4} C X{12} H X{3,5} H Sequence Similarity VLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHF------DLSHGS |:| :|: | |:|||| | |:||| |: : :|:| :| | |: | 2 HLTPEEKSAVTALWGKV--NVDEVGGEALGRLLVVYPWTQRFFESFGDLSTPDAVMGN Sequences of Common Structure or Function

Doug Brutlag 2011 Data Mining: The Seach for Buried Treasure

Doug Brutlag 2011 Data Mining: The Seach for Buried Treasure

Doug Brutlag 2011 Data Mining: The Seach for Buried Treasure

Doug Brutlag 2011 PROSITE Patterns Active site of trypsin-like serine proteases G D S G G Zinc Finger (C 2 H 2 type) C-X(2,4)-C-X(12)-H-X(3,5)-H N-Glycosylation Site N-[^P]-[S T]-[^P] Homeobox Domain Signature [LIVMF]-X(5)-[LIVM]-X(4)-[IV]-[RKQ]-X-W-X(8)-[RK]

Doug Brutlag 2011 Swiss Institute of Bioinformatics

Doug Brutlag 2011 Expasy Bioinformatics Resource Portal

Doug Brutlag 2011 Expasy Bioinformatics Resource Portal

Doug Brutlag 2011 Prosite Database

Doug Brutlag 2011 UniProt Knowledge Base

Doug Brutlag 2011 UniProt Opsin Entries

Doug Brutlag 2011 UniProt Homo sapiens Opsin Entries

Doug Brutlag 2011 UniProt Homo sapiens OPN1MW Entry