Bioinformatics: Data-driven molecular biology Mikhail Gelfand A.A.Kharkevich Institute for Information Transmission Problems, RAS Moscow II Испано-российский.

Slides:



Advertisements
Similar presentations
Martin John Bishop UK HGMP Resource Centre Hinxton Cambridge CB10 1 SB
Advertisements

Control of Gene Expression
LS Chapter 5 Biology Basics Student Learning Outcomes: 1.Explain the biological hierarchy of organization Give examples of each level 2.Explain.
Although humans have used biotechnology for thousands of years, discoveries made in the 1960s and 1970s increased our understanding about cells and molecules,
Genome organization Lesk, Ch 2 (Lesk, 2008). Genomes and proteomes Genome of a typical bacterium comes as a single DNA molecule of about 5 million characters.
CHAPTER 8 Metabolic Respiration Overview of Regulation Most genes encode proteins, and most proteins are enzymes. The expression of such a gene can be.
August 19, 2002Slide 1 Bioinformatics at Virginia Tech David Bevan (BCHM) Lenwood S. Heath (CS) Ruth Grene (PPWS) Layne Watson (CS) Chris North (CS) Naren.
Bioinformatics For MNW 2 nd Year Jaap Heringa FEW/FALW Integrative Bioinformatics Institute VU (IBIVU) Tel ,
Bioinformatics Dr. Aladdin HamwiehKhalid Al-shamaa Abdulqader Jighly Lecture 1 Introduction Aleppo University Faculty of technical engineering.
6/10/2015 ©T. C. Hazen #1 Center for Environmental Biotechnology Center for Environmental Biotechnology Rapid deduction of bacteria stress response pathways:
Bioinformatics: a Multidisciplinary Challenge Ron Y. Pinter Dept. of Computer Science Technion March 12, 2003.
The Cell, Central Dogma and Human Genome Project.
Data visualization in the post-genomics era Carol Morita Genentech, Inc.
Bioinformatics Student host Chris Johnston Speaker Dr Kate McCain.
BI420 – Course information Web site: Instructor: Gabor Marth Teaching.
Cloning, genomes, and proteomes
Paola CASTAGNOLI Maria FOTI Microarrays. Applicazioni nella genomica funzionale e nel genotyping DIPARTIMENTO DI BIOTECNOLOGIE E BIOSCIENZE.
Overview of Bioinformatics A/P Shoba Ranganathan Justin Choo National University of Singapore A Tutorial on Bioinformatics.
Genome Sequencing & App. of DNA Technologies Genomics is a branch of science that focuses on the interactions of sets of genes with the environment. –
Bioinformatics.
Development of Bioinformatics and its application on Biotechnology
AP Biology Ch. 20 Biotechnology.
ERA-NET PathoGenoMics Meeting Bonn 7-8 April, 2005 Research topics of interest in the Area of Genomics of Bacterial and Fungal Pathogens of Humans Prof.
Advanced Algorithms and Models for Computational Biology Class Overview Eric Xing & Ziv Bar-Joseph Lecture 1, January 18, 2005 Reading: Chap. 1, DTM book.
Shankar Subramaniam University of California at San Diego Data to Biology.
Institute of Systems Biology (INBIOSIS)/ School of Biosciences & Biotechnology (Faculty of Science & Technology), Bioinformatics Development in Malaysia.
CEITEC BRNO | CZECH REPUBLIC central european institute of technology CEITEC Genomics and proteomics at MU Jiří Fajkus.
Beyond the Human Genome Project Future goals and projects based on findings from the HGP.
Genome Project and Bioinformatics Dr Tan Tin Wee Director Bioinformatics Centre.
Introduction to Bioinformatics CPSC 265. Interface of biology and computer science Analysis of proteins, genes and genomes using computer algorithms and.
CS 790 – Bioinformatics Introduction and overview.
Igor Ulitsky.  “the branch of genetics that studies organisms in terms of their genomes (their full DNA sequences)”  Computational genomics in TAU ◦
Introduction to Bioinformatics Spring 2002 Adapted from Irit Orr Course at WIS.
Regulation of Gene Expression Eukaryotes
Microbial Models I: Genetics of Viruses and Bacteria 7 November, 2005 Text Chapter 18.
Molecular Biology Primer. Starting 19 th century… Cellular biology: Cell as a fundamental building block 1850s+: ``DNA’’ was discovered by Friedrich Miescher.
Introduction to Molecular Biology and Genomics BMI/CS 576 Mark Craven September 2007.
CSCI 6900/4900 Special Topics in Computer Science Automata and Formal Grammars for Bioinformatics Bioinformatics problems sequence comparison pattern/structure.
Bioinformatics For MNW 2 nd Year Jaap Heringa FEW/FALW Centre for Integrative Bioinformatics VU (IBIVU) Tel ,
CO 1: Ability to explain foundations of modern biotechnology.
Genomics and Arabidopsis. What is ‘genomics’? Study of an organism’s entire genome –All the DNA encoded in the organism –Nucleus, mitochondria, chloroplasts.
Harbin Institute of Technology Computer Science and Bioinformatics Wang Yadong Second US-China Computer Science Leadership Summit.
Genomes To Life Biology for 21 st Century A Joint Initiative of the Office of Advanced Scientific Computing Research and Office of Biological and Environmental.
Biological Signal Detection for Protein Function Prediction Investigators: Yang Dai Prime Grant Support: NSF Problem Statement and Motivation Technical.
Bioinformatics The application of computer technology to the management of biological information
NY Times Molecular Sciences Institute Started in 1996 by Dr. Syndey Brenner (2002 Nobel Prize winner). Opened in Berkeley in Roger Brent,
Genes and Genomic Datasets. DNA compositional biases Base composition of genomes: E. coli: 25% A, 25% C, 25% G, 25% T P. falciparum (Malaria parasite):
EB3233 Bioinformatics Introduction to Bioinformatics.
Functional and Evolutionary Attributes through Analysis of Metabolism Sophia Tsoka European Bioinformatics Institute Cambridge UK.
Bioinformatics and Computational Biology
Genome Biology and Biotechnology The next frontier: Systems biology Prof. M. Zabeau Department of Plant Systems Biology Flanders Interuniversity Institute.
1 From Mendel to Genomics Historically –Identify or create mutations, follow inheritance –Determine linkage, create maps Now: Genomics –Not just a gene,
Trends Biomedical In silico. “Omics” a variety of new technologies help explain both normal and abnormal cell pathways, networks, and processes simultaneous.
EBI is an Outstation of the European Molecular Biology Laboratory. UniProtKB Sandra Orchard.
 What is different between these 2 sequences? GGAATTCCTAGCAAT CCTTAAGGATCGTTA CTACGTGAGGAATTC GATGCACTCCTTAAG.
JOINT INDO-RUSSIAN WORKSHOP " S YSTEMS B IOLOGY AND G ENOME I NFORMATICS OF M. TUBERCULOSIS AND OTHER INFECTIOUS DISEASES" (SBGI’08) Novosibirsk, Russia,
신기술 접목에 의한 신약개발의 발전전망과 전략 LGCI 생명과학 기술원. Confidential LGCI Life Science R&D 새 시대 – Post Genomic Era Genome count ‘The genomes of various species including.
Looking Within Human Genome King abdulaziz university Dr. Nisreen R Tashkandy GENOMICS ; THE PIG PICTURE.
High-throughput data used in bioinformatics
생물정보학 Bioinformatics.
“Proteomics is a science that focuses on the study of proteins: their roles, their structures, their localization, their interactions, and other factors.”
Gene Transfer, Genetic Engineering, and Genomics
Genomes and Their Evolution
Genome organization and Bioinformatics
Bioinformatics: Data-driven molecular biology
Department of Chemical Engineering
Introduction to Bioinformatic
Genome-wide Reconstruction of OxyR and SoxRS Transcriptional Regulatory Networks under Oxidative Stress in Escherichia coli K-12 MG1655  Sang Woo Seo,
Engineering Biological Systems with Synthetic RNA Molecules
Introduction to Bioinformatics
Presentation transcript:

Bioinformatics: Data-driven molecular biology Mikhail Gelfand A.A.Kharkevich Institute for Information Transmission Problems, RAS Moscow II Испано-российский форум по информационным и коммуникационным технологиям Madrid, / IX / 2009

Exponential increase of data volume red – papers (PubMed) blue – sequence fragments (GenBank) green – nucleorides (GenBank) of 18 million papers in PubMed, ~675 thousand have keywords “bioinformat* OR comput*”

622 complete genomes (bacteria)

>45 thousand Google hits on “genome deciphered” Top 10 hits: bioremediation –bacterium Pseudomonas agriculture and biotech –crop and biofuel plant Sorghum –rice medicine –pathogenic bacterium Staphylococcus –SARS (atypical pneumonia) virus –Brugia worm (elephantiasis) individual genome (medicine) –James Watson science / model organism –macaque science / evolution –mammoth (mitochondrial) –platypus

Sequencing is just the beginning Bacterial genome: several million nucleotides 600 through 9,000 genes (~ 90% of a genome codes for proteins) This slide: 0,1% of the Escherichia coli genome Human genome: 3 billion nucleotides, thousand genes polymorphisms (individual differences): ~ 1 for 1000 nucleotides differences between human and chimpanzee: ~ 1 of 100

Not just genomes Other types of large-scale experiments / datasets: State of the genome (gene expression) –methylation –nucleosome positioning –histone modifications Transcriptomics, protein abundance (gene expression) Protein-protein interactions –signaling etc. –functional complexes Protein-DNA interactions (regulation) etc.

Goals Functional annotation of genes and proteins –biological function –regulation (in what conditions) Functional annotation of genomes –metabolic reconstruction and modeling –regulatory networks and development –prediction of organism properties from its genome

Applications: biotechnology Improvement of production strains (chemistry, pharma, food industry) –via modeling of metabolic pathways New enzymes (new functions, stress tolerance) –via sequencing and functional annotation Biofuels –fast-growing, stress-tolerant plants; identification of genes –microbes as producers of ethanol or fatty acids: targeted genome design

Applications: medicine and pharma Personalized medicine –identification of predisposing alleles: lifestyle –pharmacogenomics (metabolic alleles) –diagnostics Drug targets (chronic disease) –analysis of signaling pathways Anti-infectives –identification of drug targets Drug design; identification of drug candidates –modeling of protein structure and interactions of proteins with small molecules

Methods. Integration of data Systems biology: Integration of diverse datasets for one organism Comparative genomics: Simultaneous analysis of genomic data for many organisms Comparative systems biology: understanding the evolution of gene regulation and expression, signaling etc. Comparative structural biology

Bioinformatics in Russia Few high-throughput experiments –Open data –Collaborations –Theory (evolution), methods, algorithms Highlights: –Evolution (IITP RAS) and taxonomy (IPCB MSU) –Regulation (FBB MSU, GosNIIGenetika, IITP RAS, ICaG SB RAS) –Annotation (FBB MSU, IITP RAS) –Protein Structure (IPR RAS, IMB RAS, IPCB MSU, BF MSU) –Modeling Metabolism (IPCB MSU, ICaG SB RAS) Regulation (SpBSPU, ICaG SB RAS) –Drug design (IBMC RAMS)

Research and Training Center “Bioinformatics”, Institute of Information Transmission Problems (5 years: ) Molecular evolution –Alternative splicing as a driver of evolution in eukaryotes –Positive selection Comparative genomics of regulation in bacteria –Evolution of regulatory pathways –Protein-DNA interactions Annotation –Gene recognition –Functional annotation –Regulation

Comparative genomics in action: confirmed predictions Regulatory mechanisms –riboswitches (riboflavin – vitamin B1, thiamin – vitamin B2) –antisense regulation of the methionine-cysteine pathway –role of the ribosome in zinc homeostasis Regulators: NrdR, MtaR/MetR, CmbR, NiaR Enzymes: FadE, ThiN, TenA, CobZ, CobX/CbiZ, PduX, NagP, NagB-II Microcins (capistruin, Burkholderia thailandensis) Transporters –АВС-transporters with universal energizing components: Co, Ni, biotin (vitamin H), thiamin (vitamin B2), riboflavin (vitamin B1) –other: threonin, methionin, oligogalacturonides, N-acetylglucosamin, corrinoids, nyacin, riboflacin, Co Regulatory motifs: nitrogen-fixation, fatty acid biosynthesis, iron homeostasis, catabolism of chitin and pectin Regulatory sites: several dozens

Functional annotation of genomes First Russian bacterial genome, Acholeplasma laidlawii (2008): sequencing and proteomics: Institute of Physico-Chemical Medicine; annotation: IITP: ~1,5 Mb; ~1400 genes. Established function for ~80% genes; metabolic reconstruction

Publications (refereed)

Collaborations European Laboratory of Molecular Biology * Germany –Humboldt University, Berlin –Munich Technical University France –Lyon University United Kingdom –University of East Anglia Spain –Center for Genome Regulation (Barcelona) USA –MIT –Burnham Institute * –Lawrence Berkeley National Laboratory * –Stowers Institute * –Rutgers University China –China-Germany Partner Institute of Molecular Genetics (Shanghai) Industry –Biomax (Germany) –Interated Genomics (USA) Bold: on-going * Former students