Bioinformatics David Brodin BEA core facility MOLEKYLÄRBIOLOGI MED GENETIK – BIOINFORMATIK HT -07 Course web page:

Slides:



Advertisements
Similar presentations
UTACCEL 2010 Adventures in Biotechnology Graham Cromar.
Advertisements

Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
Bioinformatics What is bioinformatics? Why bioinformatics? The major molecular biology facts Brief history of bioinformatics Typical problems of bioinformatics:
Bioinformatics at WSU Matt Settles Bioinformatics Core Washington State University Wednesday, April 23, 2008 WSU Linux User Group (LUG)‏
Bioinformatics at IU - Ketan Mane. Bioinformatics at IU What is Bioinformatics? Bioinformatics is the study of the inherent structure of biological information.
1 Genetics The Study of Biological Information. 2 Chapter Outline DNA molecules encode the biological information fundamental to all life forms DNA molecules.
. Class 1: Introduction. The Tree of Life Source: Alberts et al.
Introduction to Bioinformatics Spring 2008 Yana Kortsarts, Computer Science Department Bob Morris, Biology Department.
Computational Molecular Biology (Spring’03) Chitta Baral Professor of Computer Science & Engg.
Bioinformatics: a Multidisciplinary Challenge Ron Y. Pinter Dept. of Computer Science Technion March 12, 2003.
Data-intensive Computing: Case Study Area 1: Bioinformatics B. Ramamurthy 6/17/20151.
Biological Databases Notes adapted from lecture notes of Dr. Larry Hunter at the University of Colorado.
Introduction to Genomics, Bioinformatics & Proteomics Brian Rybarczyk, PhD PMABS Department of Biology University of North Carolina Chapel Hill.
Use of Ontologies in the Life Sciences: BioPax Graciela Gonzalez, PhD (some slides adapted from presentations available at
August 29, 2002InforMax Confidential1 Vector PathBlazer Product Overview.
CHAPTER 15 Microbial Genomics Genomic Cloning Techniques Vectors for Genomic Cloning and Sequencing MS2, RNA virus nt sequenced in 1976 X17, ssDNA.
Bioinformatics Student host Chris Johnston Speaker Dr Kate McCain.
Modeling Functional Genomics Datasets CVM Lesson 1 13 June 2007Bindu Nanduri.
Bioinformatics Original definition (1979 by Paulien Hogeweg): “application of information technology and computer science to the field of molecular biology”
ExPASy - Expert Protein Analysis System The bioinformatics resource portal and other resources An Overview.
Proteomics Understanding Proteins in the Postgenomic Era.
Computational Structural Bioinformatics ECS129 Instructor: Patrice Koehl
From T. MADHAVAN, & K.Chandrasekaran Lecturers in Zoology.. EXIT.
Bioinformatics Jan Taylor. A bit about me Biochemistry and Molecular Biology Computer Science, Computational Biology Multivariate statistics Machine learning.
A number of slides taken/modified from:
Srihari-CSE730-Spring 2003 CSE 730 Information Retrieval of Biomedical Text and Data Inroduction.
9/30/2004TCSS588A Isabelle Bichindaritz1 Introduction to Bioinformatics.
Development of Bioinformatics and its application on Biotechnology
Bioinformatics Sean Langford, Larry Hale. What is it?  Bioinformatics is a scientific field involving many disciplines that focuses on the development.
Bioinformatics and it’s methods Prepared by: Petro Rogutskyi
Chapter 14 Genomes and Genomics. Sequencing DNA dideoxy (Sanger) method ddGTP ddATP ddTTP ddCTP 5’TAATGTACG TAATGTAC TAATGTA TAATGT TAATG TAAT TAA TA.
Genomics, Proteomics, and Bioinformatics Biology 224 Instructor: Tom Peavy January 29, 2008.
Beyond the Human Genome Project Future goals and projects based on findings from the HGP.
Manifestations of a Code Genes, genomes, bioinformatics and cyberspace – and the promise they hold for biology education.
A New Oklahoma Bioinformatics Company. Microarray and Bioinformatics.
Introduction to Bioinformatics Spring 2002 Adapted from Irit Orr Course at WIS.
Finish up array applications Move on to proteomics Protein microarrays.
BIOINFORMATICS IN BIOCHEMISTRY Bioinformatics– a field at the interface of molecular biology, computer science, and mathematics Bioinformatics focuses.
20.1 Structural Genomics Determines the DNA Sequences of Entire Genomes The ultimate goal of genomic research: determining the ordered nucleotide sequences.
What is Genetic Research?. Genetic Research Deals with Inherited Traits DNA Isolation Use bioinformatics to Research differences in DNA Genetic researchers.
Biological Databases Biology outside the lab. Why do we need Bioinfomatics? Over the past few decades, major advances in the field of molecular biology,
REMINDERS 2 nd Exam on Nov.17 Coverage: Central Dogma of DNA Replication Transcription Translation Cell structure and function Recombinant DNA technology.
Predicting protein degradation rates Karen Page. The central dogma DNA RNA protein Transcription Translation The expression of genetic information stored.
Introduction to Bioinformatics (Lecture for CS397-CXZ Algorithms in Bioinformatics) Jan. 21, 2004 ChengXiang Zhai Department of Computer Science University.
Biological Signal Detection for Protein Function Prediction Investigators: Yang Dai Prime Grant Support: NSF Problem Statement and Motivation Technical.
Introduction to Bioinformatics Dr. Rybarczyk, PhD University of North Carolina-Chapel Hill
The Mammalian Protein – Protein Interaction Database and Its Viewing System That Is Linked to the Main FANTOM2 Viewer Genome Research (2003) Speaker: 蔡欣吟.
Proteomics Session 1 Introduction. Some basic concepts in biology and biochemistry.
BIOLOGICAL DATABASES. BIOLOGICAL DATA Bioinformatics is the science of Storing, Extracting, Organizing, Analyzing, and Interpreting information in biological.
Data Mining and Decision Trees 1.Data Mining and Biological Information 2.Data Mining and Machine Learning Techniques 3.Decision trees and C5 4.Applications.
Central dogma: the story of life RNA DNA Protein.
Application of Bioinformatics in Genetic Research Instructors: Dr. Henry Baker Dr. Luciano Brocchieri Dr. Michele Tennant Dr. Lei Zhou
Bioinformatics and Computational Biology
The iPlant Collaborative Vision Enable life science researchers and educators to use and extend cyberinfrastructure.
Integrated Genomic and Proteomic Analyses of a Systematically Perturbed Metabolic Network Science, Vol 292, Issue 5518, , 4 May 2001.
The iPlant Collaborative Vision Enable life science researchers and educators to use and extend cyberinfrastructure.
An Introduction to NCBI & BLAST National Center for Biotechnology Information Richard Johnston Pasadena City College.
BIOINFORMATICS Ayesha M. Khan Spring 2013 Lec-8.
Biotechnology and Bioinformatics: Bioinformatics Essential Idea: Bioinformatics is the use of computers to analyze sequence data in biological research.
1Introduction 1.0
Biological Databases By: Komal Arora.
Data-intensive Computing: Case Study Area 1: Bioinformatics
Bioinformatics Madina Bazarova. What is Bioinformatics? Bioinformatics is marriage between biology and computer. It is the use of computers for the acquisition,
생물정보학 Bioinformatics.
Mangaldai College, Mangaldai
Genomes and Their Evolution
9 Future Challenges for Bioinformatics
The Study of Biological Information
Introduction to Bioinformatic
Introduction to Bioinformatics
Presentation transcript:

Bioinformatics David Brodin BEA core facility MOLEKYLÄRBIOLOGI MED GENETIK – BIOINFORMATIK HT -07 Course web page:

Introduction to Bioinformatics -History of Bioinformatics -Need for computers -Computational Biologi -Fields of Bioinformatics -Bioinformatic tools Homologi, sekvensanalys och fylogenetik Introduction to Microarrays & Lab Lecture Content Monday Tuesday Wednesday Mass spectrometry Web Databases, bioinformatic tools etc Genotyping Arrays Tiling Arrays Computer Lab

Major advances in the field of molecular biology genomic technologies Explosive growth in the biological information generated by the scientific community Need of computerized databases to store, organize, and index the data and for specialized tools to view and analyze the data Need for Computers what computer science is to molecular biology is like what mathematics has been to physics Larry Hunter, ISMB’94

History of DNA Sequencing History of Bioinformatics Adapted from Messing & Llaca, PNAS (1998)

History of Bioinformatics

Early database: The Atlas of Protein Sequences was available on Digital Tape in 1978, and by modem Early programs: restriction enzyme sites, pattern finding, promoters, etc… circa : DDBJ/EMBL/GenBank are created as a public repository of genetic sequence information. 1983: NIH funds the PIR (Protein Information Resource) database. 1988: Pearson and Lipman create FASTA Number of published base pairs 1971 First published DNA sequence PhiX174 5, Lambda 48, Yeast Chromosome III 316, Haemophilus influenza 1,830, Saccharomyces 12,068, C. elegans 97,000, D. melanogaster 120,000, H. sapines (draft) 2,600,000, H. sapiens 2,850,000,000 History of Bioinformatics

Bioinformatics: Research, development, or application of computational tools and approaches for expanding the use of biological, medical, behavioral or health data, including those to acquire, store, organize, archive, analyze, or visualize such data. Computational Biology: The development and application of data-analytical and theoretical methods, mathematical modeling and computational simulation techniques to the study of biological, behavioral, and social systems. -National Institute of Health (NIH) Computational Biology In the early days of bioinformatics a major concern was creation and maintenance of databases to store biological information, involving design issues and development of complex user interfaces. Today the most pressing task involves the analysis and interpretation of various types of data, including nucleotide and amino acid sequences, protein domains, and protein structures. The actual process of analyzing and interpreting data is referred to as computational biology. Biology in the 21st century is being transformed from a lab-based science to an information science as well.

easy access to the information a method for extracting only that information needed to answer a specific biological question Biological Databases A biological database is a large, organized body of persistent data, usually associated with computerized software designed to update, query, and retrieve components of the data stored within the system. For researchers to benefit from the data stored in a database, two additional requirements must be met: the input sequence with a description of the type of molecule ID of sequence the scientific name of the source organism contact name literature citations associated with the sequence. Nucleotide sequence record:

Analysis and interpretation of various types of biological data Developement of new algorithms and statistics with which to asses biological information Development and implementation of tools that enable efficient access and management of different types of information Sub-disciplines, challenges & goals Need to feel comfortable in interdisciplinary area Depend on others for primary data Need to address important biological and computer science problems Challenges of working with bioinformat.: Important sub-disciplines: Important goal of bioinformatics: understanding basic biological processes and, in turn, advances in the diagnosis, treatment, and prevention of many genetic diseases.

Genomes Nucleotide Sequences Protein Sequences Macromolecular Structures Small Molecules Gene Expression Molecular Interactions Reactions & Pathways Protein Families Taxonomy Ontologies Sequence Similarity & Analysis Structure Analysis Fields of Bioinformatics The ”omics” Series: Genomics: Gene identification & characterization Transcriptomics: Expression profiles of mRNA Proteomics:Functions & interactions of proteins Structural Genomics: Large scale structure determination Cellinomics: Metabolic Pathways, Cell-cell interactions Pharmacogenomics: Genome-based drug design

I cloned a gene –is it a known gene? Does the sequence match? Is the sequence any good? Is the sequence similar to other known sequences? Which gene family does it belong to? The gene I´m interested in was found in another organism, but not in mine. How can I look for it? How is the gene expressed in different types of tissues? What is the biological function of the protein encoded by the gene? Is the gene associated with any disease? Typical Questions Biological problems that computers can help with: Increasingly, biological studies begin with a scientist conducting vast numbers of database and web site searches to formulate specific hypotheses or to design large-scale experiments.

Many different bioinformatic tools avaiable over the internet free of charge to whoever wishes to use them Also many commersial software packages avaiable Some bioinformaticians write their own tools for specialized tasks Bioinformatic tools Many platforms avaiable for software development...

Open Source in the life sciences: Present in all areas of bioinformatics Some very well known examples of tools used in industry and academic circles include: –BLAST –EMBOSS –EnsEMBL –GenScan –Bioconductor Open Access: Unrestricted access to data Allows all to work and make discoveries Discoveries are not necessarily open access Open access is applicable to any kind of data you want to apply it to: –Sequence data (DNA, RNA or protein) –Gene expression data –Protein-protein interaction data –Publication Open Source & Open Access

Precise, predictive model of transcription initiation and termination: ability to predict where and when transcription will occur in a genome Precise, predictive model of RNA splicing/alternative splicing: ability to predict the splicing pattern of any primary transcript in any tissue Precise, quantitative models of signal transduction pathways: ability to predict cellular responses to external stimuli Determining effective protein:DNA, protein:RNA and protein:protein recognition codes Accurate ab initio protein structure prediction Rational design of small molecule inhibitors of proteins Mechanistic understanding of protein evolution: understanding exactly how new protein functions evolve Mechanistic understanding of speciation: molecular details of how speciation occurs Continued development of effective gene ontologies - systematic ways to describe the functions of any gene or protein Education: development of appropriate bioinformatics curricula for secondary, undergraduate and graduate education Top 10 Future Challenges Chris Burge, Ewan Birney, Jim Fickett. Genome Technology, issue No. 17, January, 2002