Algorithms in Computational Biology

Slides:



Advertisements
Similar presentations
Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
Advertisements

School of Computer Engineering Master of Science (Bioinformatics) A/P Kwoh Chee Keong 2009 presented by.
August 19, 2002Slide 1 Bioinformatics at Virginia Tech David Bevan (BCHM) Lenwood S. Heath (CS) Ruth Grene (PPWS) Layne Watson (CS) Chris North (CS) Naren.
Bioinformatics For MNW 2 nd Year Jaap Heringa FEW/FALW Integrative Bioinformatics Institute VU (IBIVU) Tel ,
Bioinformatics at IU - Ketan Mane. Bioinformatics at IU What is Bioinformatics? Bioinformatics is the study of the inherent structure of biological information.
Intro to Molecular Genetics RNA & Protein Synthesis 3/16/2011.
RNA and Protein Synthesis
. Class 1: Introduction. The Tree of Life Source: Alberts et al.
Non-coding RNA William Liu CS374: Algorithms in Biology November 23, 2004.
Introduction to Bioinformatics Spring 2008 Yana Kortsarts, Computer Science Department Bob Morris, Biology Department.
Basic Biology for CS262 OMKAR DESHPANDE (TA) Overview Structures of biomolecules How does DNA function? What is a gene? How are genes regulated?
Bioinformatics: a Multidisciplinary Challenge Ron Y. Pinter Dept. of Computer Science Technion March 12, 2003.
Bioinformatics Student host Chris Johnston Speaker Dr Kate McCain.
Introduction to Bioinformatics (Lecture for CS498-CXZ Algorithms in Bioinformatics) Aug. 25, 2005 ChengXiang Zhai Department of Computer Science University.
Algorithms in Computational Biology Tanya Berger-Wolf Compbio.cs.uic.edu/~tanya/teaching/CompBio January 13, 2006.
Signaling Pathways and Summary June 30, 2005 Signaling lecture Course summary Tomorrow Next Week Friday, 7/8/05 Morning presentation of writing assignments.
Bioinformatics Original definition (1979 by Paulien Hogeweg): “application of information technology and computer science to the field of molecular biology”
CISC667, F05, Lec27, Liao1 CISC 667 Intro to Bioinformatics (Fall 2005) Review Session.
9/30/2004TCSS588A Isabelle Bichindaritz1 Introduction to Bioinformatics.
Bioinformatics.
Chapter 13.2 (Pgs ): Ribosomes and Protein Synthesis
1 Bio + Informatics AAACTGCTGACCGGTAACTGAGGCCTGCCTGCAATTGCTTAACTTGGC An Overview پرتال پرتال بيوانفورماتيك ايرانيان.
A brief Introduction to Bioinformatics Y. SINGH NELSON R. MANDELA SCHOOL OF MEDICINE DEPARTMENT OF TELEHEALTH Content licensed under.
COT 6930 HPC and Bioinformatics Introduction to Molecular Biology Xingquan Zhu Dept. of Computer Science and Engineering.
Molecular Biology Primer. Starting 19 th century… Cellular biology: Cell as a fundamental building block 1850s+: ``DNA’’ was discovered by Friedrich Miescher.
Introduction to Bioinformatics Yana Kortsarts References: An Introduction to Bioinformatics Algorithms bioalgorithms.info.
Sevas Educational Society All Rights Reserved, 2008 Module 1 Introduction to Bioinformatics.
Bioinformatics For MNW 2 nd Year Jaap Heringa FEW/FALW Centre for Integrative Bioinformatics VU (IBIVU) Tel ,
8.4 Transcription KEY CONCEPT Transcription converts a gene into a single-stranded RNA molecule.
Introduction to Bioinformatics (Lecture for CS397-CXZ Algorithms in Bioinformatics) Jan. 21, 2004 ChengXiang Zhai Department of Computer Science University.
1 Protein Structure Prediction (Lecture for CS397-CXZ Algorithms in Bioinformatics) April 23, 2004 ChengXiang Zhai Department of Computer Science University.
Introduction to Bioinformatics Dr. Rybarczyk, PhD University of North Carolina-Chapel Hill
AdvancedBioinformatics Biostatistics & Medical Informatics 776 Computer Sciences 776 Spring 2002 Mark Craven Dept. of Biostatistics & Medical Informatics.
Central dogma: the story of life RNA DNA Protein.
EB3233 Bioinformatics Introduction to Bioinformatics.
Bioinformatics and Computational Biology
Introduction to Bioinformatics Algorithms Algorithms for Molecular Biology CSCI Elizabeth White
DNA Structure. The Flow of Genetic Information from DNA to RNA to Protein –DNA functions as the inherited directions for a cell or organism. Copyright.
The Central Dogma of Molecular Biology DNA  RNA  Protein  Trait.
Bioinformatics for Research
/ Computational Genomics
Data-intensive Computing: Case Study Area 1: Bioinformatics
Bioinformatics Madina Bazarova. What is Bioinformatics? Bioinformatics is marriage between biology and computer. It is the use of computers for the acquisition,
The Central Dogma Transcription & Translation
Transcription and Translation
생물정보학 Bioinformatics.
2/23/15 Learning Objectives
High-throughput Biological Data The data deluge
What is Bioinformatics?
RNA Secondary Structure Prediction
Notes – Protein Synthesis: Transcription
Protein Synthesis Part 1: Transcription
Genetics Lesson 4.
Genomes and Their Evolution
Genome organization and Bioinformatics
9 Future Challenges for Bioinformatics
Next Generation Sequencing and Human Genome Databases
Bioinformatics Vicki & Joe.
Bioinformatics For MNW 2nd Year
LESSON 1 INTNRODUCTION HYE-JOO KWON, Ph.D /
Transcription and Translation
Transcription and Translation
Central Dogma
How genes on a chromosome determine what proteins to make
Introduction to Bioinformatic
(Really) Basic Molecular Biology
credit: modification of work by NIH
Introduction to Bioinformatics
Transcription and Translation
Reconfigurable Computing (EN2911X, Fall07)
Presentation transcript:

Algorithms in Computational Biology Tanya Berger-Wolf Compbio.cs.uic.edu/~tanya/teaching/CompBio January 17, 2017

1D, 2D, 3D representation of DNA

The Central Dogma of Molecular Biology DNA -> RNA -> Protein

The Central Dogma of Molecular Biology DNA-RNA-Protein [photo credit (three slides): Bis2A (http://archive.cnx.org/resources/687459d21b78abda0606beeb021c4c0d31f2c942/0324_DNA_Translation_and_Codons.jpg)]

The Central Dogma of Molecular Biology (1) This happens in the cell nucleus, (2) not every letter is transcribed, (3) some are promoter regions involved in regulation of transcription and DNA replicaiton (4) some are “introns,” or untranslated regions of DNA, transcribed to RNA, but then spliced out before translation, (5) others are called “junk DNA” for their apparent lack of understood function, some are retroviral DNA, and some may really do little but space other DNA stretches, or ... possibly nothing.

The Central Dogma of Molecular Biology Process of translation. mRNA (messenger RNA) is processed by a ribosome and uses tRNA (transfer RNA).

The Central Dogma of Molecular Biology 4^3 = 64 possible nucleotide triplet arrangements; result in 20 amino acids. More or less universal! Some organisms have minor departures—which triplet encodes a particular amino acid.

The Central Dogma of Molecular Biology

DNA Sequencing

What is Computational Biology? No standard definition! Our definition: computational techniques for biological problems Data acquisition, management and representation (bioinformatics) Pattern analysis and data mining (bioinformatics) Data analysis and optimization Using bio data to solve other problems (medicine, public policy, etc.) Computational biology touches all parts of computer science Databases Data streaming HPC and systems Networking Algorithms Privacy and security Image processing Visualization http://www.colorbasepair.com/what_is_bioinformatics.html

Why is CompBio Important? Biology perspective More and more biological information is available => need for effectively accessing and using the information As more detailed information is available different questions can be asked (models of evolution) => requires new math Computer science perspective Excellent application domain Poses special computational challenges Brings computer science closer to scientific discovery Currently growing …

CompBio and Other Fields Computer Science Biology Information Management Biochemistry Molecular Biology Bioinformatics/ CompBio Theoretical CS Machine Learning Data Mining Biophysics Numerical Computing Applied Mathematics & Statistics

CompBio and Bioinformatics From Chris Burge’s MIT Open Courseware 7.91/20.490/6.874/HST.506

1980s: Sequence Alignment/Search Which specific residues/positions in a pair of proteins are homologous? Smith-Waterman alignment algorithm What RNA secondary structure has minimum folding free energy? Nussinov algorithm Zuker algorithm How to rapidly and reliably find homologs to a query sequence in a sequence database? FastA and BLAST algorithms and associated statistics Temple F. Smith and Michael S. Waterman Copyright Michael Waterman Ruth Nussinov Michael Zuker

Al Gore Learns to Search PubMed NCBI Director David Lipman (far left) coaches Vice President Gore (seated) as he searches PubMed. NIH Director Harold Varmus (center) and NLM Director Donald Lindberg (far right) look on. June 26, 1997. Photograph by the National Center for Biotechnology Information; in the public domain.

1990s: HMMs, Ab Initio Protein Structure Prediction, Genomics, Comparative Genomics How to identify domains in a protein? How to identify genes in a genome? Hidden Markov Models as a framework for such problems How to study gene expression globally, infer gene function from expression? Microarrays and clustering How to predict protein function by comparing genomes? gene fusions, phylogenetic profiling, etc. How to predict protein structure directly from primary sequence? Rosetta algorithm

2000s Part 1: The human genome is sequenced, assembled, annotated genomics becomes fashionable Criag Venter, public domain image Photo of the Human Genome project pioneers © Mayo Foundation for Medical Education and Research. All rights reserved. Ewan Birney, public domain image Jim Kent, public domain image

2000s Part 2: Biological Experiments Become High-Throughput, Computational Biology Becomes more Biological Courtesy of Marc Vidal. Used with permission. Massively parallel data collection – transcriptomics, proteomics, interactomics, metagenomics Using sequence and array data to address fundamental questions about transcription, splicing, microRNAs, translation, epigenetics, protein structure/function, development, evolution, disease, etc. Integrated computational/experimental approaches Rise of bioimage informatics Courtesy of Marc Vidal. Courtesy of Donald G. Moerman and Benjamin D. Williams. License: CC-BY. Source: Moerman, D. G. and Williams, B. D. "Sarcomere Assembly in C. Elegans Muscle" (January 16, 2006), WormBook, ed. The C. elegans Research Community, WormBook.

Topics in Bioinformatics Genomics Proteomics Transcriptomics Text Mining Biology Literature … … …In this paper, we report the discovery of a new gene that affects DNA reproduction in … Gene expression & regulation Genes Proteins (Function) DNA Sequences Microarray data Protein Sequences AATTCATGAAAATCGTATACTGGTCTGGTACCGGC TGAGAAAATGGCAGAGCTCATCGCTAAAGGTA TCTGGTAAAGACGTCAACACCATCAACGTGTC ACATCGATGAACTGCTGAACGAAGATATCCTG TTGCTCTGCCATGGGCGATGAAGTTCTCGAGG MKIVYWSGTGNTEKMAELIAKGIIESGKDV DELLNEDILILGCSAMGDEVLEESEFEPFIE KVALFGSYGWGDGKWMRDFEERMNGYG PDEAEQDCIEFGKKIANI