Algorithms for Biological Sequence Analysis Kun-Mao Chao ( 趙坤茂 ) Department of Computer Science and Information Engineering National Taiwan University,

Slides:



Advertisements
Similar presentations
Creating NCBI The late Senator Claude Pepper recognized the importance of computerized information processing methods for the conduct of biomedical research.
Advertisements

Finding regulatory modules from local alignment - Department of Computer Science & Helsinki Institute of Information Technology HIIT University of Helsinki.
BIOINFORMATICS Ency Lee.
Bioinformatics What is bioinformatics? Why bioinformatics? The major molecular biology facts Brief history of bioinformatics Typical problems of bioinformatics:
Bioinformatics Dr. Aladdin HamwiehKhalid Al-shamaa Abdulqader Jighly Lecture 1 Introduction Aleppo University Faculty of technical engineering.
Archives and Information Retrieval
. Class 1: Introduction. The Tree of Life Source: Alberts et al.
Introduction to Bioinformatics Spring 2008 Yana Kortsarts, Computer Science Department Bob Morris, Biology Department.
Bioinformatics: a Multidisciplinary Challenge Ron Y. Pinter Dept. of Computer Science Technion March 12, 2003.
Bioinformatics and Phylogenetic Analysis
Bioinformatics Lecture 2. Bioinformatics: is the computational branch of molecular biology Using the computer software to analyze biological data The.
Biological Databases Chi-Cheng Lin, Ph.D. Associate Professor Department of Computer Science Winona State University – Rochester Center
Signaling Pathways and Summary June 30, 2005 Signaling lecture Course summary Tomorrow Next Week Friday, 7/8/05 Morning presentation of writing assignments.
CISC667, F05, Lec27, Liao1 CISC 667 Intro to Bioinformatics (Fall 2005) Review Session.
A Whirlwind Tour of Bioinformatics Kun-Mao Chao ( 趙坤茂 ) National Taiwan University
An Introduction to Bioinformatics Molecular Biology Databases.
Presented by Liu Qi An introduction to Bioinformatics Algorithms Qi Liu
341: Introduction to Bioinformatics Dr. Natasa Przulj Deaprtment of Computing Imperial College London
What is Bioinformatics?. Conceptualizing biology in terms of molecules and then applying “informatics” techniques from math, computer science, and statistics.
Bioinformatics.
Development of Bioinformatics and its application on Biotechnology
Algorithms for Biological Sequence Analysis Kun-Mao Chao ( 趙坤茂 ) Department of Computer Science and Information Engineering National Taiwan University,
Databases in Bioinformatics and Systems Biology Carsten O. Daub Omics Science Center RIKEN, Japan May 2008.
Chapter 14 Genomes and Genomics. Sequencing DNA dideoxy (Sanger) method ddGTP ddATP ddTTP ddCTP 5’TAATGTACG TAATGTAC TAATGTA TAATGT TAATG TAAT TAA TA.
Bioinformatics for biomedicine
Introduction to Bioinformatics CPSC 265. Interface of biology and computer science Analysis of proteins, genes and genomes using computer algorithms and.
Genome Organization and Evolution. Assignment For 2/24/04 Read: Lesk, Chapter 2 Exercises 2.1, 2.5, 2.7, p 110 Problem 2.2, p 112 Weblems 2.4, 2.7, pp.
20.1 Structural Genomics Determines the DNA Sequences of Entire Genomes The ultimate goal of genomic research: determining the ordered nucleotide sequences.
Molecular Biology Primer. Starting 19 th century… Cellular biology: Cell as a fundamental building block 1850s+: ``DNA’’ was discovered by Friedrich Miescher.
CS397-CXZ Algorithms in Bioinformatics ChengXiang (“Cheng”) Zhai, Robert Skeel (Department of Computer Science) Nick Sahinidis (Department of Chemical.
Copyright © 2010 Pearson Education Inc. Lecture 01 – Genetics & Genomics: An Introduction Based on Chapter 1 – Genetics: An introduction.
Organizing information in the post-genomic era The rise of bioinformatics.
Introduction to Bioinformatics Biostatistics & Medical Informatics 576 Computer Sciences 576 Fall 2008 Colin Dewey Dept. of Biostatistics & Medical Informatics.
Selected Videos for Biomedical Informatics Kun-Mao Chao ( 趙坤茂 ) National Taiwan University
Multiple Sequence Alignment Kun-Mao Chao ( 趙坤茂 ) Department of Computer Science and Information Engineering National Taiwan University, Taiwan WWW:
ARE THESE ALL BEARS? WHICH ONES ARE MORE CLOSELY RELATED?
Web Databases for Drosophila Introduction to FlyBase and Ensembl Database Wilson Leung6/06.
Overview of Bioinformatics 1 Module Denis Manley..
AdvancedBioinformatics Biostatistics & Medical Informatics 776 Computer Sciences 776 Spring 2002 Mark Craven Dept. of Biostatistics & Medical Informatics.
A Whirlwind Tour of Bioinformatics Kun-Mao Chao ( 趙坤茂 ) National Taiwan University
Algorithms for Biological Sequence Analysis Kun-Mao Chao ( 趙坤茂 ) Department of Computer Science and Information Engineering National Taiwan University,
EB3233 Bioinformatics Introduction to Bioinformatics.
Pathogenomics How this project began: Ann Rose - take advantage of DNA sequence information - genomics Julian Davies - use the information to understand.
Bioinformatics and Computational Biology
The iPlant Collaborative Vision Enable life science researchers and educators to use and extend cyberinfrastructure.
BINF6201/8201: Molecular Sequence Analysis Dr. Zhengchang Su Office: 351 Bioinformatics Building Office hours: Tuesday and Thursday:
Never-ending stories Kun-Mao Chao ( 趙坤茂 ) Dept. of Computer Science and Information Engineering National Taiwan University, Taiwan
The Theory of Computation Kun-Mao Chao ( 趙坤茂 ) National Taiwan University
Trees Kun-Mao Chao ( 趙坤茂 ) Department of Computer Science and Information Engineering National Taiwan University, Taiwan
NCBI: something old, something new. What is NCBI? Create automated systems for knowledge about molecular biology, biochemistry, and genetics. Perform.
Graduate Research with Bioinformatics Research Mentors Nancy Warter-Perez, ECE Robert Vellanoweth Chem and Biochem Fellow Sean Caonguyen 8/20/08.
Bioinformatics Overview
A Whirlwind Tour of Bioinformatics
Statistical Applications in Biology and Genetics
생물정보학 Bioinformatics.
Algorithms for Biological Sequence Analysis
Mangaldai College, Mangaldai
Genomes and Their Evolution
Bioinformatics: Buzzword or Discipline (???)
Genome organization and Bioinformatics
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
CISC 667 Intro to Bioinformatics (Spring 2007) Review session for Mid-Term CISC667, S07, Lec14, Liao.
Introduction to Bioinformatic
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Trees Kun-Mao Chao (趙坤茂)
Introduction to Bioinformatics
Trees Kun-Mao Chao (趙坤茂)
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Presentation transcript:

Algorithms for Biological Sequence Analysis Kun-Mao Chao ( 趙坤茂 ) Department of Computer Science and Information Engineering National Taiwan University, Taiwan Date: October 2, 2007 WWW:

2 About this course Course: Algorithms for biological sequence analysis We will be focused on the sequence-related algorithmic problems. Genomic sequences are our main target. –The oldest language –The largest program Fall semester, 2007 Tuesday 9:10 – 12:10, 107 CSIE Building. 3 credits Web site:

3 Coursework: Homework assignments and Class participation (15%) Two midterm exams (60%; 30% each): –November 6, 2007 (tentatively) –December 18, 2007 (tentatively) Oral presentation of selected papers (25%)

4 Outlines Part I: Sequence Homology –Introduction to genomes –Dynamic programming strategy revisited –Pairwise sequence alignment –Multiple sequence alignment –Chaining algorithms for genomic sequence analysis –Suboptimal alignment –Comparative genomics –Hidden Markov models (the Viterbi algorithm et al.) Part II: Sequence Composition –Maximum-sum and maximum-density segments –SNP and haplotype data analysis –Genome annotation –Other advanced topics

5 A Brief History of Genetics 1859 Darwin publishes The Origin of Species 1865 Genes are particular factors 1871 Discovery of nucleic acid 1903 Chromosomes are hereditary units 1910 Genes lie on chromosomes 1913 Chromosomes are linear arrays of genes 1931 Recombination occurs by crossing over

6 A Brief History of Genetics (cont’d) 1944 DNA is the genetic material 1945 A gene codes for protein 1951 First protein sequence 1953 DNA is a double helix 1961 Genetic code is triplet 1977 Eukaryotic genes are interrupted 1977 DNA can be sequenced 21th Century: Many genomes completely sequenced

7 Milestones of Bioinformatics 1962 Pauling's theory of molecular evolution 1965 Margaret Dayhoff's Atlas of Protein SequencesMargaret Dayhoff's 1970 Needleman-Wunsch algorithmNeedleman-Wunsch 1977 DNA sequencing and software to analyze it (Staden)Staden 1981 Smith-Waterman algorithm developedSmith-Waterman algorithm 1981 The concept of a sequence motif (Doolittle)Doolittle 1982 GenBank Release 3 made public 1982 Phage lambda genome sequenced

8 Milestones of Bioinformatics (cont’d) 1983 Sequence database searching algorithm (Wilbur- Lipman)Wilbur- Lipman 1985 FASTP/FASTN: fast sequence similarity searchingFASTP/FASTN 1988 National Center for Biotechnology Information (NCBI) created at NIH/NLMNational Center for Biotechnology Information 1988 EMBnet network for database distributionEMBnet network 1990 BLAST: fast sequence similarity searchingBLAST 1991 EST: expressed sequence tag sequencing 1993 Sanger Centre, Hinxton, UKSanger Centre 1994 EMBL European Bioinformatics Institute, Hinxton, UKEMBL European Bioinformatics Institute

9 Milestones of Bioinformatics (cont’d) 1995 First bacterial genomes completely sequencedbacterial genomes 1996 Yeast genome completely sequencedYeast genome completely sequenced 1997 PSI-BLASTPSI-BLAST 1998 Worm (multicellular) genome completely sequencedWorm (multicellular) genome completely sequenced 1999 Fly genome completely sequenced

10 Milestones of Bioinformatics (cont’d) Human Genome Project ( )Human Genome Project Mouse 2002 Rat 2004 Chimpanzee 2005 Completed Genomes

11 Chimpanzee Genome

12 The Primate Family Tree Source: Nature