Biocomputation: Comparative Genomics. Tanya Talkar, Lolly Kruse, Colleen O’Rourke.

Presentation transcript:

Biocomputation: Comparative Genomics Tanya Talkar Lolly Kruse Colleen O’Rourke

DNA

What is Biocomputation? Statistics Computer Science Molecular Biology

Four Main Parts  Biomolecular computation  Biological Computation  Computational Biology  Bioinformatics

Bioinformatics: Biology Computer Science Information Technology

Sequence Analysis  Very Functional!  Compare DNA between Species  Small Fragments  Return full sequence

Computational Genomics  Needleman – Wunsch  Not used much  More Mapped Genomes = Computational Genomics!

Alignment

Global Alignment: Needleman - Wunsch  O(N^3) as originally published; O(N^2) with the standard dynamic programming recurrence  Fewest edit operations  Similar strings
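A minimal sketch of the global alignment recurrence, included to make the "fewest edit operations" bullet concrete. The function name and the scoring values (match, mismatch, gap) are assumptions chosen for illustration, not values from the slides. This version fills the full table in O(N^2) time.

```python
def needleman_wunsch(a, b, match=1, mismatch=-1, gap=-1):
    """Return the best global alignment score of strings a and b.

    Scoring values are illustrative assumptions.
    """
    rows, cols = len(a) + 1, len(b) + 1
    score = [[0] * cols for _ in range(rows)]

    # Aligning a prefix against the empty string costs one gap per symbol.
    for i in range(1, rows):
        score[i][0] = i * gap
    for j in range(1, cols):
        score[0][j] = j * gap

    for i in range(1, rows):
        for j in range(1, cols):
            diag = score[i - 1][j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            up = score[i - 1][j] + gap      # gap in b
            left = score[i][j - 1] + gap    # gap in a
            score[i][j] = max(diag, up, left)

    return score[-1][-1]
```

With match=0, mismatch=-1 and gap=-1, the returned score is the negative of the unit-cost edit distance, which is one way to read "fewest edit operations".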

Local Alignment: Smith - Waterman  O(N^2)  Dissimilar strings  Find high similarity regions
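A minimal sketch of local alignment scoring, showing how the table is clamped at zero so that a high-similarity region can start anywhere in either string. The function name and the scoring values are assumptions for illustration. Only the best score is returned; recovering the aligned region would need a traceback, which is omitted here.

```python
def smith_waterman(a, b, match=2, mismatch=-1, gap=-1):
    """Return the best local alignment score of strings a and b.

    Scoring values are illustrative assumptions.
    """
    best = 0
    prev = [0] * (len(b) + 1)
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # The 0 lets a new local alignment start at any cell.
            curr[j] = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            best = max(best, curr[j])
        prev = curr
    return best
```

Keeping only two rows is enough when just the score is needed; the time cost is still proportional to the product of the two lengths.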

Comparison

S1: PQRAXABCSTVQ  S2: XYAXBACSLT

S1: AXAB_CS  S2: AX_BACS  Score
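The score value on this slide did not survive in the transcript. As a sketch of how such a score is computed, the function below sums per-column scores over two aligned rows, with "_" marking a gap as in the slide. The function name and the match, mismatch and gap values are assumptions, so the result need not equal the number on the original slide.

```python
def score_alignment(row1, row2, match=1, mismatch=-1, gap=-1, gap_char="_"):
    """Sum column scores of two already-aligned rows of equal length.

    Scoring values are illustrative assumptions.
    """
    if len(row1) != len(row2):
        raise ValueError("aligned rows must have equal length")
    total = 0
    for x, y in zip(row1, row2):
        if x == gap_char or y == gap_char:
            total += gap
        elif x == y:
            total += match
        else:
            total += mismatch
    return total


# The aligned rows from the slide: five matching columns and two gap columns.
slide_score = score_alignment("AXAB_CS", "AX_BACS")
```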

Advantages: Global Alignment

Advantages: Local Alignment

BLAST: Basic Local Alignment Search Tool  FASTA

Improvements  Increased Speed  Locate initial alignment hot spots  Statistical significance

Terminology  Segment Pairs  Locally maximal segment pairs  Maximal segment pairs

How it works  Query sequence, P  Database  Segment pairs must have score over a cutoff, C!  Multiple segment pairs combined  Example strings: ABCDEFG AGCBFDE BEDGAFB GFBEDCA

How it works  Extends each hit  Done efficiently  Truncates  Doesn’t find all pairs
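A rough sketch of the "extends each hit" and "truncates" bullets: an ungapped extension in both directions that stops once the running score falls too far below the best score seen so far. The function names, the x_drop parameter and its default are assumptions for illustration; the +2 and -3 defaults follow the match and mismatch values on the Scoring slide. This is not BLAST's actual implementation.

```python
def _extend(q, s, qi, sj, step, match, mismatch, x_drop):
    """Extend in one direction; return (best score gain, length kept)."""
    best = run = 0
    best_len = length = 0
    while 0 <= qi < len(q) and 0 <= sj < len(s):
        run += match if q[qi] == s[sj] else mismatch
        length += 1
        if run > best:
            best, best_len = run, length
        elif best - run > x_drop:
            break  # truncate: score dropped too far below the best seen
        qi += step
        sj += step
    return best, best_len


def extend_hit(q, s, i, j, w, match=2, mismatch=-3, x_drop=5):
    """Extend an exact seed q[i:i+w] == s[j:j+w] without gaps.

    Returns (score, q_start, q_end) for the extended segment of q.
    """
    right, r_len = _extend(q, s, i + w, j + w, 1, match, mismatch, x_drop)
    left, l_len = _extend(q, s, i - 1, j - 1, -1, match, mismatch, x_drop)
    return w * match + left + right, i - l_len, i + w + r_len
```

Because extension stops early, some segment pairs are never examined, which is the trade-off behind "Doesn't find all pairs".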

Proteins  Fixed length, W  Words above threshold  Each hit extended
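A small sketch of the "words above threshold" idea: for one query word of fixed length W, list every word over the alphabet whose score against it reaches a threshold. The names neighborhood_words and toy_score, the tiny alphabet, and the scoring values are all hypothetical; a real search would score with a substitution matrix such as BLOSUM62 over the full amino acid alphabet.

```python
from itertools import product


def toy_score(x, y):
    """Hypothetical stand-in for a substitution matrix lookup."""
    return 4 if x == y else -1


def neighborhood_words(word, alphabet, score, threshold):
    """Return all words of len(word) scoring at least threshold against word."""
    return [
        "".join(candidate)
        for candidate in product(alphabet, repeat=len(word))
        if sum(score(x, y) for x, y in zip(word, candidate)) >= threshold
    ]
```

Enumerating every candidate grows exponentially with W, so this brute-force form is only practical for short words and is meant purely as illustration.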

DNA  Word List  Exact matches  NOT dynamic programming
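A sketch of the word-list step for DNA: index every length-w word of the query, then scan the database sequence for exact matches. No dynamic programming is involved at this stage. The function names and the default word length are assumptions for illustration.

```python
from collections import defaultdict


def build_word_index(query, w):
    """Map each length-w word of the query to its start positions."""
    index = defaultdict(list)
    for i in range(len(query) - w + 1):
        index[query[i:i + w]].append(i)
    return index


def find_hits(query, subject, w=4):
    """Return (query_pos, subject_pos) pairs where length-w words match exactly."""
    index = build_word_index(query, w)
    hits = []
    for j in range(len(subject) - w + 1):
        for i in index.get(subject[j:j + w], ()):
            hits.append((i, j))
    return hits
```

Each hit is only a candidate; whether it becomes a reported alignment depends on what happens when it is extended and scored.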

Scoring  Proteins: BLOSUM62 matrix  DNA: match (+2), mismatch (-3)  Gaps penalized

Substitution Matrix  Represents Scoring Functions
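One way to picture a substitution matrix as a scoring function is a lookup table keyed by symbol pairs. The sketch below builds a DNA table from the match (+2) and mismatch (-3) values on the Scoring slide and uses it to score an ungapped segment pair. The function names are hypothetical, and a protein matrix such as BLOSUM62 would supply a distinct value for each amino acid pair instead of two constants.

```python
def make_dna_matrix(match=2, mismatch=-3, alphabet="ACGT"):
    """Build a pair-keyed score table for a small alphabet."""
    return {
        (x, y): (match if x == y else mismatch)
        for x in alphabet
        for y in alphabet
    }


def score_segment_pair(seg1, seg2, matrix):
    """Score two equal-length, ungapped segments by matrix lookup."""
    if len(seg1) != len(seg2):
        raise ValueError("segments must have equal length")
    return sum(matrix[(x, y)] for x, y in zip(seg1, seg2))
```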

Multiple Sequence Alignment

Methods of MSA  Progressive Alignment Construction  Iterative Methods  Hidden Markov Models  Genetic Algorithms and Simulated Annealing

Comparative Genomics  Compare Species  Find Evolutionary Significance!  Low Level  High Level  Importance of Non-Coding DNA