Reading Report Ce WANG A segment alignment approach to protein comparison.

Slides:



Advertisements
Similar presentations
Segment Alignment (SEA) Yuzhen Ye Adam Godzik The Burnham Institute.
Advertisements

Global Sequence Alignment by Dynamic Programming.
DYNAMIC PROGRAMMING ALGORITHMS VINAY ABHISHEK MANCHIRAJU.
Protein Structure Prediction using ROSETTA
CMSC 150 RECURSION CS 150: Mon 26 Mar Motivation : Bioinformatics Example  A G A C T A G T T A C  C G A G A C G T  Want to compare sequences.
Chapter 7 Dynamic Programming.
Graduate Center/City University of New York University of Helsinki FINDING OPTIMAL BAYESIAN NETWORK STRUCTURES WITH CONSTRAINTS LEARNED FROM DATA Xiannian.
Sabegh Singh Virdi ASC Processor Group Computer Science Department
Gene Prediction: Similarity-Based Approaches (selected from Jones/Pevzner lecture notes)
Structural bioinformatics
Introduction to Bioinformatics Burkhard Morgenstern Institute of Microbiology and Genetics Department of Bioinformatics Goldschmidtstr. 1 Göttingen, March.
Slide 1 EE3J2 Data Mining EE3J2 Data Mining Lecture 14: Introduction to Hidden Markov Models Martin Russell.
Developing Pairwise Sequence Alignment Algorithms Dr. Nancy Warter-Perez.
Developing Pairwise Sequence Alignment Algorithms Dr. Nancy Warter-Perez June 23, 2005.
Alignment methods and database searching April 14, 2005 Quiz#1 today Learning objectives- Finish Dotter Program analysis. Understand how to use the program.
Sequence Alignment Bioinformatics. Sequence Comparison Problem: Given two sequences S & T, are S and T similar? Need to establish some notion of similarity.
Developing Pairwise Sequence Alignment Algorithms Dr. Nancy Warter-Perez June 23, 2004.
Sequence similarity.
7 -1 Chapter 7 Dynamic Programming Fibonacci Sequence Fibonacci sequence: 0, 1, 1, 2, 3, 5, 8, 13, 21, … F i = i if i  1 F i = F i-1 + F i-2 if.
Dynamic Programming and Biological Sequence Comparison Part I.
Pairwise Alignment Global & local alignment Anders Gorm Pedersen Molecular Evolution Group Center for Biological Sequence Analysis.
Developing Pairwise Sequence Alignment Algorithms Dr. Nancy Warter-Perez May 20, 2003.
Structure Alignment in Polynomial Time Rachel Kolodny Stanford University Nati Linial The Hebrew University of Jerusalem.
Bioinformatics Unit 1: Data Bases and Alignments Lecture 3: “Homology” Searches and Sequence Alignments (cont.) The Mechanics of Alignments.
Dynamic Programming. Pairwise Alignment Needleman - Wunsch Global Alignment Smith - Waterman Local Alignment.
Developing Pairwise Sequence Alignment Algorithms Dr. Nancy Warter-Perez May 10, 2005.
LCS and Extensions to Global and Local Alignment Dr. Nancy Warter-Perez June 26, 2003.
Developing Pairwise Sequence Alignment Algorithms
Pair-wise Sequence Alignment What happened to the sequences of similar genes? random mutation deletion, insertion Seq. 1: 515 EVIRMQDNNPFSFQSDVYSYG EVI.
Sequence Analysis Alignments dot-plots scoring scheme Substitution matrices Search algorithms (BLAST)
1 A Combinatorial Toolbox for Protein Sequence Design and Landscape Analysis in the Grand Canonical Model Ming-Yang Kao Department of Computer Science.
Structural alignment Protein structure Every protein is defined by a unique sequence (primary structure) that folds into a unique.
Space-Efficient Sequence Alignment Space-Efficient Sequence Alignment Bioinformatics 202 University of California, San Diego Lecture Notes No. 7 Dr. Pavel.
The dynamic nature of the proteome
Protein Sequence Alignment and Database Searching.
1 Generalized Tree Alignment: The Deferred Path Heuristic Stinus Lindgreen
QNET: A tool for querying protein interaction networks Banu Dost +, Tomer Shlomi*, Nitin Gupta +, Eytan Ruppin*, Vineet Bafna +, Roded Sharan* + University.
Multiple Alignment and Phylogenetic Trees Csc 487/687 Computing for Bioinformatics.
Pairwise Sequence Alignment BMI/CS 776 Mark Craven January 2002.
Sequence Analysis CSC 487/687 Introduction to computing for Bioinformatics.
MicroRNA identification based on sequence and structure alignment Presented by - Neeta Jain Xiaowo Wang†, Jing Zhang†, Fei Li, Jin Gu, Tao He, Xuegong.
Chapter 3 Computational Molecular Biology Michael Smith
Gene Prediction: Similarity-Based Methods (Lecture for CS498-CXZ Algorithms in Bioinformatics) Sept. 15, 2005 ChengXiang Zhai Department of Computer Science.
1 A Min-Cost Flow Based Detailed Router for FPGAs Seokjin Lee *, Yongseok Cheon *, D. F. Wong + * The University of Texas at Austin + University of Illinois.
Graph-based Deformable Matching of 3D Line Segments with Application in Protein Fitting 12 1 HANG DOU 1, MATTHEW L BAKER 2, TAO JU Washington University.
Sequence Alignments with Indels Evolution produces insertions and deletions (indels) – In addition to substitutions Good example: MHHNALQRRTVWVNAY MHHALQRRTVWVNAY-
Dynamic Programming: Edit Distance
Pairwise sequence alignment Lecture 02. Overview  Sequence comparison lies at the heart of bioinformatics analysis.  It is the first step towards structural.
DNA, RNA and protein are an alien language
Lecture 11 CS5661 Structural Bioinformatics – Structure Comparison Motivation Concepts Structure Comparison.
GA for Sequence Alignment  Pair-wise alignment  Multiple string alignment.
Protein Structure Prediction: Threading and Rosetta BMI/CS 576 Colin Dewey Fall 2008.
An Improved Search Algorithm for Optimal Multiple-Sequence Alignment Paper by: Stefan Schroedl Presentation by: Bryan Franklin.
DNA SEQUENCE ALIGNMENT FOR PROTEIN SIMILARITY ANALYSIS CARL EBERLE, DANIEL MARTINEZ, MENGDI TAO.
9/6/07BCB 444/544 F07 ISU Dobbs - Lab 3 - BLAST1 BCB 444/544 Lab 3 BLAST Scoring Matrices & Alignment Statistics Sept6.
Piecewise linear gap alignment.
Generating, Maintaining, and Exploiting Diversity in a Memetic Algorithm for Protein Structure Prediction Mario Garza-Fabre, Shaun M. Kandathil, Julia.
The ideal approach is simultaneous alignment and tree estimation.
Automation System For Checking Protein Prediction
Algebra II Explorations Review ( )
Design and Analysis of Algorithms (07 Credits / 4 hours per week)
Sequence Alignment Using Dynamic Programming
Intro to Alignment Algorithms: Global and Local
Lecture 19-Problem Solving 4 Incremental Method
FRONT No Solutions Infinite Solutions 1 solution (x, y)
Problem Solving 4.
Protein structure prediction.
Optimal Point Movement for Covering Circular Regions
Sequence Analysis Alan Christoffels
Design and Analysis of Algorithms (04 Credits / 4 hours per week)
Presentation transcript:

Reading Report Ce WANG A segment alignment approach to protein comparison

Agenda Motivation Previous works SEgment Alignment algorithm (SEA) Results and Discussion Answer Questions

Motivation Local structure segments (LSSs) Predicted LSSs (PLSSs) predicted or real LSSs are rarely exploited by protein sequence comparison programs that are based on position-by-position alignments.

Previous Works Nearest-neighbor methods which typically produce a list of Predicted Local Structure Segments (PLSSs) for a given protein (Fig. 1, Rychlewski and Godzik, 1997; Yi and Lander, 1993; Bystroff and Baker, 1998). ambiguous

Previous Works single position secondary structures averaged over the segments (Rychlewski and Godzik, 1997; Yi and Lander, 1993). Baker and colleagues (Bystroff and Baker, 1998) who further combined the predicted segments for a compact tertiary structure in their de novo protein structure prediction program ROSETTA (Simons et al., 1999).

Previous Works most protein comparison methods are firmly based on the concept of residue-level alignments (Waterman, 1995) similar proteins similar proteins

SEgment Alignment (SEA) compare proteins described as a collection of predicted local structure segments (PLSSs), which is equivalent to an unweighted graph (network). Any specific structure, real or predicted corresponds to a specific path in this network. compare proteins described as a collection of predicted local structure segments (PLSSs), which is equivalent to an unweighted graph (network). Any specific structure, real or predicted corresponds to a specific path in this network. SEA then uses a network matching approach to find two most similar paths in networks representing two proteins. SEA then uses a network matching approach to find two most similar paths in networks representing two proteins.

Advantage SEA explores the of predicted local structure information to search for a globally optimal solution. It simultaneously solves two related problems: SEA explores the uncertainty and diversity of predicted local structure information to search for a globally optimal solution. It simultaneously solves two related problems: the alignment of two proteins and the local structure prediction for each of them.

SEA FORMULATION network matching problem that can be solved by dynamic programming in polynomial time.

SEA We define V(i, j ) as the maximum similarity score for transforming S1[1... i] to S2[1... j ], calculated by V(i, j ) = max all(α,β)combinations, α ∈ E(i ), β ∈ E( j ) V(i α, j β )

substitution, deletion and insertion

IMPLEMENTATION The prediction and representation of local structures Scoring scheme (i α, j β ) = W a × (Aa i, Aa j ) + W s × (α, β)

Fig. 3. Comparison of the alignments between λ-repressor from E.coli (1lliA) and 434 repressor (1r69) by CE (top) and SEA (bottom).

IMPLEMENTATION The measures of alignment accuracy The benchmark for SEA validation

RESULTS AND DISCUSSION The general performance of SEA on the benchmark Prediction ambiguity improves alignment quality Alignment quality versus local structure prediction ambiguity

CONCLUSION

Any Questions?

Thanks!