Sense-Antisense Proteins Vision Lab Presentation Ruchir Shah April 16, 2003.

Slides:



Advertisements
Similar presentations
Fa07CSE 182 CSE182-L4: Database filtering. Fa07CSE 182 Summary (through lecture 3) A2 is online We considered the basics of sequence alignment –Opt score.
Advertisements

Blast outputoutput. How to measure the similarity between two sequences Q: which one is a better match to the query ? Query: M A T W L Seq_A: M A T P.
1 Introduction to Sequence Analysis Utah State University – Spring 2012 STAT 5570: Statistical Bioinformatics Notes 6.1.
Bioinformatics Tutorial I BLAST and Sequence Alignment.
Gapped Blast and PSI BLAST Basic Local Alignment Search Tool ~Sean Boyle Basic Local Alignment Search Tool ~Sean Boyle.
Measuring the degree of similarity: PAM and blosum Matrix
Bioinformatics Unit 1: Data Bases and Alignments Lecture 2: “Homology” Searches and Sequence Alignments.
Introduction to Bioinformatics
Sequence Similarity Searching Class 4 March 2010.
“Nothing in Biology makes sense except in the light of evolution” (Theodosius Dobzhansky ( )) “Nothing in bioinformatics makes sense except in.
Optimatization of a New Score Function for the Detection of Remote Homologs Kann et al.
Expect value Expect value (E-value) Expected number of hits, of equivalent or better score, found by random chance in a database of the size.
Sequence similarity (II). Schedule Mar 23midterm assignedalignment Mar 30midterm dueprot struct/drugs April 6teams assignedprot struct/drugs April 13RNA.
. Sequence Alignment via HMM Background Readings: chapters 3.4, 3.5, 4, in the Durbin et al.
Introduction to Bioinformatics Algorithms Sequence Alignment.
We continue where we stopped last week: FASTA – BLAST
Tutorial 2: Some problems in bioinformatics 1. Alignment pairs of sequences Database searching for sequences Multiple sequence alignment Protein classification.
Sequence Alignment III CIS 667 February 10, 2004.
Introduction to Bioinformatics Algorithms Sequence Alignment.
1-month Practical Course Genome Analysis Lecture 3: Residue exchange matrices Centre for Integrative Bioinformatics VU (IBIVU) Vrije Universiteit Amsterdam.
Bioinformatics Unit 1: Data Bases and Alignments Lecture 3: “Homology” Searches and Sequence Alignments (cont.) The Mechanics of Alignments.
Alignment IV BLOSUM Matrices. 2 BLOSUM matrices Blocks Substitution Matrix. Scores for each position are obtained frequencies of substitutions in blocks.
Incorporating Bioinformatics in an Algorithms Course Lawrence D’Antonio Ramapo College of New Jersey.
Blast heuristics Morten Nielsen Department of Systems Biology, DTU.
Arabidopsis Gene Project GK-12 April Workshop Karolyn Giang and Dr. Mulligan.
Information theoretic interpretation of PAM matrices Sorin Istrail and Derek Aguiar.
TM Biological Sequence Comparison / Database Homology Searching Aoife McLysaght Summer Intern, Compaq Computer Corporation Ballybrit Business Park, Galway,
Alignment Statistics and Substitution Matrices BMI/CS 576 Colin Dewey Fall 2010.
Pairwise Alignment How do we tell whether two sequences are similar? BIO520 BioinformaticsJim Lund Assigned reading: Ch , Ch 5.1, get what you can.
Inferring function by homology The fact that functionally important aspects of sequences are conserved across evolutionary time allows us to find, by homology.
Pair-wise Sequence Alignment What happened to the sequences of similar genes? random mutation deletion, insertion Seq. 1: 515 EVIRMQDNNPFSFQSDVYSYG EVI.
An Introduction to Bioinformatics
Protein Sequence Alignment and Database Searching.
CISC667, S07, Lec5, Liao CISC 667 Intro to Bioinformatics (Spring 2007) Pairwise sequence alignment Needleman-Wunsch (global alignment)
Content of the previous class Introduction The evolutionary basis of sequence alignment The Modular Nature of proteins.
Pairwise Sequence Alignment. The most important class of bioinformatics tools – pairwise alignment of DNA and protein seqs. alignment 1alignment 2 Seq.
Eric C. Rouchka, University of Louisville Sequence Database Searching Eric Rouchka, D.Sc. Bioinformatics Journal Club October.
Pairwise alignment of DNA/protein sequences I519 Introduction to Bioinformatics, Fall 2012.
Scoring Matrices April 23, 2009 Learning objectives- 1) Last word on Global Alignment 2) Understand how the Smith-Waterman algorithm can be applied to.
Identification of Cancer-Specific Motifs in
Bacterial Genetics - Assignment and Genomics Exercise: Aims –To provide an overview of the development and.
Lecture 6. Pairwise Local Alignment and Database Search Csc 487/687 Computing for bioinformatics.
CISC667, F05, Lec9, Liao CISC 667 Intro to Bioinformatics (Fall 2005) Sequence Database search Heuristic algorithms –FASTA –BLAST –PSI-BLAST.
Comp. Genomics Recitation 3 The statistics of database searching.
Construction of Substitution Matrices
Sequence Alignment Csc 487/687 Computing for bioinformatics.
Basic terms:  Similarity - measurable quantity. Similarity- applied to proteins using concept of conservative substitutions Similarity- applied to proteins.
BLAST: Basic Local Alignment Search Tool Altschul et al. J. Mol Bio CS 466 Saurabh Sinha.
Research Problem Integrate Core Concepts and throughout the Biology Curriculum (Vision and Change, AAAS, 2009) Conceptual learning is is an understanding.
Basic Local Alignment Search Tool BLAST Why Use BLAST?
Biocomputation: Comparative Genomics Tanya Talkar Lolly Kruse Colleen O’Rourke.
Pairwise Sequence Alignment Part 2. Outline Summary Local and Global alignments FASTA and BLAST algorithms Evaluating significance of alignments Alignment.
Sequence Alignment.
Construction of Substitution matrices
Doug Raiford Phage class: introduction to sequence databases.
The statistics of pairwise alignment BMI/CS 576 Colin Dewey Fall 2015.
More on HMMs and Multiple Sequence Alignment BMI/CS 776 Mark Craven March 2002.
Techniques for Protein Sequence Alignment and Database Searching G P S Raghava Scientist & Head Bioinformatics Centre, Institute of Microbial Technology,
Substitution Matrices and Alignment Statistics BMI/CS 776 Mark Craven February 2002.
9/6/07BCB 444/544 F07 ISU Dobbs - Lab 3 - BLAST1 BCB 444/544 Lab 3 BLAST Scoring Matrices & Alignment Statistics Sept6.
Basics of BLAST Basic BLAST Search - What is BLAST?
Biology 162 Computational Genetics Todd Vision Fall Aug 2004
BLAST.
Basic Local Alignment Search Tool
Pedir alineamiento múltiple
Alignment IV BLOSUM Matrices
Basic Local Alignment Search Tool
Sequence alignment, E-value & Extreme value distribution
Patterns of amino acid usage and its GC-content of synonymous codons in 65 nuclear genomes in this study. Patterns of amino acid usage and its GC-content.
1-month Practical Course Genome Analysis Iterative homology searching
Presentation transcript:

Sense-Antisense Proteins Vision Lab Presentation Ruchir Shah April 16, 2003

* Peptides generated from sense and antisense DNA strands have ‘inverted hydropathies’. Although it makes no sense, it is hypothesized that S- and AS- peptides could have a high binding affinity for each other. Sense-Antisense Proteins Picture adapted from: J.R.Heal et al; ChemBioChem 2002,3,

S-AS Codon Table

Inverted Hydropathy Blue=Non Polar Pink=Polar Picture adapted from: J.R.Heal et al; ChemBioChem 2002,3,

S-AS Codons Degeneracy: One sense AA can have more than One antisense AA. Hydropathy: Sense & antisense AA’s have inverted hydropathy. Codon biases/codon frequencies? Sense proteins interact with Antisense proteins: Numerous experimental evidences suggest that Sense and AS peptide have specific binding Affinity.

Experimental evidences Picture adapted from: J.R.Heal et al; ChemBioChem 2002,3,

How do S-AS Amino Acids interact? Picture adapted from: J.R.Heal et al; ChemBioChem 2002,3,

Molecular Recognition Theory Picture adapted from: J.R.Heal et al; ChemBioChem 2002,3,

Tasks Literature says: –S-AS proteins exist –S-AS proteins interact specifically with each other! Tasks: –Look for S-AS protein pairs(how such many pairs exist?) –What are the biological implications? –Do they really interact?

How to find S-AS pairs from Sequence Db? Conventional Sequence identity tools can be used to find out ‘similar’ proteins. Example: Blast or Smith Waterman with a choice of substitution matrix Positive score for Identity or desirable substitutions. Negative score for undesirable substitutions.

BLOSUM 62 Source:

Design of a new substitution matrix To find S-AS pairs using existing sequence identity tools I need a special matrix. New matrix should: - positively score S-AS pairs - negatively score other pairs - reflect the degeneracy of genetic code - average score should be negative to avoid false positives!!

S-AS Codon Table

Results: What does it look like? It works!!

Results: contd.. Low complexity regions!

Lots of ‘small’ hits(lessons learnt!) “get rid of noise/background” “get rid of Low complexity regions” “use a better matrix”

Design of a new substitution matrix New matrix should: - positively score S-AS pairs - negatively score other pairs - reflect the degeneracy of genetic code -take into account the codon biases

S-AS Codon Table Source: SGD(Stanford) Saccharomyces Genome Database

1.Low complexity filter : SEG 2.More meaningful Matrix: Formula for new scoring scheme

Flow Chart Sequence database (Yeast) ~6000prtns Run Smith Waterman All against All With new matrix Look for ‘hits’ Compare it with Interaction data

Tasks Look for sense-antisense protein pairs in protein sequence databases. List all sense-antisense pairs Compare the list with List of interacting proteins. Example: Sense-Antisense pairs Database of Interacting Prtns P5-P99 P2-P102 P104-P4 P1-P101 P2-P102 P3-P103 P4-P104

Tasks Look for sense-antisense protein pairs in protein sequence databases. List all sense-antisense pairs Compare the list with List of interacting proteins. Example: Sense-Antisense pairs Database of Interacting Prtns P5-P99 P2-P102 P104-P4 P1-P101 P2-P102 P3-P103 P4-P104

DIP : Database of Interacting Proteins SS=small scale experiment HT=high throughput exp. Purple=overlap Bars= more than 1 exp. Proteins = 4727 Interactions= 15174

Work in Progress Statistics of alignment: Distinguish random from meaningful hits! Relative entropy of the matrix Gap Penalties

Acknowledgments Todd Vision (Biology) Alex Tropsha (Pharmacy) Dr. Falk (Nephrology) All of my lab mates.