Secreted Fringe-like Signaling Molecules May Be Glycosyltransferases

Slides:



Advertisements
Similar presentations
Blast to Psi-Blast Blast makes use of Scoring Matrix derived from large number of proteins. What if you want to find homologs based upon a specific gene.
Advertisements

Finding Eukaryotic Open reading frames.
1 Gene Finding Charles Yan. 2 Gene Finding Genomes of many organisms have been sequenced. We need to translate the raw sequences into knowledge. Where.
Expect value Expect value (E-value) Expected number of hits, of equivalent or better score, found by random chance in a database of the size.
Readings for this week Gogarten et al Horizontal gene transfer….. Francke et al. Reconstructing metabolic networks….. Sign up for meeting next week for.
Similar Sequence Similar Function Charles Yan Spring 2006.
Single Motif Charles Yan Spring Single Motif.
Blast heuristics Morten Nielsen Department of Systems Biology, DTU.
Finding prokaryotic genes and non intronic eukaryotic genes
Protein Sequence Alignment and Database Searching.
Sequence Alignment Techniques. In this presentation…… Part 1 – Searching for Sequence Similarity Part 2 – Multiple Sequence Alignment.
Identification of Protein Domains. Orthologs and Paralogs Describing evolutionary relationships among genes (proteins): Two major ways of creating homologous.
Sequence analysis: Macromolecular motif recognition Sylvia Nagl.
Multiple Alignment and Phylogenetic Trees Csc 487/687 Computing for Bioinformatics.
1 The Interrupted Gene. Ex Biochem c3-interrupted gene Introduction Figure 3.1.
CISC667, F05, Lec9, Liao CISC 667 Intro to Bioinformatics (Fall 2005) Sequence Database search Heuristic algorithms –FASTA –BLAST –PSI-BLAST.
You have worked for 2 years to isolate a gene involved in axon guidance. You sequence the cDNA clone that contains axon guidance activity. What do you.
Fea- ture Num- ber Feature NameFeature description 1 Average number of exons Average number of exons in the transcripts of a gene where indel is located.
PROTEIN PATTERN DATABASES. PROTEIN SEQUENCES SUPERFAMILY FAMILY DOMAIN MOTIF SITE RESIDUE.
Chapter 3 The Interrupted Gene.
Step 3: Tools Database Searching
The statistics of pairwise alignment BMI/CS 576 Colin Dewey Fall 2015.
Finding genes in the genome
Substitution Matrices and Alignment Statistics BMI/CS 776 Mark Craven February 2002.
Genetic Code and Interrupted Gene Chapter 4. Genetic Code and Interrupted Gene Aala A. Abulfaraj.
Protein Families, Motifs & Domains.
Basics of Comparative Genomics
Volume 88, Issue 5, Pages (March 1997)
Volume 9, Issue 10, Pages (October 2016)
Volume 10, Issue 1, Pages (January 2000)
Mark M Metzstein, H.Robert Horvitz  Molecular Cell 
There are four levels of structure in proteins
Chapter 4 The Interrupted Gene.
Prediction of protein structure
The future of protein secondary structure prediction accuracy
Phosphopeptides identified harboring minimal binding motifs
When Monomers Are Preferred: A Strategy for the Identification and Disruption of Weakly Oligomerized Proteins  Yufeng Tong, David Hughes, Lisa Placanica,
Mutations in a Novel Gene with Transmembrane Domains Underlie Usher Syndrome Type 3  Tarja Joensuu, Riikka Hämäläinen, Bo Yuan, Cheryl Johnson, Saara.
Volume 88, Issue 5, Pages (March 1997)
The ATRs, ATMs, and TORs Are Giant HEAT Repeat Proteins
Edwards Allen, Zhixin Xie, Adam M. Gustafson, James C. Carrington  Cell 
Warthog and Groundhog, novel families related to Hedgehog
The C. elegans SYS-1 Protein Is a Bona Fide β-Catenin
Structure of CheA, a Signal-Transducing Histidine Kinase
Pan-Cancer Analysis of Mutation Hotspots in Protein Domains
Hongwei Wu, Mark W. Maciejewski, Sachiko Takebe, Stephen M. King 
Luis Sanchez-Pulido, John F.X. Diffley, Chris P. Ponting 
Volume 133, Issue 7, Pages (June 2008)
Human Elastase 1: Evidence for Expression in the Skin and the Identification of a Frequent Frameshift Polymorphism  Ulvi Talas, John Dunlop, Sahera Khalaf,
Exon Junction Sequences as Cryptic Splice Sites
Basics of Comparative Genomics
L. Aravind, Eugene V. Koonin  Current Biology 
The Arabidopsis NPR1 Gene That Controls Systemic Acquired Resistance Encodes a Novel Protein Containing Ankyrin Repeats  Hui Cao, Jane Glazebrook, Joseph.
Structure of the BRCT Repeats of BRCA1 Bound to a BACH1 Phosphopeptide
Characterization and Mutation Analysis of Human LEFTY A and LEFTY B, Homologues of Murine Genes Implicated in Left-Right Axis Development  K. Kosaki,
Basic Local Alignment Search Tool
Identification of the GCS1 ortholog in Gonium pectorale.
Correction of Gene Annotation and Other Sequence Analysis of Arabidopsis Proteins Identified on the 2-D Gels in Figure 1.(A) Spot 71, identified as a 17.2-kD.
Hartmut Scheel, Kay Hofmann  Current Biology 
Shifty Ciliates  Lawrence A. Klobutcher, Philip J. Farabaugh  Cell 
Solution Structure of the Proapoptotic Molecule BID
L. Aravind, Eugene V. Koonin  Current Biology 
Phosphopeptides identified harboring minimal binding motifs
Structure of the Histone Acetyltransferase Hat1
Origins and Impacts of New Mammalian Exons
Comparative Genomic Analysis Identifies an ADP-Ribosylation Factor–like Gene as the Cause of Bardet-Biedl Syndrome (BBS3)  Annie P. Chiang, Darryl Nishimura,
Evolutionary Fates and Origins of U12-Type Introns
Correspondence Current Biology
Andrey V Kajava, Gilbert Vassart, Shoshana J Wodak  Structure 
Volume 97, Issue 6, Pages (June 1999)
Presentation transcript:

Secreted Fringe-like Signaling Molecules May Be Glycosyltransferases Yan P. Yuan, Jörg Schultz, Marek Mlodzik, Peer Bork  Cell  Volume 88, Issue 1, Pages 9-11 (January 1997) DOI: 10.1016/S0092-8674(00)81852-8 Copyright © 1997 Cell Press Terms and Conditions

Figure 1 Alignment of Five Regions Conserved in the FNG, BRN and Lex1 Subfamilies First four columns: gene names, species, accession numbers, position of the motifs in the respective sequence. The numbers in angle brackets between the 5 aligned motifs indicate the number of residues that are not displayed. Amino acids conserved in at least 60% of the sequences are highlighted; hydrophobic residues conserved in at least 85% of the sequences are given in bold. The consensus line summarises residue properties conserved in at least 75% of all sequences (t - turn-like or polar, h - hydrophobic, s - small, a - aromatic). Secondary structures predicted (Rost et al. 1994) independently for each of the three subfamilies (E/e - strand, H/h - helix with 81%/72% expected accuracy) are given beneath the alignment. A region of 44 residues has been removed from the C. elegans U64858 sequence as it is very likely an artificially translated intron because of the presence of splice sites and similarity to an intronic region in Z35641. U64858 has been elongated due to exons detected upstream. The N-terminal regions of the FNG and BRN subfamilies that preceed the alignment with the Lex1 family are divergent and have been proposed to be a propeptide for the fringe subfamily (Wu et al. 1996). All eukaryotic, but also some prokaryotic sequences contain leader peptides. Starting point for the iterative database searches (for details of the strategy seeBork and Gibson 1996) was FNG. Blastp searches (Altschul et al. 1994) revealed two C. elegans homologues with blastp values p < 10−4; a multiple alignment generated by MACAW (Schuler et al. 1991) identifies several significantly similar regions with p < 10−50. The two most conserved regions (motif 3 and 4) of the extended FNG family were then subjected to iterative MoST searches (Tatusov et al. 1994). Using the restrictive ratio of observed versus expected hits r = 0.005, motif 3 matched only BRN with p = 0.022 (as derived from the expected number of false positives E = 0.023 calculated using the MoST program). The majority of the Lex1 subfamily has been identifed using motif 4 (p values down to 0.002 using the threshold r = 0.05) without false positives. Members of all three subfamilies were then aligned using MACAW (Schuler et al. 1991) which reidentified the conserved motifs 3 and 4 with significant p values (<10−50). All other motifs have significant MACAW p values for more than one subfamily. This was confirmed by Blastp and MoST searches with all motifs excluding motif 3. Nevertheless, support for correct alignment of motif 1 of the Lex1 family and motif 2 of the BRN comes mostly from the secondary structure context. Numerous ESTs (e.g. hL1, hB1, hB2, hF3) were detected by tblastn (Altschul et al. 1994) searches suggesting a high expression rate. Each individual EST was aligned with the protein families using a dynamic programming algorythm as implemented in PAIRWISE (Birney et al. 1996). Overlapping ESTs were merged. Four Arabidopsis thaliana ESTs were identified as belonging to the superfamily (H36894, H37262, Z37212, Z34632). In addition to the hF1 and hF2 ESTs found by (Wu et al. 1996), N47633, R65917, Z45166, and the merged R65818/H87611/N51686/H55274 appear to be independent fragments similar to distinct members of the Fringe subfamily (probability for chance matches p < 10−12). The ESTs provided additional support for the homology between the subfamilies. For example, the translated human EST N47633 matches members of the Fringe and the Lex1 subfamily with blastp p values of 1.7 × 10−31 and 2.5 × 10−10, respectively (using the PAM150 amino acid similarity matrix). Cell 1997 88, 9-11DOI: (10.1016/S0092-8674(00)81852-8) Copyright © 1997 Cell Press Terms and Conditions