Global Annotation of the Protein Kinase Family Michael Gribskov University of California, San Diego.

Slides:



Advertisements
Similar presentations
Secondary structure prediction from amino acid sequence.
Advertisements

1 Orthologs: Two genes, each from a different species, that descended from a single common ancestral gene Paralogs: Two or more genes, often thought of.
Using phylogenetic profiles to predict protein function and localization As discussed by Catherine Grasso.
Basics of Comparative Genomics Dr G. P. S. Raghava.
Comparative genomics Joachim Bargsten February 2012.
Molecular Evolution Revised 29/12/06
Structural bioinformatics
1 Multiple sequence alignment Lesson 4. 2 VTISCTGSSSNIGAG-NHVKWYQQLPG VTISCTGTSSNIGS--ITVNWYQQLPG LRLSCSSSGFIFSS--YAMYWVRQAPG LSLTCTVSGTSFDD--YYSTWVRQPPG.
Bioinformatics and Phylogenetic Analysis
Sequence Comparison Intragenic - self to self. -find internal repeating units. Intergenic -compare two different sequences. Dotplot - visual alignment.
Protein Modules An Introduction to Bioinformatics.
Tutorial 2: Some problems in bioinformatics 1. Alignment pairs of sequences Database searching for sequences Multiple sequence alignment Protein classification.
Identifying Functional signatures in Proteins - a computational design approach David Bernick Rohl group 16-Mar-2005.
Genomics and bioinformatics summary 1. Gene finding: computer searches, cDNAs, ESTs, 2.Microarrays 3.Use BLAST to find homologous sequences 4.Multiple.
Signaling Pathways and Summary June 30, 2005 Signaling lecture Course summary Tomorrow Next Week Friday, 7/8/05 Morning presentation of writing assignments.
Marcotte EM, Pellegrini M, Ng HL, Rice DW, Yeates TO, Eisenberg D. (1999). Detecting protein function and protein-protein interactions from genome sequences.
EVOLUTIONARY AND COMPUTATIONAL GENOMICS Shin-Han Shiu Plant Biology / CMB / EEBB / Genetics / QBMI.
Protein Classification A comparison of function inference techniques.
MCB 317 Genetics and Genomics MCB 317 Topic 10, part 3 A Story of Transcription.
Sequence comparison: Local alignment
Genome Evolution: Duplication (Paralogs) & Degradation (Pseudogenes)
TGCAAACTCAAACTCTTTTGTTGTTCTTACTGTATCATTGCCCAGAATAT TCTGCCTGTCTTTAGAGGCTAATACATTGATTAGTGAATTCCAATGGGCA GAATCGTGATGCATTAAAGAGATGCTAATATTTTCACTGCTCCTCAATTT.
Multiple Sequence Alignment CSC391/691 Bioinformatics Spring 2004 Fetrow/Burg/Miller (Slides by J. Burg)
© Wiley Publishing All Rights Reserved.
Sequence Analysis Alignments dot-plots scoring scheme Substitution matrices Search algorithms (BLAST)
Pairwise & Multiple sequence alignments
Space-Efficient Sequence Alignment Space-Efficient Sequence Alignment Bioinformatics 202 University of California, San Diego Lecture Notes No. 7 Dr. Pavel.
Protein Bioinformatics Course
Functional Linkages between Proteins. Introduction Piles of Information Flakes of Knowledge AGCATCCGACTAGCATCAGCTAGCAGCAGA CTCACGATGTGACTGCATGCGTCATTATCTA.
Protein Evolution and Sequence Analysis Protein Evolution and Sequence Analysis.
Bikash Shakya Emma Lang Jorge Diaz.  BLASTx entire sequence against 9 plant genomes. RepeatMasker  55.47% repetitive sequences  82.5% retroelements.
HOGENOM a phylogenomic database
발표자 석사 2 년 김태형 Vol. 11, Issue 3, , March 2001 Comparative DNA Sequence Analysis of Mouse and Human Protocadherin Gene Clusters 인간과 마우스의 PCDH 유전자.
Eric C. Rouchka, University of Louisville SATCHMO: sequence alignment and tree construction using hidden Markov models Edgar, R.C. and Sjolander, K. Bioinformatics.
Bioinformatics 2011 Molecular Evolution Revised 29/12/06.
Lecture 6. Pairwise Local Alignment and Database Search Csc 487/687 Computing for bioinformatics.
ANALYSIS AND VISUALIZATION OF SINGLE COPY ORTHOLOGS IN ARABIDOPSIS, LETTUCE, SUNFLOWER AND OTHER PLANT SPECIES. Alexander Kozik and Richard W. Michelmore.
You have worked for 2 years to isolate a gene involved in axon guidance. You sequence the cDNA clone that contains axon guidance activity. What do you.
Monday, November 8, 2:30:07 PM  Ontology is the philosophical study of the nature of being, existence or reality as such, as well as the basic categories.
HMMs for alignments & Sequence pattern discovery I519 Introduction to Bioinformatics.
Basic terms:  Similarity - measurable quantity. Similarity- applied to proteins using concept of conservative substitutions Similarity- applied to proteins.
Protein and RNA Families
Identification of Ortholog Groups by OrthoMCL Protein sequences from organisms of interest All-against-all BLASTP Between Species: Reciprocal best similarity.
Applied Bioinformatics Week 3. Theory I Similarity Dot plot.
A Global View of the Protein Structure Universe and Protein Evolution Sung-Hou Kim University of California, Berkeley, CA U.S.A. June 27, 2006.
Nothing in (computational) biology makes sense except in the light of evolution after Theodosius Dobzhansky (1970) Comparative genomics, genome context.
Evolution of the Protein Kinase Family Michael Gribskov DEPARTMENT OF BIOLOGICAL SCIENCES.
Large-scale Prediction of Yeast Gene Function Introduction to Bio-Informatics Winter Roi Adadi Naama Kraus
Shortest Path Analysis and 2nd-Order Analysis Ming-Chih Kao U of M Medical School
Gene Family Size Distributions Brought to You By Your Neighorhood Durand Lab Narayanan Raghupathy Nan Song Rose Hoberman.
You have worked for 2 years to isolate a gene involved in axon guidance. You sequence the cDNA clone that contains axon guidance activity. The sequence.
MGM workshop. 19 Oct 2010 Some frequently-used Bioinformatics Tools Konstantinos Mavrommatis Prokaryotic Superprogram.
Finding Motifs Vasileios Hatzivassiloglou University of Texas at Dallas.
HomologyIf twp proteins are homologous, they have a common fold and a common ancestor If two proteins have >25% identity across their entire length, they.
Sequence Similarity The bioinformatics for molecular biologists lecture series.
Shin-Han Shiu and Melissa D. Lehti-Shiu Department of Plant Biology
PINALOG Protein Interaction Network Alignment and its implication in function prediction and complex detection Hang Phan Prof. Michael J.E. Sternberg.
Sequence similarity, BLAST alignments & multiple sequence alignments
Demo: Protein Information Resource
Basics of Comparative Genomics
Sequence based searches:
Sequence comparison: Local alignment
Genome Annotation Continued
Protein Bioinformatics Course
Dr Tan Tin Wee Director Bioinformatics Centre
Identify D. melanogaster ortholog
PANTHER (Protein Analysis Through Evolutionary Relationships): Trees, Hidden Markov Models, Biological Annotations Paul Thomas, Ph.D. Division of Bioinformatics.
Protein structure prediction.
Pairwise Sequence Alignment
Basics of Comparative Genomics
Presentation transcript:

Global Annotation of the Protein Kinase Family Michael Gribskov University of California, San Diego

Signaling Cascades

Statistics Arabidopsis 1028 putative kinase 58 Potentially alternatively spliced 82 % confirmed by full length cDNA Less than 100 experimentally investigated Rice 1565 putative kinases What are the functions of each protein kinase? Functional groupings Substrate prediction Pathway analysis and modeling

Targets Protein kinase Protein phosphatase Membrane transporters Proteasome complex

Some Receptor Kinases Class I (EGF receptor) Class II (Insulin receptor) Class III (FGF receptor)

Requirements for Functional Clustering Must handle very large number of objects (over 1200 for plants, over 9000 for all species) Must deal sensibly with paralogs from functional point of view Must be based on entire sequence, not just kinase catalytic domain Must be tolerant to sequence errors and omissions

Orthology vs Paralogy Relationships between genes in multigene families are complex Multiple genes may exist before speciation Genes may be lost and replaced along lineages “Function space” must be filled Species A Species B

Clustering

Clustering/Classification Maximum linkage

Clustering/Classification Pairwise distances All-against-all BLAST Uses entire sequence Alignments not required Longer matches, i.e. more domains, give better score

Basic Approach Maximum linkage clustering up to “natural” limit Recalculate average distances between groups Repeat until tree is complete

Complete Kinase Clustering

Statistics Class 1: RLKs (transmembrane) and RLCKs Class 2: “Raf-like” Class 3: Casein Kinase and CLK Class 4: Non-TM, Non-Receptor

BLAST Distance Entire Sequence

BLAST Distance Non-Kinase Domain

Yeast Signaling (MAPK)

Validating Transgenomic Predictions

SnRK At AKIN10 and AKin11 Rescue yeast SNF1 deletion Functional homolog

MAPK

MEME PSSM

PPC4.2.6 MEME Motifs

Summary Functional groups by clustering Functional assignment by transgenomic comparison Directed search for functional motifs by motif comparison Construction of public data resources

Bioinformatics Group Michael Gribskov Fariba Fana Degeng Wang Sheila Podell Tobey Tam * Jason Tchieu * Hannes Niedner Douglas Smith Guangfa Zhang * Jeff Harper Major Contributors Catherine Chan Alice Harmon Estelle Hrabak David Kerk Shinhan Shiu