1 Computational Approaches(1/7)  Computational methods can be divided into four categories: prediction methods based on  (i) The overall protein amino.

Slides:



Advertisements
Similar presentations
Assignment of PROSITE motifs to topological regions: Application to a novel database of well characterised transmembrane proteins Tim Nugent.
Advertisements

(SubLoc) Support vector machine approach for protein subcelluar localization prediction (SubLoc) Kim Hye Jin Intelligent Multimedia Lab
Targeting and assembly of proteins destined for chloroplasts and mitochondria How are proteins targeted to chloroplasts and mitochondria from the cytoplasm?
11/9/99ICTAI-99, Chicago1 Protein Secondary Structure Prediction Using Data Mining Tool C5 Meiliu Lu †, Du Zhang †, Hongjun Xu †, Ken Tse-yau Lau ‡, and.
MitoInteractome : Mitochondrial Protein Interactome Database Rohit Reja Korean Bioinformation Center, Daejeon, Korea.
Using phylogenetic profiles to predict protein function and localization As discussed by Catherine Grasso.
CITE EVIDENCE THAT ORGANISMS ARE LINKED BY LINES OF DESCENT FROM COMMON ANCESTRY LEARNING GOAL.
Tools to analyze protein characteristics Protein sequence -Family member -Multiple alignments Identification of conserved regions Evolutionary relationship.
Andreas Bender - Research Group Gisbert Schneider - Goethe-University Frankfurt1 Analysis of mitochondrial transit peptides of Plasmodium falciparum Andreas.
Prediction of protein localization and membrane protein topology Gunnar von Heijne Department of Biochemistry and Biophysics Stockholm Bioinformatics Center.
Systems Biology Existing and future genome sequencing projects and the follow-on structural and functional analysis of complete genomes will produce an.
Structural bioinformatics
Training a Neural Network to Recognize Phage Major Capsid Proteins Author: Michael Arnoult, San Diego State University Mentors: Victor Seguritan, Anca.
Mike Arnoult 9/30/2010 The role of Artificial Neural Networks in Phage Research.
Current Approaches to Whole Genome Phylogenetic Analysis Hongli Li.
Introduction to Pattern Recognition Prediction in Bioinformatics What do we want to predict? –Features from sequence –Data mining How can we predict? –Homology.
Identifying Extracellular Plant Proteins Based on Frequent Subsequences of Amino Acids Y. Wang, O. Zaiane, R. Goebel.
Tools to analyze protein characteristics Protein sequence -Family member -Multiple alignments Identification of conserved regions Evolutionary relationship.
An Introduction to Bioinformatics Protein Structure Prediction.
MCSG Site Visit, Argonne, January 30, 2003 Genome Analysis to Select Targets which Probe Fold and Function Space  How many protein superfamilies and families.
Identifying functional residues of proteins from sequence info Using MSA (multiple sequence alignment) - search for remote homologs using HMMs or profiles.
CHAPTER 15 Microbial Genomics Genomic Cloning Techniques Vectors for Genomic Cloning and Sequencing MS2, RNA virus nt sequenced in 1976 X17, ssDNA.
I.1 ii.2 iii.3 iv.4 1+1=. i.1 ii.2 iii.3 iv.4 1+1=
Project list 1.Peptide MHC binding predictions using position specific scoring matrices including pseudo counts and sequences weighting clustering (Hobohm)
Modeling Functional Genomics Datasets CVM Lesson 1 13 June 2007Bindu Nanduri.
I.1 ii.2 iii.3 iv.4 1+1=. i.1 ii.2 iii.3 iv.4 1+1=
Training a Neural Network to Recognize Phage Major Capsid Proteins Author: Michael Arnoult, San Diego State University Mentors: Victor Seguritan, Anca.
PREDICTION OF PROTEIN FEATURES Beyond protein structure (TM, signal/target peptides, coiled coils, conservation…)
MCB 7200: Molecular Biology
Lim et al, Supplemental Figure S Arsenic Plant height (Cm) As[μM] b/c g f e d c/d a/b a c/d a a/b Cadmium
Advanced Tools and Algorithms in Bioinformatics Chittibabu Guda Summer, 2004 UCSD Extension, Department of Biosciences.
Prediction of Subcellular Localization of Proteins ~ Past, Present, and Future ~ Human Genome Center, Inst. Med. Sci., University of Tokyo Kenta Nakai.
Shankar Subramaniam University of California at San Diego Data to Biology.
Truncation of Protein Sequences for Fast Profile Alignment with Application to Subcellular Localization Man-Wai MAK and Wei WANG The Hong Kong Polytechnic.
7 Unit 2: Cells. Vocabulary Cells Cells Are Awesome!
Functional Associations of Protein in Entire Genomes Sequences Bioinformatics Center of Shanghai Institutes for Biological Sciences Bingding.
CHAPTER 6 SYSTEMS BIOLOGY OF CELL ORGANIZATION
Gene Regulatory Network Inference. Progress in Disease Treatment  Personalized medicine is becoming more prevalent for several kinds of cancer treatment.
Predicting the Cellular Localization Sites of Proteins Using Decision Tree and Neural Networks Yetian Chen
Copyright  2003 limsoon wong Recognition of Protein Features Limsoon Wong Institute for Infocomm Research BI6103 guest lecture on ?? March 2004.
Protein Functional Annotation Dr G.P.S. Raghava. Annotation Methods Annotation by homology (BLAST) requires a large, well annotated database of protein.
1 Introduction(1/2)  Eukaryotic cells can synthesize up to 10,000 different kinds of proteins  The correct transport of a protein to its final destination.
Function, evolution, motifs and hierarchy Ashwin Sivakumar.
Copyright © 2009 Pearson Education, Inc. Art and Photos in PowerPoint ® Concepts of Genetics Ninth Edition Klug, Cummings, Spencer, Palladino Chapter 21.
Figure S1_Yao Qin et al. Figure S1 Occurrence and distribution of trihelix family in different plant species. Red branches in the cladogram indicate that.
Imagine a nerve cell that measures 125 µm x 1 µm x 1 µm. Predict how its surface area-to- volume ratio compare with those in the project yesterday. Then.
Unraveling condition specific gene transcriptional regulatory networks in Saccharomyces cerevisiae Speaker: Chunhui Cai.
Frontiers in the Convergence of Bioscience and Information Technologies 2007 Seyed Koosha Golmohammadi, Lukasz Kurgan, Brendan Crowley, and Marek Reformat.
Localization prediction of transmembrane proteins Stefan Maetschke, Mikael Bodén and Marcus Gallagher The University of Queensland.
What is a Project Purpose –Use a method introduced in the course to describe some biological problem How –Construct a data set describing the problem –Define.
Rice SCAMP1 Defines Clathrin-Coated, trans- Golgi–Located Tubular- Vesicular Structures as an Early Endosome in Tobacco BY-2 Cells.
Biological Signal Detection for Protein Function Prediction Investigators: Yang Dai Prime Grant Support: NSF Problem Statement and Motivation Technical.
Metabolic Network Inference from Multiple Types of Genomic Data Yoshihiro Yamanishi Centre de Bio-informatique, Ecole des Mines de Paris.
Eukaryotic Gene Prediction Rui Alves. How are eukaryotic genes different? DNA RNA Pol mRNA Ryb Protein.
An overview of Bioinformatics. Cell and Central Dogma.
LOGO iDNA-Prot|dis: Identifying DNA-Binding Proteins by Incorporating Amino Acid Distance- Pairs and Reduced Alphabet Profile into the General Pseudo Amino.
Introduction to biological molecular networks
Protein Properties Function, structure Residue features Targeting Post-trans modifications BIO520 BioinformaticsJim Lund Reading: Chapter , 11.7,
JIGSAW: a better way to combine predictions J.E. Allen, W.H. Majoros, M. Pertea, and S.L. Salzberg. JIGSAW, GeneZilla, and GlimmerHMM: puzzling out the.
Final Report (30% final score) Bin Liu, PhD, Associate Professor.
Finding genes in the genome
Detecting Protein Function and Protein-Protein Interactions from Genome Sequences TuyetLinh Nguyen.
Prediction of protein features. Beyond protein structure
Compare and contrast two cell groups
Functional Annotation of Transcripts
Cell Signaling (Lecture 1)
Biotechnology and Genetic Engineering PBIO 4500/5500
Protein Functional Annotation
Protein Functional Annotation
DNA and the Genome Key Area 1b Organisation of DNA.
Presentation transcript:

1 Computational Approaches(1/7)  Computational methods can be divided into four categories: prediction methods based on  (i) The overall protein amino acid composition  (ii) Known targeting sequences  (iii) Sequence homology and/or motifs  (iv) A combination of several sources of information (hybrid methods)

2 Computational Approaches(2/7) (i) The overall protein amino acid composition(1/2)  Nakashima and Nishikawa  Method for discriminating between intracellular and extracellular proteins  Using the distance between the overall amino acid composition vectors  Cedano et al  ProtLock for predicting five classes of subcellular localizations  Extracellular, intracellular, integral membrane, anchored membrane, and nuclear  Reinhardt and Hubbard  NNPSL, an approach using artificial neural networks (ANNs)  Predicting four eukaryotic and three prokaryotic subcellular localizations  Chou et al  SVM-based method for predicting twelve different subcellular localization taking sequence order effects into account

3 Computational Approaches(3/7) (i) The overall protein amino acid composition(2/2)  Huang et al  Using Fuzzy k-NNs algorithm  Describe the dipeptide composition of the whole protein sequence for eleven different localizations  Yu, C.S. (CELLO method)  Prediction of five subcellular localizations in Gram-negative bacteria  Based on the composition of peptides of varying lengths  Andrade et al  First to incorporate structural information into the amino acid composition vectors  Composition of eukaryotic proteins with known structure was used  The rationale behind this approach  The interiors of proteins have stayed fairly constant during evolution

4 Computational Approaches(4/7) (ii) Known targeting sequences  Gunnar von Heijne (TargetP)  The most comprehensive method based on N-terminal targeting sequences  Prediction of chloroplast, mitochondrial, secretory pathway, and other proteins  Claros M.G. (MitoProt and Predotar)  Specifically discriminate chloroplast from mitochondrial proteins  Bannai H. et al (iPsort)  Using knowledge-based rules for prediction based on protein sequence features

5 Computational Approaches(5/7) (iii) Sequence homology and/or motifs  Marcotte et al  Assigns the subcellular localization by constructing phylogenetic profiles of the proteins  Cokol M et al(PredictNLS)  Specialized on recognizing nuclear proteins  Based on a collection of nuclear localization sequences  Lu et al(Proteome Annalyst)  Based on SWISS-PROT keywords and the annotation of homologous proteins

6 Computational Approaches(6/7) (iv) Hybrid methods  Nakai K and Kanehisa(PSORT)  One of the first methods developed for predicting the subcellular localization  Using the overall amino acid composition, N-terminal targeting sequence information, and motifs  This method uses a set of knowledge-based "if-then" rules  Predicts 14 animal and 17 plant subcellular localizations  PSORT II and and PSORT-B-Extensions of the PSORT  Drawid and G  Method that incorporates information about sequence motifs, overall sequence properties and mRNA expression levels  Based on a Bayesian prediction model and was tested on the yeast genome  Guda C et al(MITOPRED)  Specialized for predicting mitochondrial proteins  Based on amino acid composition