Biological Networks. Can a biologist fix a radio? Lazebnik, Cancer Cell, 2002.

Slides:



Advertisements
Similar presentations
Finding regulatory modules from local alignment - Department of Computer Science & Helsinki Institute of Information Technology HIIT University of Helsinki.
Advertisements

Global Mapping of the Yeast Genetic Interaction Network Tong et. al, Science, Feb 2004 Presented by Bowen Cui.
Basic Genomic Characteristic  AIM: to collect as much general information as possible about your gene: Nucleotide sequence Databases ○ NCBI GenBank ○
Structural bioinformatics
Intro to Bioinformatics Summary. What did we learn Pairwise alignment – Local and Global Alignments When? How ? Tools : for local blast2seq, for global.
Protein RNA DNA Predicting Protein Function. Biochemical function (molecular function) What does it do? Kinase??? Ligase??? Page 245.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Gene Co-expression Network Analysis BMI 730 Kun Huang Department of Biomedical Informatics Ohio State University.
Multiple Sequence Alignment Algorithms in Computational Biology Spring 2006 Most of the slides were created by Dan Geiger and Ydo Wexler and edited by.
Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break 14:45 – 15:15Regulatory pathways lecture 15:15 – 15:45Exercise.
Introduction to Genomics, Bioinformatics & Proteomics Brian Rybarczyk, PhD PMABS Department of Biology University of North Carolina Chapel Hill.
Key dates lists of suggested projects published * *If you or your partner are working in a biology lab, try to find a relevant project which can.
Biological networks Tutorial 12. Protein-Protein interactions –STRING Protein and genetic interactions –BioGRID Signaling pathways –SPIKE Network visualization.
Projects Key dates lists of suggested projects published * *You are highly encouraged to choose a project yourself or find a relevant project.
Introduction to Bioinformatics - Tutorial no. 5 MEME – Discovering motifs in sequences MAST – Searching for motifs in databanks TRANSFAC – The Transcription.
Graph, Search Algorithms Ka-Lok Ng Department of Bioinformatics Asia University.
Multiple sequence alignments and motif discovery Tutorial 5.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Exploring Protein Sequences Tutorial 5. Exploring Protein Sequences Multiple alignment –ClustalW Motif discovery –MEME –Jaspar.
Bioinformatics Unit 1: Data Bases and Alignments Lecture 3: “Homology” Searches and Sequence Alignments (cont.) The Mechanics of Alignments.
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Networks and Network Topology Thomas Skøt Jensen Center for Biological.
Protein Interactions and Disease Audry Kang 7/15/2013.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
341: Introduction to Bioinformatics Dr. Natasa Przulj Deaprtment of Computing Imperial College London
Bioinformatics for biomedicine Protein domains and 3D structure Lecture 4, Per Kraulis
Genome Informatics 2005 ~ 220 participants 1 keynote speaker: David Haussler 47 talks 121 posters.
From motif search to gene expression analysis
Protein Sequence Alignment and Database Searching.
NCBI Review Concepts Chuong Huynh. NCBI Pairwise Sequence Alignments Purpose: identification of sequences with significant similarity to (a)
Sequence analysis: Macromolecular motif recognition Sylvia Nagl.
Multiple Alignment and Phylogenetic Trees Csc 487/687 Computing for Bioinformatics.
Finish up array applications Move on to proteomics Protein microarrays.
From Structure to Function. Given a protein structure can we predict the function of a protein when we do not have a known homolog in the database ?
Motif discovery Tutorial 5. Motif discovery MEME Creates motif PSSM de-novo (unknown motif) MAST Searches for a PSSM in a DB TOMTOM Searches for a PSSM.
Biological Networks. Can a biologist fix a radio? Lazebnik, Cancer Cell, 2002.
CS5263 Bioinformatics Lecture 20 Practical issues in motif finding Final project.
Biological networks Tutorial 12. Protein-Protein interactions –STRING Protein and genetic interactions –BioGRID Network visualization –Cytoscape Cool.
Protein and RNA Families
Protein Sequence Analysis - Overview - NIH Proteomics Workshop 2007 Raja Mazumder Scientific Coordinator, PIR Research Assistant Professor, Department.
Motif discovery and Protein Databases Tutorial 5.
Bioinformatics MEDC601 Lecture by Brad Windle Ph# Office: Massey Cancer Center, Goodwin Labs Room 319 Web site for lecture:
EB3233 Bioinformatics Introduction to Bioinformatics.
Analysis and comparison of very large metagenomes with fast clustering and functional annotation Weizhong Li, BMC Bioinformatics 2009 Present by Chuan-Yih.
Genome annotation and search for homologs. Genome of the week Discuss the diversity and features of selected microbial genomes. Link to the paper describing.
Bioinformatics and Computational Biology
Genome Biology and Biotechnology The next frontier: Systems biology Prof. M. Zabeau Department of Plant Systems Biology Flanders Interuniversity Institute.
341- INTRODUCTION TO BIOINFORMATICS Overview of the Course Material 1.
Exercises Pairwise alignment Homology search (BLAST) Multiple alignment (CLUSTAL W) Iterative Profile Search: Profile Search –Pfam –Prosite –PSI-BLAST.
Biological Networks.
Gene expression. Gene Expression 2 protein RNA DNA.
David Wishart February 18th, 2004 Lecture 3 BLAST (c) 2004 CGDN.
Biological Networks. Can a biologist fix a radio? Lazebnik, Cancer Cell, 2002.
Motif Search and RNA Structure Prediction Lesson 9.
Tutorial 8 Gene expression analysis 1. How to interpret an expression matrix Expression data DBs - GEO Clustering –Hierarchical clustering –K-means clustering.
Rita Casadio BIOCOMPUTING GROUP University of Bologna, Italy Prediction of protein function from sequence analysis.
Copyright OpenHelix. No use or reproduction without express written consent1.
V diagonal lines give equivalent residues ILS TRIVHVNSILPSTN V I L S T R I V I L P E F S T Sequence A Sequence B Dot Plots, Path Matrices, Score Matrices.
V diagonal lines give equivalent residues ILS TRIVHVNSILPSTN V I L S T R I V I L P E F S T Sequence A Sequence B Dot Plots, Path Matrices, Score Matrices.
Introduction to Bioinformatics Lecturer: Prof. Yael Mandel-Gutfreund Teaching Assistance: Rachelly Normand Olga Karinski Course web site :
1 Lesson 12 Networks / Systems Biology. 2 Systems biology  Not only understanding components! 1.System structures: the network of gene interactions and.
 What is MSA (Multiple Sequence Alignment)? What is it good for? How do I use it?  Software and algorithms The programs How they work? Which to use?
Comparative Network Analysis BMI/CS 776 Spring 2013 Colin Dewey
Projects
Introduction to Bioinformatics
FINAL PROJECT- Key dates
Sequence based searches:
Schedule for the Afternoon
MULTIPLE SEQUENCE ALIGNMENT
Basic Local Alignment Search Tool
Presentation transcript:

Biological Networks

Can a biologist fix a radio? Lazebnik, Cancer Cell, 2002

Building models from parts lists Lazebnik, Cancer Cell, 2002

Building models from parts lists

Computational tools are needed to distill pathways of interest from large molecular interaction databases Thinking computationally about biological process may lead to more accurate models, which in turn can be used to improve the design of algorithms Navlakha and Bar-Joseph 2011

Jeong et al. Nature 411, (2001) Biological Networks

Proteins Physical Interaction Protein-Protein A B Protein Interaction Transcription factor Target genes Transcriptional Interaction Protein-DNA A B Transcriptional Different types of Biological Networks Nodes Edges

What can we learn from the topology of biological networks Hubs tend to be “older” proteins Hubs are evolutionary conserved Hubs are highly connected nodes Are hubs functionally important ?

Hubs are usually critical proteins for the species Lethal Slow-growth Non-lethal Unknown Jeong et al. Nature 411, (2001)

Networks can help to predict function

Can the network help to predict function Begley TJ, Mol Cancer Res Systematic phenotyping of 1615 gene knockout strains in yeast Evaluation of growth of each strain in the presence of MMS (and other DNA damaging agents) Screening against a network of 12,232 protein interactions

Mapping the phenotypic data to the network Begley TJ, Mol Cancer Res. 2002

Mapping the phenotypic data to the network Begley TJ, Mol Cancer Res. 2002

Networks can help to predict function Begley TJ, Mol Cancer Res

A network approach to predict new drug targets Aim :to identify critical positions on the ribosome which could be potential targets of new antibiotics

Keats ( )Kafka ( )Orwell ( ) Mozart ( )Schubert ( ) Chopin ( )

In our days… Infectious diseases are still number 1 cause of premature death. (0-44 years of age) worldwide. Annually kill >13 million people (~33% of all deaths)

Antibiotics targets of the large ribosomal subunit The ribosome is a target for approximately half of antibiotics characterized to date

Looking at the ribosome as a network A1191

Many biological network have characteristics of a Small World Network Every node can be reached from every other by a small number of steps

Does the ribosome network have characteristics of a Small World Network? Ribosome GraphSWN (L) Average Path Length ( C) Clustering Coefficient

What can we learn from the ribosome network? 1.Critical sites in the ribosome network may represent functional sites (not discovered before) 2. New functional sites may be good sites for drug design

Looking for critical positions in a network

Degree: the number of edges that a node has. The node with the highest degree in the graph (HUB)

Degree: the number of edges that a node has. The node with the highest degree in the graph (HUB) Looking for critical positions in a network

Closeness (centrality) Closeness: measure how close a node to all other nodes in the network. The nodes with the highest closeness

Betweenness (connectivity) The node with the highest betweenness Betweenness: quantify the number of all shortest paths that pass through a node.

The node with the highest degree The node with the highest betweenness The nodes with the highest closeness Looking for critical positions in a network

Looking at macromolecular structures as a network A1191 A1191 have the highest closeness, betwenness, and degree.

Which (is there a?) property best characterizes the known function sites? How can the network approach help identify functional sites in the ribosome ? Characterize the whole ribosome as a network Calculate the network properties of each nucleotide ?

Lethal mutations Neutral mutations 1 2 When mutating the critical site on the ribosome the bacteria will not grow

Critical site on the ribosome have very high centrality values (closeness) Lethal Mutations Neutral Mutations David-Eden and Mandel-Gutfreund, 2008 nucleotides with the highest closeness nucleotides with the highest closeness P-value~0 P-value=1

Critical site on the ribosome have very high connectivity (betweenness) Lethal Mutations Neutral Mutations David-Eden et al, 2008 nucleotides with the highest betweennes nucleotides with the highest betweennes P-value~0 P-value=1

p~0 p=0.01 Critical site on the ribosome have unique network properties Lethal mutationsNeutral mutations David-Eden et al, NAR (2008)

‘Druggability Index’ ‘Druggability Index’ Based on the network property David-Eden et al. NAR (2010) Bad siteGood site

Pockets with the highest ‘Druggability Index’ overlap known drug binding site s David-Eden et al. NAR (2010) DI=1DI=0.98 ErythromycinTelithromycin Girodazole DI=0.94DI=0.93

Course Summary and How to start working on your project

What did we learn Pairwise alignment – Dynamic Programing Local and Global Alignments When? How ? Recommended Tools : for local alignment blast2seq last.ncbi.nlm.nih.gov/Blast.cgi?PAGE_TYPE=BlastSearch&PROG_DE F=blastn&BLAST_PROG_DEF=megaBlast&BLAST_SPEC=blast2seq For global best use MSA tools such as Clustal W2, Muscle (see next slide)

What did we learn Multiple alignments (MSA) When? How ? MSA are needed as an input for many different purposes: searching motifs, phylogenetic analysis, protein and RNA structure predictions, conservation of specific nts/residues Recommended Tools : Clustal W2 (best for DNA and RNA), MUSCLE (best for proteins) Phylogeny.fr phylogenetic trees

What did we learn Search a sequence against a database When? How ? - BLAST :Remember different option for BLAST!!! (blastP blastN…. ), make sure to search the right database!!! DO NOT FORGET –You can change the scoring matrices, gap penalty etc - PSIBLAST Searching for remote homologies BLAST

What did we learn >Motif search When? How ? -Searching for overabundance of unknown regulatory motifs in a set of sequences ; e.g promoters of genes which have similar expression pattern (MEME) >Domain search Pfam (database to search for protein domains) Suggested Tools : MEME DRIMUST PFAM

What did we learn Protein Secondary Structure Prediction- When? How ? – Helix/Beta/Coil – Most successful approaches rely on information from the environment and MSA - Predictions level around 80% Suggested tools Jpred:

What did we learn Protein Tertiary Structure Prediction- When? How ? – First we must look at sequence identity to a sequence with a known structure!! – Sequence homology based methods- Homology modeling – Structure homology based methods- Threading Remember : Low quality models can be miss leading !! Database and tools Protein Data Bank Suggested tool for molecular visualization Good tool for homology modeling

What did we learn RNA Structure and Function Prediction- When? How ? – MFE based methods– good for local interactions, several predictions of low energy structures – Adding information from MSA can help but usually not available – RNA families are characterized by their structure (Rfam). Suggested tools: RNAfold RFAM

What did we learn Gene expression When? How ? > Unsupervised methods- Different clustering methods : K-means, Hierarchical Clustering > Supervised methods-such as SVM –GO annotation (analysis of gene clusters..) Selected databases and tools GEO EPclust David

What did we learn? Biological Networks Different types of Biological Networks Protein-Protein (non-directed) Regulatory networks (directed) structural networks Network Motifs Network Topology Selected tools String Biogrid Cytoscape

Most useful databases Genomic database The human genome browser Protein database Uniprot Structure database PDB (RCSB) Gene expression database GEO

So How do we start … Now that you have selected a project you should carefully plan your next steps: A.Make sure you understand the problem and read the necessary background to proceed B. formulate your working plan, step by step C. After you have a plan, start from extracting the necessary data and decide on the relevant tools to use at the first step. When running a tool make sure to summarize the results and extract the relevant information you need to answer your question, it is recommended to save the raw data for your records, don't present raw data in your final project. Your initial results should guide you towards your next steps. D. When you feel you explored all tools you can apply to answer your question you should summarize and get to conclusions. Remember NO is also an answer as long as you are sure it is NO. Also remember this is a course project not only a HW exercise..

Preparing a poster Prepare in PPT poster size cm Title of the project Names and affiliation of the students presenting The poster should include 5 sections : Background should include description of your question (can add figure) Goal and Research Plan: Describe the main objective and the research plan Results (main section) : Present your results in 3-4 figures, describe each figure (figure legends) and give a title to each result Conclusions : summarized in points the conclusions of your project References : List the references of paper/databases/tools used for your project

Key date reminder 16.1 Submission project proposal 20.1 Meetings with supervisors 19.3 Poster submission 26.3 Poster presentation (POSTER DAY 12:30-14:30)