Bioinformatics Predrag Radivojac I NDIANA U NIVERSITY.

Slides:



Advertisements
Similar presentations
Martin John Bishop UK HGMP Resource Centre Hinxton Cambridge CB10 1 SB
Advertisements

1 Introduction to Sequence Analysis Utah State University – Spring 2012 STAT 5570: Statistical Bioinformatics Notes 6.1.
BIOINFORMATICS Ency Lee.
AP Biology Teaching Biology Through Bioinformatics Real world genomics research in your classroom Kim B. Foglia Division Ave. High School Levittown.
Bioinformatics at IU - Ketan Mane. Bioinformatics at IU What is Bioinformatics? Bioinformatics is the study of the inherent structure of biological information.
Structural Genomics and Human Health
Systems Biology Existing and future genome sequencing projects and the follow-on structural and functional analysis of complete genomes will produce an.
Bioinformatics Dr. Aladdin HamwiehKhalid Al-shamaa Abdulqader Jighly Lecture 1 Introduction Aleppo University Faculty of technical engineering.
Introduction to Bioinformatics Spring 2008 Yana Kortsarts, Computer Science Department Bob Morris, Biology Department.
Computational Molecular Biology (Spring’03) Chitta Baral Professor of Computer Science & Engg.
Using Bioinformatics to Make the Bio- Math Connection The Confessions of a Biology Teacher.
Bioinformatics: a Multidisciplinary Challenge Ron Y. Pinter Dept. of Computer Science Technion March 12, 2003.
What is bioinformatics? Finding patterns in molecular biological data Implies: managing molecular biological data identifying correlations in molecular.
Data-intensive Computing: Case Study Area 1: Bioinformatics B. Ramamurthy 6/17/20151.
Introduction to Genomics, Bioinformatics & Proteomics Brian Rybarczyk, PhD PMABS Department of Biology University of North Carolina Chapel Hill.
The Cell, Central Dogma and Human Genome Project.
Integration of Bioinformatics into Inquiry Based Learning by Kathleen Gabric.
Workshop in Bioinformatics 2010 What is it ? The goals of the class… How we do it… What’s in the class Why should I take the class..
Bioinformatics in the Biology Curriculum Gloria Rendon NCSA July 2008.
CISC667, F05, Lec24, Liao1 CISC 667 Intro to Bioinformatics (Fall 2005) DNA Microarray, 2d gel, MSMS, yeast 2-hybrid.
Bioinformatics Original definition (1979 by Paulien Hogeweg): “application of information technology and computer science to the field of molecular biology”
GTL Facilities Characterization and Imaging of Molecular Machines Lee Makowski.
Ayesha Masrur Khan Spring Course Outline Introduction to Bioinformatics Definition of Bioinformatics and Related Fields Earliest Bioinformatics.
ExPASy - Expert Protein Analysis System The bioinformatics resource portal and other resources An Overview.
Pathways Database System: An Integrated System For Biological Pathways L. Krishnamurthy, J. Nadeau, G. Ozsoyoglu, M. Ozsoyoglu, G. Schaeffer, M. Tasan.
341: Introduction to Bioinformatics Dr. Natasa Przulj Deaprtment of Computing Imperial College London
Cédric Notredame (30/08/2015) Chemoinformatics And Bioinformatics Cédric Notredame Molecular Biology Bioinformatics Chemoinformatics Chemistry.
BASIC BIOCHEMISTRY MLAB Introduction. INTRODUCTION TO BASIC BIOCHEMISTRY Biochemistry can be defined as the science concerned with the chemical.
Bioinformatics.
A systems biology approach to the identification and analysis of transcriptional regulatory networks in osteocytes Angela K. Dean, Stephen E. Harris, Jianhua.
PROTEIN STRUCTURE NAME: ANUSHA. INTRODUCTION Frederick Sanger was awarded his first Nobel Prize for determining the amino acid sequence of insulin, the.
A brief Introduction to Bioinformatics Y. SINGH NELSON R. MANDELA SCHOOL OF MEDICINE DEPARTMENT OF TELEHEALTH Content licensed under.
Fast Search Protein Structure Prediction Algorithm for Almost Perfect Matches1 By Jayakumar Rudhrasenan S Primary Supervisor: Prof. Heiko Schroder.
Introduction to Bioinformatics Spring 2002 Adapted from Irit Orr Course at WIS.
PART II. Prediction of functional regions within disordered proteins Zsuzsanna Dosztányi MTA-ELTE Momentum Bioinformatics Group Department of Biochemistry.
BIOINFORMATICS IN BIOCHEMISTRY Bioinformatics– a field at the interface of molecular biology, computer science, and mathematics Bioinformatics focuses.
Intelligent systems in bioinformatics Introduction to the course.
Discovering the Correlation Between Evolutionary Genomics and Protein-Protein Interaction Rezaul Kabir and Brett Thompson
Workshop Aims NMSU GO Workshop 20 May Aims of this Workshop  WIIFM? modeling examples background information about GO modeling  Strategies for.
A Tutorial of Sequence Matching in Oracle Haifeng Ji* and Gang Qian** * Oklahoma City Community College ** University of Central Oklahoma.
Biological Signal Detection for Protein Function Prediction Investigators: Yang Dai Prime Grant Support: NSF Problem Statement and Motivation Technical.
PREDICTION OF CATALYTIC RESIDUES IN PROTEINS USING MACHINE-LEARNING TECHNIQUES Natalia V. Petrova (Ph.D. Student, Georgetown University, Biochemistry Department),
Proteomics Session 1 Introduction. Some basic concepts in biology and biochemistry.
Information Technology in the Natural Sciences Biology – Chemistry – Physics.
Central dogma: the story of life RNA DNA Protein.
EB3233 Bioinformatics Introduction to Bioinformatics.
Introduction to Protein Structure Prediction BMI/CS 576 Colin Dewey Fall 2008.
An overview of Bioinformatics. Cell and Central Dogma.
Bioinformatics and Computational Biology
 Unit 3: Seminar Sickle Cell Anemia. Types of Biomolecules Figure 2-17 Molecular Biology of the Cell (© Garland Science 2008)
Introduction to biological molecular networks
341- INTRODUCTION TO BIOINFORMATICS Overview of the Course Material 1.
Bioinformatics Project BB201 Metabolism A.Nasser
Bioinformatics Chem 434 Dr. Nancy Warter-Perez Computer Engineering Dr. Jamil Momand Chemistry & Biochemistry.
Biological Networks. Can a biologist fix a radio? Lazebnik, Cancer Cell, 2002.
Proteomics Informatics (BMSC-GA 4437) Instructor David Fenyö Contact information
COURSE OF BIOINFORMATICS Exam_30/01/2014 A.
Effect of Alcohol on Brain Development NormalFetal Alcohol Syndrome.
Haemoglobin: Structure and Formation of Proteins 1.1.2, a, b,c.
Graduate Research with Bioinformatics Research Mentors Nancy Warter-Perez, ECE Robert Vellanoweth Chem and Biochem Fellow Sean Caonguyen 8/20/08.
BME435 BIOINFORMATICS.
19.5 Protein Structure: Tertiary and Quaternary Levels
Bioinformatics Overview
Biological Databases By: Komal Arora.
7.3 Translation udent_view0/chapter3/animation__how_translation_work s.html.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
LESSON 1 INTNRODUCTION HYE-JOO KWON, Ph.D /
Network biology An introduction to STRING and Cytoscape
Introduction to Bioinformatics
Presentation transcript:

Bioinformatics Predrag Radivojac I NDIANA U NIVERSITY

Basics of Molecular Biology Can we understand how cells function? Eukaryotic cell

Bioinformatics is multidisciplinary! What is Bioinformatics? –Integrates: computer science, statistics, chemistry, physics, and molecular biology –Goal: organize and store huge amounts of biological data and extract knowledge from it Major areas of research –Genomics –Proteomics –Databases Practical discipline Some major applications · Drug design · Evolutionary studies · Genome characterization

Interesting Problems Sequence Alignment

Interesting Problems

Sequence assembly Goal: solve the puzzle, i.e. connect the pieces into one genomic sequence

Interesting Problems Proteomics Mass spectrometry

Interesting Problems Microarray data

Interesting Problems Functional Genomics Gene Regulation

Diseases are interconnected… Goh et al. PNAS, 104: 8685 (2007).

Disease Development of tools that can be used to understand and treat human disease Prediction of disease-associated genes Important from biological standpoint medical standpoint computational standpoint Background human genome low-throughput data high-throughput data ontologies for protein function at multiple levels The Time is Right!

Alzheimer’s disease Top PhenoPred hits: 1) CDK5 2) NTN1 AUC = 77.5%

Loss/Gain of function and disease Pauling et al. Science 110: 543 (1949). Chui & Dover. Curr Opin Pediatr, 13: 22 (2001). Sickle Cell Disease: Autosomal recessive disorder E6V in HBB causes interaction w/ F85 and L88 Formation of amyloid fibrils Abnormally shaped red blood cells, leads to sickle cell anemia Manifestation of disease vastly different over patients 2hbs E6V 4hhb

Lipitor (ATORVASTATIN) E6V

15 Proteins = chains of amino acids biomolecule, macromolecule –more than 50% of the dry weight of cells is proteins polymer of amino acids connected into linear chains strings of symbols machinery of life –play central role in the structure and function of cells –regulate and execute many biological functions a) amino acid b) amino acid chain Introduction to Protein Structure by Branden and Tooze

16 peptide bonds are planar and strong by rotating at each amino acid, proteins adopt structure Protein structure Introduction to Protein Structure by Branden and Tooze

17 Protein function Multi-level phenomenon –biochemical function –biological function –phenotypical function Example: kinase –biochemical function – transferase –biological function – cell cycle regulation –phenotypical function – disease Function is everything that happens to or through a protein (Rost et al. 2003)

Myoglobin 1.4A X-ray PDB: 2jho 153 residues C  - C  < 6A Protein contact graph

21 S113 of isocitrate dehydrogenase G = (V, E) f: V  A A = {A, C, D, … W, Y} g: V  {  1, +1} Notation: Residue neighborhood

22 Graphlets are small non-isomorphic connected graphs. Different positions of the pivot vertex with respect to the graphlet correspond to graph-theoretical concept of automorphism orbits, or orbits. S Przulj et al. Bioinformatics 20: 3508 (2004).

Results

2-graphlets:01 3-graphlets:011, graphlets:0111, , 0123 Key insight: Efficient combinatorial enumeration of graphlets / orbits over 7 disjoint cases breadth-first search

0 1 | A | o 2 | A | 2 o 5, o 6, o 11 | A | 3 o 3, o 4 ? A = {0, 1} 00, 01 = 10, 11 (3) A = {0, 1, 2} 00, 11, 22, 01 = 10, 02 = 20, 12 = 21 (6) binomial (multinomial) coefficients | A |= 20, dimensionality = 1,062,

Inner product between vectors of counts of labeled orbits where K is a kernel because matrices of inner products are symmetric and positive definite (proof due to David Haussler). Graphlet kernel  i ( x ) is the number of times labeled orbit i occurs in the graph