Function first: a powerful approach to post-genomic drug discovery Stephen F. Betz, Susan M. Baxter and Jacquelyn S. Fetrow GeneFormatics Presented by.

Slides:



Advertisements
Similar presentations
Unravelling the biochemical reaction kinetics from time-series data Santiago Schnell Indiana University School of Informatics and Biocomplexity Institute.
Advertisements

PREDetector : Prokaryotic Regulatory Element Detector Samuel Hiard 1, Sébastien Rigali 2, Séverine Colson 2, Raphaël Marée 1 and Louis Wehenkel 1 1 Bioinformatics.
In silico small molecule discovery Sales Target gene Discover hit Hit to lead Optimise lead Clinical Target gene identified with a viable assay High throughput.
 A cell is an organization of millions of molecules  Proper communication between these molecules is essential to the normal functioning of the cell.
Research Methodology of Biotechnology: Protein-Protein Interactions Yao-Te Huang Aug 16, 2011.
Biochemistry 300 Introduction to Structural Biology Walter Chazin 5140 BIOSCI/MRBIII
Biochemistry 301 Overview of Structural Biology Techniques Jan. 19, 2004.
Bioinformatics Needs for the post-genomic era Dr. Erik Bongcam-Rudloff The Linnaeus Centre for Bioinformatics.
Systems Biology Existing and future genome sequencing projects and the follow-on structural and functional analysis of complete genomes will produce an.
Structural bioinformatics
Introduction to Proteomics. First issue of Proteomics- Jan. 1, 2001.
A comprehensive analysis of protein-protein interactions in Saccharomyces cerevisiae Article by Peter Uetz, et.al. Presented by Kerstin Obando.
2004 PP&CW Optimization of protein expression and solubility Alternative and novel prokaryotic expression systems Eukaryotic expression systems Methods.
Biochemistry 300 Introduction to Structural Biology Walter Chazin 5140 BIOSCI/MRBIII
Summary Protein design seeks to find amino acid sequences which stably fold into specific 3-D structures. Modeling the inherent flexibility of the protein.
Proteomics Understanding Proteins in the Postgenomic Era.
Bioinformatics Ayesha M. Khan Spring Phylogenetic software PHYLIP l 2.
Important Points in Drug Design based on Bioinformatics Tools History of Drug/Vaccine development –Plants or Natural Product Plant and Natural products.
Overview of Bioinformatics A/P Shoba Ranganathan Justin Choo National University of Singapore A Tutorial on Bioinformatics.
Inverse Kinematics for Molecular World Sadia Malik April 18, 2002 CS 395T U.T. Austin.
Biochemistry 300 Introduction to Structural Biology Walter Chazin 5140 BIOSCI/MRBIII
Bioinformatics for biomedicine Protein domains and 3D structure Lecture 4, Per Kraulis
Cédric Notredame (30/08/2015) Chemoinformatics And Bioinformatics Cédric Notredame Molecular Biology Bioinformatics Chemoinformatics Chemistry.
Computer-Assisted Drug Design (1) i)Random Screening ii)Lead Development and Optimization using Multivariate Statistical Analyses. iii)Lead Generation.
NUS CS5247 A dimensionality reduction approach to modeling protein flexibility By, By Miguel L. Teodoro, George N. Phillips J* and Lydia E. Kavraki Rice.
23 May June May 2002 From genes to drugs via crystallography 19 May 1996 Experimental and computational approaches to structure based.
In God We Trust.
Practical session 2b Introduction to 3D Modelling and threading 9:30am-10:00am 3D modeling and threading 10:00am-10:30am Analysis of mutations in MYH6.
A systems biology approach to the identification and analysis of transcriptional regulatory networks in osteocytes Angela K. Dean, Stephen E. Harris, Jianhua.
Chapter 13. The Impact of Genomics on Antimicrobial Drug Discovery and Toxicology CBBL - Young-sik Sohn-
© Wiley Publishing All Rights Reserved. Protein and Specialized Sequence Databases.
Transmembrane proteins in the Protein Data Bank: identification and classification Gabor, E. Tusnady, Zsuzanna Dosztanyi and Istvan Simon Bioinformatics,
TOPICS IN (NANO) BIOTECHNOLOGY
-A cell is an organization of millions of molecules -Proper communication between these molecules is essential to the normal functioning of the cell -To.
Finish up array applications Move on to proteomics Protein microarrays.
Reconstruction of Transcriptional Regulatory Networks
Ozgur Ozturk, Ahmet Sacan, Hakan Ferhatosmanoglu, Yusu Wang The Ohio State University LFM-Pro: a tool for mining family-specific sites in protein structure.
Biological Databases Biology outside the lab. Why do we need Bioinfomatics? Over the past few decades, major advances in the field of molecular biology,
FLEXGene Repository Applications. Exploiting FLEXGene Genes FLEXGene Clones Expression Clones Proteins Protein Expression Localization Interaction Functional.
Protein Structure & Modeling Biology 224 Instructor: Tom Peavy Nov 18 & 23, 2009
Samudrala group - overall research areas CASP6 prediction for T Å C α RMSD for all 70 residues CASP6 prediction for T Å C α RMSD for all.
Patentability Considerations in the 3-D Structure Arts Patentability Considerations in the 3-D Structure Arts Michael P. Woodward Supervisory Patent Examiner.
High throughput Protein Measurement Techniques Harin Kanani.
Structural proteomics
PREDICTION OF CATALYTIC RESIDUES IN PROTEINS USING MACHINE-LEARNING TECHNIQUES Natalia V. Petrova (Ph.D. Student, Georgetown University, Biochemistry Department),
Bioinformatics MEDC601 Lecture by Brad Windle Ph# Office: Massey Cancer Center, Goodwin Labs Room 319 Web site for lecture:
Protein Structure Prediction: Homology Modeling & Threading/Fold Recognition D. Mohanty NII, New Delhi.
Central dogma: the story of life RNA DNA Protein.
By: Amira Djebbari and John Quackenbush BMC Systems Biology 2008, 2: 57 Presented by: Garron Wright April 20, 2009 CSCE 582.
Structural proteomics Handouts. Proteomics section from book already assigned.
Bioinformatics and Computational Biology
Discovering Modes of Action for Therapeutic Compounds Using a Genome-Wide Screen of Yeast Heterozygotes Pek Yee Lum, Christopher D. Armour, Daniel D. Shoemaker,
The type of target influence the type of drug Main drugs’ categories –Small molecules –Biologicals (e.g. antibodies, hormones, etc) The drug discovery.
EMBL-EBI Representative sets and Clustering.. EMBL-EBI Representative sets A subset of data that provides a statistically valid sample set for the complete.
BMC Bioinformatics 2005, 6(Suppl 4):S3 Protein Structure Prediction not a trivial matter Strict relation between protein function and structure Gap between.
Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.
Molecular Cell Biology Logic and Approaches to Research Cooper.
Modelling proteomes Ram Samudrala University of Washington.
신기술 접목에 의한 신약개발의 발전전망과 전략 LGCI 생명과학 기술원. Confidential LGCI Life Science R&D 새 시대 – Post Genomic Era Genome count ‘The genomes of various species including.
Structural Bioinformatics Elodie Laine Master BIM-BMC Semester 3, Genomics of Microorganisms, UMR 7238, CNRS-UPMC e-documents:
Page 1 Computer-aided Drug Design —Profacgen. Page 2 The most fundamental goal in the drug design process is to determine whether a given compound will.
Detection of genome regulation sequences
Protein structure Our understanding of life at the molecular level is highly dependent on the ability to map the molecular details of individual proteins.
APPLICATIONS OF BIOINFORMATICS IN DRUG DISCOVERY
Protein structure analysis Our understanding of life at the molecular level is highly dependent on the ability to map the molecular details of individual.
Molecular Docking Profacgen. The interactions between proteins and other molecules play important roles in various biological processes, including gene.
Predicting Active Site Residue Annotations in the Pfam Database
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
BIOINFORMATICS Summary
The increasing availability of quantitative biological data from the human genome project, coupled with advances in instrumentation, reagents, methodologies,
Presentation transcript:

Function first: a powerful approach to post-genomic drug discovery Stephen F. Betz, Susan M. Baxter and Jacquelyn S. Fetrow GeneFormatics Presented by Jamie Duke April 7 th, 2004

2 Goal: To answer this question: – How can we effectively use the new information from the genome sequencing projects to accelerate the development of new therapeutics that target gene products or their functions?

3 A “deluge of targets” There is an overwhelming amount of nucleotide sequences generate from high- throughput sequencing and differential expression methods Targets are not necessarily even expressed in vivo The actual targets are waiting to be discovered

4 Post-genomic drug discovery Methods must be able to deal with a large number of targets General strategy relies on high-throughput sequencing of large compound libraries against target proteins – Requires a knowledge of enzymatic activity, or binding against a known ligand Methods are costly, number of targets analyzed are limited, and a second assay is generally required

5 Screening Strategies Use of nuclear magnetic resonance (NMR) and x-ray crystallography Structure based drug design has also come into play with therapeutics Both strategies still require analyzing single proteins serially – Best method for the future involves automation

6 Screening Strategies Experimental efforts are resource intensive, and limited to proteins that can be cloned X-ray Crystallography is only possible if the protein can form diffraction-quality crystals NMR is only possible if the protein is well behaved in solution Structural biology is only possible with high quality structures

7 Structure Structural prediction is neither easy nor cheap Knowing tertiary structure does not guarantee the transfer of function or small molecule binding sites Inference of function from similar sequences with known function is correct less than 50% of the time – A “similar sequence” is a sequence that is 30% or more identical, most proteins do not meet this requirement – Additionally, different structures have been known to support the same activity

8 Selectivity The aim is to develop truly selective compounds from the beginning of the discovery process Decrease the failure rate of compounds in development and ultimately lower cost and time

9 Function in Drug Discovery Drug discovery starts by determining the function of the drug leads from mining the genomic data – Pathway involvement, catalytic activity, protein class or active-site chemistry Functional features can be used to develop assays for a more straightforward path “Parallel large-scale processes and analyses to identify function first will be key for this lead discovery approach to be successful in the post- genomic era.”

10 Function Assignment Function of the sequences is often inferred through sequence “similarity” – Function is automatically transferred, and can lead to misannotation and misinterpretation SAGE and parallel protein analysis are generally used – These experimental procedures allow for the gene product function to be identified in a complex environment yielding data which is used to validate the target – Unfortunately, low copy genes and a high false positive rate limit the use of these methods

11 Function first approach to structural and chemo-proteomics Process starts with the identifying a set of protein sequences in the human proteome – Looking for sequence that have particular binding sites, carries out catalysis, or has been previously identified Proteins are classified by their functional sites – Analysis of the families is key to specific drug design Structures of family members are determined using protein folding algorithms Small molecule binding sites are identified This approach saves crucial time and money in the drug discovery process

12 Approximate Structure Analysis For each protein, an approximate model is generated Algorithms developed by Jeffrey Skolnick for the CASP competitions are used to predict the structure Models are not perfect due to imperfect scoring functions and energy potentials

13 Fuzzy Functional Forms™ A technique to identify biochemical function An FFF is a motif that describes the chemistry and geometry of the functional site within the protein Information is based upon known structures in the PDB – Functional residues are identified in related protein structures – Residues are selected based on the nature of the function, chemistry or structure of the site – Geometric constraints are defined for key residues in the structure

14 Functional Family Approach Functional sites are (generally) well conserved in families FFF’s are used to determine all proteins in the genome with the given functional site Functional families are identified by: – Sites identified by FFF, and – Computational information on the functional site that yields valuable biologically relevant data necessary for drug discovery

15 Functional Family Approach The ultimate goal is to identify small molecules that will selectively inhibit a single member of a family, thus reducing interaction with other proteins This approach allows for the account and classification of functionally related proteins Provides a better assurance of the “druggability” for a putative target

16 Complementary to Cell Based Information This method allows for efficient target validation due to parallel identification Allows for large-scale identification of function and structure without large-scale investments Provides information that is relevant to assays being run to determine functionality and interpretation of microarray data

17 Rapid Analysis of the Structure Because the structure of the protein does not need to be determined to the atomic level, it is much less computationally intense Far fewer protein folding is done in silico because only models identified through FFF or high scoring models are folded The process is automated – Scientists can compute more than 25,000 protein sequences and make structure-function assignments in weeks, not months or years as would take to serially test each sequence through the experimental techniques The combination of structure and function information that allows for more reliable assignments than sequence based methods

18 Alternative Drug Binding Sites One key feature of FFF is it’s ability to identify multiple active sites for one protein A protein may be annotated as a phosphatase, but it may also have a catalytic site, a metal binding site, and a regulatory site, as does serine-threonine phosphatase Alternative sites are also potentially druggable The information from multifunctional

19 Binding Sites

20 Key Information It is key to know the structure of the functional site – Recognition of the similarities and differences among a set of potential targets allows for designing specific small molecules that are specific for each member of the family

21 Conclusion The function first approach provides an effective way to mine the genomic data to lead to compounds that can be developed into drugs Using this method, in association with biological, structural, and chemical methods will lead to drug discovery that is more efficient, and cost effective