COMPUTATIONAL ENGINEERING OF BIONANOSTRUCTURES RAM SAMUDRALA ASSOCIATE PROFESSOR UNIVERSITY OF WASHINGTON How can we design peptides and proteins capable.

Slides:



Advertisements
Similar presentations
Clustering the Temporal Sequences of 3D Protein Structure Mayumi Kamada +*, Sachi Kimura, Mikito Toda ‡, Masami Takata +, Kazuki Joe + + : Graduate School.
Advertisements

Doug Brutlag 2011 Bioinformatics Genomics, Bioinformatics.
Computational characterization of biomolecular networks in physiology and disease Kakajan Komurov, Ph.D Department of Systems Biology University of Texas.
Structural bioinformatics
A comprehensive analysis of protein-protein interactions in Saccharomyces cerevisiae Article by Peter Uetz, et.al. Presented by Kerstin Obando.
Modern Biology and Applied Mathematics A Productive Conjunction of Science and Mathematics Bio-Math Teachers Workshop Rutgers University July Luther.
Sequence similarity.
1-month Practical Course Genome Analysis Lecture 3: Residue exchange matrices Centre for Integrative Bioinformatics VU (IBIVU) Vrije Universiteit Amsterdam.
BLOSUM Information Resources Algorithms in Computational Biology Spring 2006 Created by Itai Sharon.
NUS CS5247 A dimensionality reduction approach to modeling protein flexibility By, By Miguel L. Teodoro, George N. Phillips J* and Lydia E. Kavraki Rice.
NOVEL PARADIGMS FOR DRUG DISCOVERY SHOTGUN COMPUTATIONAL MULTITARGET SCREENING RAM SAMUDRALA ASSOCIATE PROFESSOR UNIVERSITY OF WASHINGTON PIONEER AWARD.
MATISSE - Modular Analysis for Topology of Interactions and Similarity SEts Igor Ulitsky and Ron Shamir Identification.
Modelling, comparison, and analysis of proteomes Ram Samudrala University of Washington.
Problem Statement and Motivation Key Achievements and Future Goals Technical Approach Investigators: Yang Dai Prime Grant Support: NSF High-throughput.
Chapter 13. The Impact of Genomics on Antimicrobial Drug Discovery and Toxicology CBBL - Young-sik Sohn-
Lecture 10 – protein structure prediction. A protein sequence.
Modelling proteomes An integrated computational framework for systems biology research Ram Samudrala University of Washington How does the genome of an.
Gary Stormo by Andrew Bardee. History Born 1950 in South Dakota Undergraduate in Biology from Caltech PhD in Molecular Biology from University of Colorado.
Pairwise Sequence Alignment. The most important class of bioinformatics tools – pairwise alignment of DNA and protein seqs. alignment 1alignment 2 Seq.
Protein Folding Programs By Asım OKUR CSE 549 November 14, 2002.
Function first: a powerful approach to post-genomic drug discovery Stephen F. Betz, Susan M. Baxter and Jacquelyn S. Fetrow GeneFormatics Presented by.
Laxman Yetukuri T : Modeling of Proteomics Data
Modelling Genome Structure and Function Ram Samudrala University of Washington.
Sequence Alignment Csc 487/687 Computing for bioinformatics.
Web Databases for Drosophila Introduction to FlyBase and Ensembl Database Wilson Leung6/06.
Samudrala group - overall research areas CASP6 prediction for T Å C α RMSD for all 70 residues CASP6 prediction for T Å C α RMSD for all.
An Integrated Computational Framework for Systems Biology Ram Samudrala University of Washington How does the genome of an organism specify its behaviour.
Biological Signal Detection for Protein Function Prediction Investigators: Yang Dai Prime Grant Support: NSF Problem Statement and Motivation Technical.
Lecture 9. Functional Genomics at the Protein Level: Proteomics.
Computational engineering of bionanostructures Ram Samudrala University of Washington How can we analyse, design, & engineer peptides capable of specific.
Epidemiology 217 Molecular and Genetic Epidemiology Bioinformatics & Proteomics John Witte.
Central dogma: the story of life RNA DNA Protein.
Modelling protein tertiary structure Ram Samudrala University of Washington.
THERAPUETIC DISCOVERY BY MODELLING INTERACTOMES RAM SAMUDRALA ASSOCIATE PROFESSOR UNIVERSITY OF WASHINGTON How does the genome of an organism specify its.
While gene expression data is widely available describing mRNA levels in different cancer cells lines, the molecular regulatory mechanisms responsible.
Bioinformatics and Computational Biology
COMPUTATIONAL ENGINEERING OF BIONANOSTRUCTURES
MODELLING INTERACTOMES RAM SAMUDRALA ASSOCIATE PROFESSOR UNIVERSITY OF WASHINGTON How does the genome of an organism specify its behaviour and characteristics?
Modelling proteomes Ram Samudrala Department of Microbiology How does the genome of an organism specify its behaviour and characteristics?
COMPUTATIONAL BIOLOGY IN DRUG DISCOVERY RAM SAMUDRALA ASSOCIATE PROFESSOR UNIVERSITY OF WASHINGTON How can we computationally screen compounds against.
Sequence Alignment.
Typically, classifiers are trained based on local features of each site in the training set of protein sequences. Thus no global sequence information is.
MODELLING PROTEOMES RAM SAMUDRALA ASSOCIATE PROFESSOR UNIVERSITY OF WASHINGTON How does the genome of an organism specify its behaviour and characteristics?
Modelling proteomes Ram Samudrala University of Washington How does the genome of an organism specify its behaviour and characteristics?
MODELLING INTERACTOMES RAM SAMUDRALA ASSOCIATE PROFESSOR UNIVERSITY OF WASHINGTON How does the genome of an organism specify its behaviour and characteristics?
Modelling proteins and proteomes using Linux clusters Ram Samudrala University of Washington.
HCV Protease Yasir Waheed. Discovery of Hepatitis C.
Molecular dynamics simulations of toxin binding to ion channels Quantitative description protein –ligand interactions is a fundamental problem in molecular.
Discovery of Therapeutics to Improve Quality of Life Ram Samudrala University of Washington.
Modelling proteomes Ram Samudrala University of Washington.
Artificial Intelligence Research Laboratory Bioinformatics and Computational Biology Program Computational Intelligence, Learning, and Discovery Program.
Modelling proteomes: Application to understanding HIV disease progression Ram Samudrala Department of Microbiology University of Washington How does the.
Modelling genome structure and function Ram Samudrala University of Washington.
Modelling Genome Structure and Function Ram Samudrala University of Washington.
Modelling proteomes Ram Samudrala University of Washington How does the genome of an organism specify its behaviour and characteristics?
Challenges and accomplishments in molecular prediction Yanay Ofran.
MODELLING PROTEOMES RAM SAMUDRALA ASSOCIATE PROFESSOR UNIVERSITY OF WASHINGTON How does the genome of an organism specify its behaviour and characteristics?
Molecular Modeling in Drug Discovery: an Overview
Bioinformatics Overview
Applications of molecular simulation in materials science and biology
University of Washington
MODELLING INTERACTOMES
Modelling the rice proteome
MODELLING INTERACTOMES
University of Washington
Molecular Docking Profacgen. The interactions between proteins and other molecules play important roles in various biological processes, including gene.
Extra Tree Classifier-WS3 Bagging Classifier-WS3
Introduction to Bioinformatics II
How does the genome of an organism
Protein structure prediction.
Presentation transcript:

COMPUTATIONAL ENGINEERING OF BIONANOSTRUCTURES RAM SAMUDRALA ASSOCIATE PROFESSOR UNIVERSITY OF WASHINGTON How can we design peptides and proteins capable of interacting with inorganic substrates with specific selectivity and affinity?

MOTIVATION The functions necessary for life are undertaken by proteins. Protein function is mediated by protein three-dimensional structure. A number of semi-accurate computational methodologies have been developed for the analysis and modelling of the sequences and structures of naturally occurring proteins. We can harness these knowledge- and biophysics-based computational methodologies to design peptides and proteins capable of interacting inorganic substrates with specific affinity and selectivity. Goal is to develop generalised computational techniques to construct molecular building blocks based on peptides and proteins that can be easily assembled to design higher order structures. Applications in the area of medicine, nanotechnology, and biological computing.

BACKGROUND: STRUCTURE 0.5 Å C α RMSD for 173 residues (60% identity) T0290 – peptidyl-prolyl isomerase from H. sapiens T0364 – hypothetical from P. putida 5.3 Å C α RMSD for 153 residues (11% identity) T0332 – methyltransferase from H. sapiens 2.0 Å C α RMSD for 159 residues (23% identity) T0288 – PRKCA-binding from H. sapiens 2.2 Å C α RMSD for 93 residues (25% identity) Liu/Hong-Hung/Ngan

BACKGROUND: FUNCTION Wang/Cheng Ion binding energy prediction with a correlation of 0.7 Calcium ions predicted to < 0.05 Å RMSD in 130 cases Meta-functional signature for DXS model from M. tuberculosis Meta-functional signature accuracy

BACKGROUND: INTERACTION McDermott/Wichadakul/Staley/Horst/Manocheewa/Jenwitheesuk/Bernard BtubA/BtubB interolog model from P. dejongeii (35% identity to eukaryotic tubulins) Transcription factor bound to DNA promoter regulog model from S. cerevisiae Prediction of binding energies of HIV protease mutants and inhibitors using docking with dynamics

APPLICATION: DRUG DISCOVERY HSV KHSV CMV Computionally predicted broad spectrum human herpesvirus protease inhibitors is effective in vitro against members from all three classes and is comparable or better than anti-herpes drugs HSV Our protease inhibitor acts synergistically with acylovir (a nucleoside analogue that inhibits replication) and it is less likely to lead to resistant strains compared to acylovir Lagunoff/Jenwitheesuk

How can we using this knowledge to design peptides and proteins capable of interacting with inorganic substrates with specific selectivity and affinity? BACK TO THE RELEVANT QUESTION

KNOWLEDGE-BASED DESIGN Proteins that are evolutionarily related generally have similar sequences, structures, and functions. We hypothesised that this applies to experimentally discovered peptides capable of binding to inorganic substrates. We then examined similarity of sequences between experimentally discovered peptides and random peptide sequences using standard sequence comparison tools. Random peptide sequences most similar to a particular group of experimentally discovered peptides were considered to possess the same functional property. Some examples of experimentally discovered peptides (from Mehmet Sarikaya’s group): Quartz binders: RLNPPSQMDPPF QTWPPPLWFSTS LTPHQTTMAHFL Hydroxyapatite binders: MLPHHGA TTTPNRA PVAMPHW Oren/Tamerler/Sarikaya

OPTIMISATION OF SCORING MATRICES (QUARTZ) We perturbed the PAM 250 scoring matrix systematically to produce a higher strong-strong self-similarity and lower strong-weak cross-similarity score, and backtested the predictive power of the new QUARTZ I matrix. Oren/Tamerler/Sarikaya

EXPERIMENTAL VERIFICATION (QUARTZ) Three sets of experiments were performed by Mehmet Sarikaya’s group to validate the computationally designed sequences. Oren/Tamerler/Sarikaya

DESIGN OF SECOND GENERATION MATRICES Oren/Tamerler/Sarikaya

HA12 (12 aa linear) 49: 16S, 20M, 13W HA7 (7 aa constrained) 56: 12S, 27M, 17W 1.M L P H H G A 2.T T T P N R A 3.P V A M P H W 4.N N N Y S R H 5.P K D A V P A 6.P S F D N G F 7.Q L I P V S N 8.L T Q S D H P 9.H S P S N P S 10.R T N Q P Q K 11.D P Q Y G Q H 12.N S G S R H H 13.T P P H H Q P 14.H Q H N M K I 15.M H P T H T T 16.H P A T I E D 17.S G Q I S L L 18.S G S P V P N 19.D N T S D M V 20.S S W Q R L R 21.Q N K D F Q K 22.H Q E S H P P 23.P H H H H Q P 24.S N Y F A E M 25.Q S S H S F L 26.A I N D T N Q 27.P T T P N E Q 28.S M K V P S S 29.S V E E R G S 30.N E S F T G A 31.Y P T Q T T D 32.I Y E V N T E 33.S P Q T P S R 34.S D N T V R Y 35.S M I P P Y R 36.V L T P T Q S 37.R P I V H H Q 38.M W R D S K P 39.H Q T H H P Q 40.T G L Q N S S 41.L S P K P Q L 42.N P G F A Q A 43.G I G Q P Q A 44.M I F L R V V 45.T A H A M L Y 46.H L P I P S A 47.M G A G R A A 48.S I H S R D T 49.T F H K W P S 50.S T W I P E F 51.P S S P L Q S 52.H L H Q Q N T 53.Q L Q L L Q S 54.R T T P S Y H 55.T T H Q E A P 56.Y P P R S N T 1.S P T K P T P P R S S Q 2.T S T N Y W L Y S S E S 3.V P F Q F K V T G D P L 4.A F S Q L K G F Y S R Y 5.E F Y T P T G L P P G R 6.H T V N R S M D V P G V 7.N T P A H A N A D F F D 8.A S G A K P W T S D L H 9.I P M T P S Y D S H I L 10.H A P Y K S H V W T E Q 11.A F A Y R D N L S M H P 12.L L A D T T H H R P W T 13.H W G E I P S R L S L P 14.L D T Q F I K P P Q K S 15.S V A A L F R H V P G H 16.N G W W T A S P G V P M 17.W K W L Y D L V T P T I 18.N E Y Y I H Q V H P P T 19.G E E L G N R L A R I T 20.S Q P F W M L S R V L A 21.D L F S V H W P P L K A 22.A T S H L H V R L P S R 23.T L V P K N E T P L S S 24.L S A A S H L H T S S S 25.I I P S Q Q Q S L M A P 26.Q I P S Y W P R G P G G 27.S S L H A L H P F G A V 28.Q S T T V L H A S P T L 29.K L P Y A L E L S G T V 30.K F L S L P P P T R S G 31.V A S P E R T S P A F P 32.E S A Q L N R T L Q L P 33.I D M S R L E S Y T L P 34.N H Q G V L S V H G S L 35.H Y L P K N V R T S L Q 36.T L P S P L A L L T V H 37.L S P L H Q L N S S V N 38.S P S M L T S M W P N T 39.N L P S P L I P A S S P 40.S L S P T R S L Y E A T 41.N I S D T L N R S R W K 42.Q S Y S S M L Y P S P F 43.A Q S Q M M S A Q F R P 44.E L L A P R G S L N T G 45.T T N S H E F P P G Q S 46.Y D E I L G A A P S L K 47.T P G E Y L R L A T G R 48.G A Q Q L N S M H P E H 49.R P L E S R T P L Y L P Oren/Tamerler/Sarikaya KNOWLEDGE-BASED DESIGN (HA)

HA12 I HA7 I HA_7 (7 aa constrained) HA12 (12 aa linear) HA12 I HA7 I Oren/Tamerler/Sarikaya BACKTESTING (HA)

CASE STUDY: AMELOGENIN Principal protein involved in enamel formation. Multifunction protein –Mineralization. –Signaling. –Adhesion to process matrix. –Physical protein-protein interactions. Never been crystallised (irregular / unstable?). –Most proteins with non-repeating sequence are active in globular form. –Many proteins fold into globular form upon interaction with substrate / interactor. –Assumption of linear and globular forms. –Start with protein structure prediction.

CASE STUDY: AMELOGENIN STRUCTURE Predicted five models (typical for CASP). Annotate structure with experimental and simulation evidence to find best predicted globular structure and infer function.

MGTWILFACLLGAAFAMPLPPHPGSPGYINLSYEKSHSQAINTDRTALVLTPLKWYQSMIRQPYPSYGYEPMGGWLHHQIIPVLSQQHPPSHTLQPHHHLPVVPAQQPVA PQQPMMPVPGHHSMTPTQHHQPNIPPSAQQPFQQPFQPQAIPPQSHQPMQPQSPLHPMQPLAPQPPLPPLFSMQPLSPILPELPLEAWPATDKTKREEVD Signal Region Exon 4 CASE STUDY: AMELOGENIN FUNCTION Horst/Oren/Cheng/Wang

MGTWILFACLLGAAFAMPLPPHPGSPGYINLSYEKSHSQAINTDRTALVLTPLKWYQSMIRQPYPSYGYEPMGGWLHHQIIPVLSQQHPPSHTLQPHHHLPVVPAQQPVA PQQPMMPVPGHHSMTPTQHHQPNIPPSAQQPFQQPFQPQAIPPQSHQPMQPQSPLHPMQPLAPQPPLPPLFSMQPLSPILPELPLEAWPATDKTKREEVD PV 2. HPPSHTLQPHHHLPVV 3. VPGHHSMTPTQH 1. LFACLLGAAFAMPLP 2. PGYINLSYEKSHSQAINTDRTA 3. LPPLFSMQPLSPILPELPLEAWPAT MOUSE AMELOGENIN STRUCTURAL ANALYSIS Model 3Model 4Model 5Model 1Model 2 CASE STUDY: AMELOGENIN – WHAT IT DOES Horst/Oren/Cheng/Wang

CASE STUDY: AMELOGENIN INTERACTION Horst

Sequences derived from amelogenin: 1.HTLQPHHHLPVV (12) 2.VPGHHSMTPTQH (12) 3.LFACLLGAAFAMPLP (15) 4.HPPSHTLQPHHHLPVV (16) 5.PGYINLSYEKSHSQAINTDRTA (22) 6.LPPLFSMQPLSPILPELPLEAWPAT (25) 7.HPPSHTLQPHHHLPVVPAQQPVAPQQPMMPVPGHHSMTPTQH (42) Oren/Tamerler/Sarikaya CASE STUDY: AMELOGENIN – HA BINDING

Characterise sequences and structures of naturally occurring proteins in terms of their total similarity scores using different scoring matrices. This will produce a database of sequences with predicted and known structures with specific selectivity and affinity to different inorganics. This database can be analysed for atom-atom preferences, torsion angle preferences, and other characteristics to define energy functions and move sets for performing protein structure simulations. We will combine this with our all-atom energy function capable of handling inorganics and our protein structure simulation software. Design higher order protein-like scaffolds with specific functionalities: BIOPHYSICS-BASED DESIGN Strong quartz binding region Strong hydroxyapatite binding region Active site

People: Ersin Emre Oren Jeremy Horst Samudrala group Mehmet Sarikaya and his group Candan Tamerler-Behar and her group Support from: National Institutes of Health National Science Foundation Kinship Foundation (Searle Scholars Program) Defense University Research Initiative on NanoTechnology Genetically Engineered Materials Science and Engineering Center Puget Sound Partners in Global Health (Gates Foundation) UW Technologies Initiative UW Technology Gap Research Fund Washington Research Fund ACKNOWLEDGEMENTS