Home - Distributed Parallel Protein folding Chris Garlock.

Slides:



Advertisements
Similar presentations
A Dynamic World, what can Grids do for Multi-Core computing? Daniel Goodman, Anne Trefethen and Douglas Creager
Advertisements

1 Chapter 1 Why Parallel Computing? An Introduction to Parallel Programming Peter Pacheco.
SUPERCOMPUTER TO THE RESCUE Justin Curry EKU, Dept. of Technology, CEN/CET)
Software Demonstration and Course Description P.N. Lai.
Dinker Batra CLUSTERING Categories of Clusters. Dinker Batra Introduction A computer cluster is a group of linked computers, working together closely.
How mechanical force regulates the function of proteins? Vesa Hytönen Research group of Professor Viola Vogel, Biologically Oriented.
Graphical Models for Protein Kinetics Nina Singhal CS374 Presentation Nov. 1, 2005.
Atomistic Protein Folding Simulations on the Submillisecond Timescale Using Worldwide Distributed Computing Qing Lu CMSC 838 Presentation.
1 CIS450/IMSE450/ECE478 Operating Systems Winter 2003 Professor Jinhua Guo.
Virtual Machines for HPC Paul Lu, Cam Macdonell Dept of Computing Science.
Beowulf Cluster Computing Each Computer in the cluster is equipped with: – Intel Core 2 Duo 6400 Processor(Master: Core 2 Duo 6700) – 2 Gigabytes of DDR.
Final Project Week 3 - 5/7/09 GSEA and Cluster Computing in Protein Research Leon Kay, Yan Tran, Chris Thomas Yan Gary Chris Leon.
Silberschatz, Galvin and Gagne  Operating System Concepts Chapter 1: Introduction What is an Operating System? Mainframe Systems Desktop Systems.
1/16/2008CSCI 315 Operating Systems Design1 Introduction Notice: The slides for this lecture have been largely based on those accompanying the textbook.
DB2 (Express C Edition) Installation and Using a Database
Modeling molecular dynamics from simulations Nina Singhal Hinrichs Departments of Computer Science and Statistics University of Chicago January 28, 2009.
High Throughput Computing with Condor at Purdue XSEDE ECSS Monthly Symposium Condor.
1 port BOSS on Wenjing Wu (IHEP-CC)
Free energies and phase transitions. Condition for phase coexistence in a one-component system:
 Introduction to Operating System Introduction to Operating System  Types Of An Operating System Types Of An Operating System  Single User Single User.
UNIT - 1Topic - 2 C OMPUTING E NVIRONMENTS. What is Computing Environment? Computing Environment explains how a collection of computers will process and.
Artdaq Introduction artdaq is a toolkit for creating the event building and filtering portions of a DAQ. A set of ready-to-use components along with hooks.
 H.M.BILAL Operating System Concepts.  What is an Operating System?  Mainframe Systems  Desktop Systems  Multiprocessor Systems  Distributed Systems.
Statistical Physics of the Transition State Ensemble in Protein Folding Alfonso Ramon Lam Ng, Jose M. Borreguero, Feng Ding, Sergey V. Buldyrev, Eugene.
PROTEINS PROTEINS Levels of Protein Structure.
Investigating Protein Conformational Change on a Distributed Computing Cluster Christopher Woods Jeremy Frey Jonathan Essex University.
Scheduling Many-Body Short Range MD Simulations on a Cluster of Workstations and Custom VLSI Hardware Sumanth J.V, David R. Swanson and Hong Jiang University.
Wenjing Wu Computer Center, Institute of High Energy Physics Chinese Academy of Sciences, Beijing BOINC workshop 2013.
Problem is to compute: f(latitude, longitude, elevation, time)  temperature, pressure, humidity, wind velocity Approach: –Discretize the.
April 26, CSE8380 Parallel and Distributed Processing Presentation Hong Yue Department of Computer Science & Engineering Southern Methodist University.
Anton Supercomputer Brandon Dean 4/28/15. History Named after Antonie van Leeuwenhoek – “father of microbiology” Molecular Dynamics (MD) simulations were.
Distributed Computing CSC 345 – Operating Systems By - Fure Unukpo 1 Saturday, April 26, 2014.
Distributed Computing Systems CSCI 4780/6780. Geographical Scalability Challenges Synchronous communication –Waiting for a reply does not scale well!!
Co-ordination & Harmonisation of Advanced e-Infrastructures for Research and Education Data Sharing Research Infrastructures – Proposal n GROMACs.
1/20 Study of Highly Accurate and Fast Protein-Ligand Docking Method Based on Molecular Dynamics Reporter: Yu Lun Kuo
GPUs: Overview of Architecture and Programming Options Lee Barford firstname dot lastname at gmail dot com.
Modelling proteins and proteomes using Linux clusters Ram Samudrala University of Washington.
Molecular Modelling - Lecture 2 Techniques for Conformational Sampling Uses CHARMM force field Written in C++
Design Issues of Prefetching Strategies for Heterogeneous Software DSM Author :Ssu-Hsuan Lu, Chien-Lung Chou, Kuang-Jui Wang, Hsiao-Hsi Wang, and Kuan-Ching.
Operating System. Chapter 1: Introduction What is an Operating System? Mainframe Systems Desktop Systems Multiprocessor Systems Distributed Systems Clustered.
The Effects of Parallel Programming on Gaming Anthony Waterman.
Dynameomics: Protein Mechanics, Folding and Unfolding through Large Scale All-Atom Molecular Dynamics Simulations INCITE 6 David A. C. Beck Valerie Daggett.
Computer System Evolution. Yesterday’s Computers filled Rooms IBM Selective Sequence Electroinic Calculator, 1948.
1/50 University of Turkish Aeronautical Association Computer Engineering Department Ceng 541 Introduction to Parallel Computing Dr. Tansel Dökeroğlu
Mobile Analyzer A Distributed Computing Platform Juho Karppinen Helsinki Institute of Physics Technology Program May 23th, 2002 Mobile.
Cluster Based Protein Folding Douglas Fuller and Brandon McKethan.
Chapter 1: Introduction What is an Operating System? Mainframe Systems Desktop Systems Multiprocessor Systems Distributed Systems Clustered System Real.
CernVM and Volunteer Computing Ivan D Reid Brunel University London Laurence Field CERN.
GROMACs GROningen MAchine for Chemical Simulations Eliana Bohorquez Puentes Nicolas Ortiz Gonzalez.
Chapter 1: Introduction
The Accelerated Weighted Ensemble
Chapter 1: Introduction
Chapter 1: Introduction
Turing Machines.
Chapter 1: Introduction
Chapter 1: Introduction
Chapter 1: Introduction
Home - Distributed Parallel Protein folding
distributed computing
Chapter 1: Introduction
The Design of a Grid Computing System for Drug Discovery and Design
Language Processors Application Domain – ideas concerning the behavior of a software. Execution Domain – Ideas implemented in Computer System. Semantic.
Large Time Scale Molecular Paths Using Least Action.
Dr. Tansel Dökeroğlu University of Turkish Aeronautical Association Computer Engineering Department Ceng 442 Introduction to Parallel.
Chapter 1: Introduction
Chapter 1: Introduction
Chapter 1: Introduction
Experimental Overview
Chapter 1: Introduction
Chapter 1: Introduction
Presentation transcript:

Home - Distributed Parallel Protein folding Chris Garlock

Protein Folding - Why is it important  Proteins are biological nano-machines which play apart in all of our bodies functions  Protein folding is the process all proteins undergo to assemble into their native structure  Strive for states of low free energy  Sometimes proteins misfold, and misfolded proteins can clump together or aggregate, which can cause serious health problems  Alzheimers, Cystic Fibrosis, Several types of cancer and more

How is protein folding simulated on a computer? Atomic level simulations Newtonian mechanics (lots of numerical integration) Proteins can fold in many ways, so a statistical model is needed to represent all of the possibilities accurately Markov State Models Set of states (shapes) Transition rates between states

Problems with serial protein folding Modern computers can simulate ~50 nanoseconds of protein assembly in 24 hours Many proteins fold on the millisecond timescale (1,000,000 nanoseconds) It would take 20,000 days or ~55 years to simulate folding for just one protein!

Folding At Home uses donated computational power from otherwise idle processors across the globe Anyone who wants to contribute to the project just needs to download the client program, and the server will give them work units to complete, once a work unit is finished, the client will return results to the server and get a new work unit 500,000 processor cores outputting 19,900 TFLOP/S to help simulate protein folding This is faster than the Titan supercomputer!

Challenges with distributed computing Heterogeneous processors clients can run on Windows, Linux, OSX, Android, and PS3. A special client can also be downloaded to run on a rack of computers or a cluster of GPU’s Extremely slow communication between processors WAN connections Processors can be unreliable What happens if a host powers down their machine in the middle of a work unit?

How to parallelize folding simulations 1. To start a simulation project, first choose some initial conformations (protein shapes). 2. Each conformation becomes the starting point for some simulations which together are called a run 3. Within each run we launch many different trajectories each called a clone All clones in a run start with the same conformation, but different initial velocities for the atoms involved 4. Because each clone takes large amounts of time to execute, clones are further divided into generations. Generations have to be run serially. 5. Some clones may find additional conformations (states of equilibrium) in which case new runs are started from those conformations 6. Repeat steps 2-5 until the Markov State Model is complete

A Generate Initial Conformation Start a run for A (comprised of 5 clones) B C Discover additional Conformations Start a run for B and C D Discover additional Conformations and new pathways E Start a run for D and E Discover a misfold condition

Results The results of simulations have been verified experimentally several times The drug design industry has begun to use information from simulations to narrow down the number of molecules they test experimentally. This allows drug designers to be more thorough when evaluating a molecule, and improves their throughput. The Folding at Home project has been active for 15 years, and over 100 papers have been published about its findings

Questions?