Cluster Based Protein Folding Douglas Fuller and Brandon McKethan.

Slides:



Advertisements
Similar presentations
1 NAMD - Scalable Molecular Dynamics Gengbin Zheng 9/1/01.
Advertisements

Abhinav Bhatele, Laxmikant V. Kale University of Illinois at Urbana-Champaign Sameer Kumar IBM T. J. Watson Research Center.
OpenFOAM on a GPU-based Heterogeneous Cluster
A COMPLEX NETWORK APPROACH TO FOLLOWING THE PATH OF ENERGY IN PROTEIN CONFORMATIONAL CHANGES Del Jackson CS 790G Complex Networks
ECE669 L4: Parallel Applications February 10, 2004 ECE 669 Parallel Computer Architecture Lecture 4 Parallel Applications.
StreamMD Molecular Dynamics Eric Darve. MD of water molecules Cutoff is used to truncate electrostatic potential Gridding technique: water molecules are.
Graphical Models for Protein Kinetics Nina Singhal CS374 Presentation Nov. 1, 2005.
Atomistic Protein Folding Simulations on the Submillisecond Timescale Using Worldwide Distributed Computing Qing Lu CMSC 838 Presentation.
High Performance Computing 1 Parallelization Strategies and Load Balancing Some material borrowed from lectures of J. Demmel, UC Berkeley.
P2P-based Simulator for Protein Folding Shun-Yun Hu 2005/06/03.
Chapter 14 Simulation. Monte Carlo Process Statistical Analysis of Simulation Results Verification of the Simulation Model Computer Simulation with Excel.
Stochastic Roadmap Simulation: An Efficient Representation and Algorithm for Analyzing Molecular Motion Mehmet Serkan Apaydin, Douglas L. Brutlag, Carlos.
Chapter 13 Finite Difference Methods: Outline Solving ordinary and partial differential equations Finite difference methods (FDM) vs Finite Element Methods.
Topology Aware Mapping for Performance Optimization of Science Applications Abhinav S Bhatele Parallel Programming Lab, UIUC.
Sort-Last Parallel Rendering for Viewing Extremely Large Data Sets on Tile Displays Paper by Kenneth Moreland, Brian Wylie, and Constantine Pavlakos Presented.
Bioinf. Data Analysis & Tools Molecular Simulations & Sampling Techniques117 Jan 2006 Bioinformatics Data Analysis & Tools Molecular simulations & sampling.
Modeling molecular dynamics from simulations Nina Singhal Hinrichs Departments of Computer Science and Statistics University of Chicago January 28, 2009.
Efficient Parallel Implementation of Molecular Dynamics with Embedded Atom Method on Multi-core Platforms Reporter: Jilin Zhang Authors:Changjun Hu, Yali.
Energy Profiling And Analysis Of The HPC Challenge Benchmarks Scalable Performance Laboratory Department of Computer Science Virginia Tech Shuaiwen Song,
CRB Journal Club February 13, 2006 Jenny Gu. Selected for a Reason Residues selected by evolution for a reason, but conservation is not distinguished.
CCA Common Component Architecture Manoj Krishnan Pacific Northwest National Laboratory MCMD Programming and Implementation Issues.
Scheduling Many-Body Short Range MD Simulations on a Cluster of Workstations and Custom VLSI Hardware Sumanth J.V, David R. Swanson and Hong Jiang University.
Performance Model & Tools Summary Hung-Hsun Su UPC Group, HCS lab 2/5/2004.
Protein Folding Programs By Asım OKUR CSE 549 November 14, 2002.
Labs start today, or Thursday Inquiry 1 proposal due 9/9 or 9/13 Class communication via Blackboard, webpage, and/or .
Research Into the Time Reversal of Cellular Automata Team rm -rf / Daniel Kaplun, Dominic Labanowski, Alex Lesman.
Today: A little more Strong Inference Why are you connected to polar bears? How do we know that anyone else knows anything?
Parallelization of 2D Lid-Driven Cavity Flow
Overcoming Scaling Challenges in Bio-molecular Simulations Abhinav Bhatelé Sameer Kumar Chao Mei James C. Phillips Gengbin Zheng Laxmikant V. Kalé.
Proteins Chapter 5 Big Idea #2: Structure Meets Function.
Computational Aspects of Multi-scale Modeling Ahmed Sameh, Ananth Grama Computing Research Institute Purdue University.
CS 484 Designing Parallel Algorithms Designing a parallel algorithm is not easy. There is no recipe or magical ingredient Except creativity We can benefit.
What is Life? ASTR 4: Life in the Universe. Discussion Topics Three Domains of Life Basic Cell Structure Macro-molecules Properties of Life.
Home - Distributed Parallel Protein folding Chris Garlock.
CS 484 Load Balancing. Goal: All processors working all the time Efficiency of 1 Distribute the load (work) to meet the goal Two types of load balancing.
Application of the MCMC Method for the Calibration of DSMC Parameters James S. Strand and David B. Goldstein The University of Texas at Austin Sponsored.
Parallel Genetic Algorithms By Larry Hale and Trevor McCasland.
A System Performance Model Distributed Process Scheduling.
Labs start next week (first reflection due)
MSc in High Performance Computing Computational Chemistry Module Parallel Molecular Dynamics (i) Bill Smith CCLRC Daresbury Laboratory
BMC Bioinformatics 2005, 6(Suppl 4):S3 Protein Structure Prediction not a trivial matter Strict relation between protein function and structure Gap between.
What is Protein Folding? Implications of Misfolding Computational Techniques Background image: Staphylococcal protein A, Z Domain (
Dr. Stuart Reichler Bio 311D. DateSubjectTextbook Chapters Jan 14Introduction, Philosophy of Science 16Strong Inferencearticle on webpage 18, 23Meiosis12,
Strong Scalability Analysis and Performance Evaluation of a SAMR CCA-based Reacting Flow Code Sophia Lefantzi, Jaideep Ray and Sameer Shende SAMR: Structured.
COMP7330/7336 Advanced Parallel and Distributed Computing Task Partitioning Dr. Xiao Qin Auburn University
Parallel Molecular Dynamics A case study : Programming for performance Laxmikant Kale
BIO409/509 Cell and Molecular Biology.
Massively Parallel Cosmological Simulations with ChaNGa Pritish Jetley, Filippo Gioachin, Celso Mendes, Laxmikant V. Kale and Thomas Quinn.
Ioannis E. Venetis Department of Computer Engineering and Informatics
ChaNGa: Design Issues in High Performance Cosmology
Modeling molecular dynamics from simulations
CS 584 Lecture 3 How is the assignment going?.
Studying Biology: Start with a question. How do we get answers?
Parallel Algorithm Design
Permeability of gases in glassy polymers by computer simulation
Dr. Stuart Reichler Bio 325.
Studying Biology: Start with a question. How do we get answers?
Home - Distributed Parallel Protein folding
distributed computing
Computer simulation of cell aggregation & growth on a tissue scaffold
Numerical Algorithms Quiz questions
By Brandon, Ben, and Lee Parallel Computing.
IXPUG, SC’16 Lightning Talk Kavitha Chandrasekar*, Laxmikant V. Kale
Ph.D. Thesis Numerical Solution of PDEs and Their Object-oriented Parallel Implementations Xing Cai October 26, 1998.
Discrete-time markov chain (continuation)
Gary M. Zoppetti Gagan Agrawal Rishi Kumar
Pulse Propagation in Granular Media
Studying Biology: Start with a question. How do we get answers?
Monte carlo simulations on mixed resolution protein models
Studying Biology: Start with a question. How do we get answers?
Presentation transcript:

Cluster Based Protein Folding Douglas Fuller and Brandon McKethan

Overview: What is  Screensaver-based distributed computing program run by Stanford  Utilizes unused processing power to fold proteins through a finite number of frames  The same work units are run on numerous computers to confirm accuracy

What Does Accomplish?  Helps in showing the process of linear amino acid chain to 3D protein structure  Is used on proteins involved with many diseases in order to elucidate how misfolding occurs  Will eventually lead to mutation-to- phenotype simulations

Computational Aspects  Monte Carlo simulation using lowest energy state calculations  Non-parallel, unimolecular program  Heuristic approaches

Implications of Heuristics/Unimolecular  The environment of the cell and molecular interactions  Solvents and extramolecular interactions cannot be ignored in the process of folding  Many diseases arise from misfolding that is not influenced by the internal energy state

Diseases of Protein Misfolding May Require Multimolecular Interactions  Cancers – B-RAF, Hsp-90 and 17-AAG  Prions – Infectious Proteins  BSE (Mad Cow Disease)  Kuru  Sheep Scrapie

Computational Weight of Multimolecular Interactions  The number of energy states and inter/intra-molecular interactions are much higher than unimolecular  Pushes the computational return time above appreciable limits for the project  Desktop computing and the Lowest Common Denominator

Cluster Based Folding and Future Aspects of Folding  Use cluster computing as the testing ground for truly parallel simulations  Individual proteins are discrete units  Allows the program to be refined while highly parallel desktop computing comes to fruition  5-10 year timeframe

Post-Multithreading Possibilities  Next truly discrete unit is the atom itself  Atom-per-processor modeling vs. Monte Carlo  Requires incredibly high number of processors – 100’s of thousands  Once again clusters provide testing ground

Parallelizing a computation  Considered “re-bugging” your code  Distribute work to multiple processors  Requires communication to deal with dependencies  Requires computation to distribute work and recombine results  Now what?

Domain Decomposition  Decide how to divide work  Spatially  Temporally  Other?  Introduces overhead  Can pessimize instead of optimize

Cheat!  “Embarassingly parallel” code  Splits naturally into small pieces  Small pieces can ignore each other  Small pieces can be computed by a single node   Problem: fold all proteins they care about  Decomposition: individual proteins  Dependencies: none!

Domain Decomposition: Challenges  Analyze dependencies  Communication patterns  Communication volume  Data distribution  Overlap computation/communication  Consider system characteristics  Communication latency/bandwidth  Computational efficiency  Computation/communication ratio  Do this all ahead of time?

Domain Decomposition: Pitfalls  Parallel overhead  Computation waiting on communication  Feed-forward dependencies  Dynamic decomposition schemes  Pick two: performance, portability, scalability