O AK R IDGE N ATIONAL L ABORATORY U.S. D EPARTMENT OF E NERGY Cluster Computing Applications Project Parallelizing BLAST Research Alliance of Minorities.

Slides:



Advertisements
Similar presentations
Will Minter Division Director, Asset Management & Small Business Programs Office November 15, 2006 ITER Project.
Advertisements

O AK R IDGE N ATIONAL L ABORATORY U. S. D EPARTMENT OF E NERGY Center for Computational Sciences Cray X1 and Black Widow at ORNL Center for Computational.
O AK R IDGE N ATIONAL L ABORATORY U. S. D EPARTMENT OF E NERGY Weigh-in-Motion (WIM) with Rational Rose Sabrina A. Phillips Mississippi Valley State University.
Attack Graphs for Proactive Digital Forensics Tara L. McQueen Delaware State University Louis P. Wilder Computational Sciences and Engineering Division.
I would like to thank Louis P. Wilder and Dr. Joseph Trien for the opportunity to work on this project and for their continued support. The Research Alliance.
Weigh-in-Motion User Manual for WIM Integrated System Cindy Lopez City University of New York-York College Research Alliance in Math and Science (RAMS)
Information Technology Center Introduction to High Performance Computing at KFUPM.
Presented by: Yash Gurung, ICFAI UNIVERSITY.Sikkim BUILDING of 3 R'sCLUSTER PARALLEL COMPUTER.
First Lego League of Tennessee Quentoria Leeks Fisk University Research Alliance in Math and Science Computer Applications and Web Technologies Networking.
Presented to George Seweryniak Mathematical, Information, and Computational Sciences Erin A. Lennartz Virginia Polytechnic Institute and State University.
Managed by UT-Battelle for the Department of Energy 1 Mathematical Modeling of Fatty Acid Oxidation in Skeletal Muscle Cells Sheds New Light on Obesity.
Linux Platform  Download the source tar ball from the BLAST source code link  ncbi-blast src.tar.gz  Compilation  cd /BLASTdirectory/c++ ./configure.
Jeff Shen, Morgan Kearse, Jeff Shi, Yang Ding, & Owen Astrachan Genome Revolution Focus 2007, Duke University, Durham, North Carolina Introduction.
Neutron Scattering Experiment Automation with Python RT2010 Conference, Lisbon, Portugal (PCM-26) Piotr Żołnierczuk, Rick Riedel Neutron Scattering Science.
O AK R IDGE N ATIONAL L ABORATORY U.S. D EPARTMENT OF E NERGY 1 Identifying Regulatory Transcriptional Elements on Functional Gene Groups Using Computer-
Beowulf Cluster Computing Each Computer in the cluster is equipped with: – Intel Core 2 Duo 6400 Processor(Master: Core 2 Duo 6700) – 2 Gigabytes of DDR.
Cluster Computing Applications Project: Parallelizing BLAST The field of Bioinformatics needs faster string matching algorithms. What Exactly is BLAST?
Cluster Computer For Bioinformatics Applications Nile University, Bioinformatics Group. Hisham Adel 2008.
07/14/08. 2 Points Introduction. Cluster and Supercomputers. Cluster Types and Advantages. Our Cluster. Cluster Performance. Cluster Computer for Basic.
CPP Staff - 30 CPP Staff - 30 FCIPT Staff - 35 IPR Staff IPR Staff ITER-India Staff ITER-India Staff Research Areas: 1.Studies.
The Evaluation of an Embedded System for First Responders Nicholas Brabson The University of Tennessee David Hill Computational Sciences and Engineering.
Oak Ridge National Laboratory — U.S. Department of Energy 1 The ORNL Cluster Computing Experience… John L. Mugler Stephen L. Scott Oak Ridge National Laboratory.
Weigh-in-Motion User Manual For WIM Integrated System Cindy Lopez City University of New York – York College Research Alliance in Math and Science Computational.
SSI-OSCAR A Single System Image for OSCAR Clusters Geoffroy Vallée INRIA – PARIS project team COSET-1 June 26th, 2004.
Methods  OpenGL Functionality Visualization Tool Functionality 1)3D Shape/Adding Color1)Atom/element representations 2)Blending/Rotation 2)Rotation 3)Sphere.
Tiffany M. Marshall Saint Mary-of-the-Woods College Mentors : Tim McKnight Measurement Science and Systems.
O AK R IDGE N ATIONAL L ABORATORY U.S. D EPARTMENT OF E NERGY Nanoscale Electronics / Single-Electron Transport in Quantum Dot Arrays Dene Farrell SUNY.
Integrating Visualization Peripherals into Power-Walls and Similar Tiled Display Environments James Da Cunha Savannah State University Research Alliance.
DynamicBLAST on SURAgrid: Overview, Update, and Demo John-Paul Robinson Enis Afgan and Purushotham Bangalore University of Alabama at Birmingham SURAgrid.
The Effects of Radio Propagation in the Workplace Carolyn Jo Shields Research Alliance in Math and Science Information Technology Services Division, Oak.
O AK R IDGE N ATIONAL L ABORATORY U. S. D EPARTMENT OF E NERGY RobustMap: A Fast and Robust Algorithm for Dimension Reduction and Clustering Lionel F.
Can Thermal Reactor Recycle Eliminate the Need for Multiple Repositories? C. W. Forsberg, E. D. Collins, C. W. Alexander, and J. Renier Actinide and Fission.
Networking and Computing Technologies Division Becky Verastegui December 6, 2004 RAMS Workshop.
United States Grid Security and Reliability Control in High Load Conditions Christopher Lanclos—Mississippi Valley State University Research Alliance in.
OAK RIDGE NATIONAL LABORATORY U.S. DEPARTMENT OF ENERGY Parallel Solution of 2-D Heat Equation Using Laplace Finite Difference Presented by Valerie Spencer.
POSTER TEMPLATES BY: Meta data - data that provides information about data.Meta data - data that provides information about.
Introduction Relationship between climate and health widely studied Climatic temperature stress increases cardiovascular disease risk Solar UV radiation.
Lionel F. Lovett, II Jackson State University Research Alliance in Math and Science Computer Science and Mathematics Division Mentors: George Ostrouchov.
Oak Ridge National Laboratory — U.S. Department of Energy 1 The ORNL Cluster Computing Experience… Stephen L. Scott Oak Ridge National Laboratory Computer.
Managed by UT-Battelle for the Department of Energy 1 Advanced Brain-Wave Analysis For Early Diagnosis of Alzheimer’s Disease (AD) Presented by Jaron Murphy.
Managed by UT-Battelle for the Department of Energy 1 Integrated Catalogue (ICAT) Auto Update System Presented by Jessica Feng Research Alliance in Math.
O AK R IDGE N ATIONAL L ABORATORY U.S. D EPARTMENT OF E NERGY 1 Parallel Solution of the 3-D Laplace Equation Using a Symmetric-Galerkin Boundary Integral.
Report on CSU HPC (High-Performance Computing) Study Ricky Yu–Kwong Kwok Co-Chair, Research Advisory Committee ISTeC August 18,
O AK R IDGE N ATIONAL L ABORATORY U.S. D EPARTMENT OF E NERGY A Comparison of Methods for Aligning Genomic Sequences Ja’Nera Mitchom Fisk University Research.
O AK R IDGE N ATIONAL L ABORATORY U. S. D EPARTMENT OF E NERGY 1 On-line Automated Performance Diagnosis on Thousands of Processors Philip C. Roth Future.
HPCVL High Performance Computing Virtual Laboratory Founded 1998 as a joint HPC lab between –Carleton U. (Comp. Sci.) –Queen’s U. (Engineering) –U. of.
Parametric Study of Mechanical Stress in Abdominal Aortic Aneurysms (AAA) Erin A. Lennartz Virginia Polytechnic Institute and State University Research.
Managed by UT-Battelle for the Department of Energy Flux Coupling Machines and Switched Reluctance Motors to Replace Permanent Magnets in Electric Vehicles.
A Tutorial of Sequence Matching in Oracle Haifeng Ji* and Gang Qian** * Oklahoma City Community College ** University of Central Oklahoma.
Constructing Hexahedral Meshes of Abdominal Aortic Aneurysms for Use in Finite Element Analysis Rowena Ong Vanderbilt University Mentor: Kara Kruse Computational.
METHODS CT scans were segmented and triangular surface meshes generated using Amira. Antiga and Steinman’s method (2004) for automatically extracting parameterized.
Computing Resources at Vilnius Gediminas Technical University Dalius Mažeika Parallel Computing Laboratory Vilnius Gediminas Technical University
On High Performance Computing and Grid Activities at Vilnius Gediminas Technical University (VGTU) dr. Vadimas Starikovičius VGTU, Parallel Computing Laboratory.
O AK R IDGE N ATIONAL L ABORATORY U.S. D EPARTMENT OF E NERGY Facilities and How They Are Used ORNL/Probe Randy Burris Dan Million – facility administrator.
Hormone Replacement Therapy: Friend or Foe? A Retrospective Study for Prospective Research Research Alliance in Math and Science Computational Sciences.
The Research Alliance in Math and Science program is sponsored by the Office of Advanced Scientific Computing Research, Office of Science, U.S. Department.
CCSM3 / HadCM3 Under predict precipitation rate near equator regions CCSM3 under predicts greater in SE U.S. than HadCM3 Methodology and Results Interpolate.
Advanced Brain-Wave Analysis For Early Diagnosis of Alzheimer’s Disease (AD) Jaron Murphy The Ohio State University Research Alliance in Math and Science.
1 Spallation Neutron Source Data Analysis Jessica Travierso Research Alliance in Math and Science Program Austin Peay State University Mentor: Vickie E.
Managed by UT-Battelle for the Department of Energy 1 Decreasing the Artificial Attenuation of the RCSIM Radio Channel Simulation Software Abigail Snyder.
Biosequence Similarity Search on the Mercury System Praveen Krishnamurthy, Jeremy Buhler, Roger Chamberlain, Mark Franklin, Kwame Gyang, and Joseph Lancaster.
O AK R IDGE N ATIONAL L ABORATORY U.S. D EPARTMENT OF E NERGY Data Requirements for Climate and Carbon Research John Drake, Climate Dynamics Group Computer.
Parallelization of a Non-Linear Analysis Code Lee Hively and Jim Nutaro (mentors) Computational Sciences and Engineering Travis Whitlow Research Alliance.
Detecting Undesirable Insider Behavior Joseph A. Calandrino* Princeton University Steven J. McKinney* North Carolina State University Frederick T. Sheldon.
CIP HPC CIP - HPC HPC = High Performance Computer It’s not a regular computer, it’s bigger, faster, more powerful, and more.
Managed by UT-Battelle for the Department of Energy 1 United States Grid Security and Reliability Control in High Load Conditions Presented to Associate.
Source Localization in a Moving Sensor Field Acknowledgements A special thanks to my mentor Dr. Jacob Barhen for his assistance through the duration of.
Regression Testing for CHIMERA Jessica Travierso Austin Peay State University Bronson Messer National Center for Computational Sciences August 2009.
Regression Testing for CHIMERA Jessica Travierso Austin Peay State University Research Alliance in Math and Science National Center for Computational Sciences,
Computing Experience…
Presentation transcript:

O AK R IDGE N ATIONAL L ABORATORY U.S. D EPARTMENT OF E NERGY Cluster Computing Applications Project Parallelizing BLAST Research Alliance of Minorities (RAM), Computer Science and Mathematics Division William Burke York College, City University of New York John Mugler and Stephen Scott Oak Ridge National Laboratory

O AK R IDGE N ATIONAL L ABORATORY U.S. D EPARTMENT OF E NERGY Parallelizing the BLAST Algorithm: Feasible or Not? Bioinformatics Research needs faster text string matching algorithms. The purpose of this project is to analyze the BLAST algorithm: Define the structure of BLAST. State why it is a valuable Bioinformatics tool. Explore parallelizations of BLAST. BLAST matches query string fragments against a target database. Eliminates need to run a full text string comparison. Speeds up search database search time. Several methods of parallelizing BLAST have been explored.

O AK R IDGE N ATIONAL L ABORATORY U.S. D EPARTMENT OF E NERGY Introduction Cluster infrastructure Open Source Cluster Application Resources (OSCAR) Cluster, Command and Control (C3) eXtreme TORC (XTORC) Cluster applications Bioinformatics Toolsets Basic Local Alignment Sequence Tool (BLAST)

O AK R IDGE N ATIONAL L ABORATORY U.S. D EPARTMENT OF E NERGY Infrastructure Overview Red Hat Linux 7.2 OSCAR 1.3 C3 - LAM/MPI - Maui Scheduler - MPICH - OpenSSH - OpenSSL - PBS - PVM - SIS -

O AK R IDGE N ATIONAL L ABORATORY U.S. D EPARTMENT OF E NERGY Red Hat Linux 7.2 Installation Configuration Administration Network Configuration. Performance Monitoring. Creating Scripts.

O AK R IDGE N ATIONAL L ABORATORY U.S. D EPARTMENT OF E NERGY OSCAR 1.3 and C3 Tools OSCAR configures the head node. OSCAR builds and configures compute nodes. C3 reduces time and effort to operate and manage a cluster.

O AK R IDGE N ATIONAL L ABORATORY U.S. D EPARTMENT OF E NERGY eXtreme TORC eXtreme TORC powered by OSCAR 65 Pentium IV Machines Peak Performance: GFLOPS RAM memory: GB Disk Capacity: 2.68 TB Dual interconnects –Gigabit & Fast Ethernet

O AK R IDGE N ATIONAL L ABORATORY U.S. D EPARTMENT OF E NERGY The field of needs faster string Bioinformatics matching algorithms

O AK R IDGE N ATIONAL L ABORATORY U.S. D EPARTMENT OF E NERGY Applications Overview BLAST a Bioinformatics tool. Parallelize BLAST’s algorithm. BLAST

O AK R IDGE N ATIONAL L ABORATORY U.S. D EPARTMENT OF E NERGY BLAST a Bioinformatics Tool What is BLAST? A heuristic algorithm used for string matching query strings to a database. How does BLAST algorithm work? String fragmentation. Statistical means for comparison. How can you parallelize BLAST on a computational cluster?

O AK R IDGE N ATIONAL L ABORATORY U.S. D EPARTMENT OF E NERGY Query word (W = 3) QUERY:GSVEDTTGSQSLAALLNKCKTPQGQRLVNQWIKWPLMDKNRIEERLNLVEAFVEDA PQG 18 neighborhood PEG 15 words PRG 14 PKG 14 PMG 13neighborhood PSG 13score threshold PQN 12( T = 13 ) Etc... QUERY STRING SLAALLNKCKTPQGQWLVNQWIKWPLMDKNRIEERLN L--++K-P-G N n DATABASE STRING GSWNLAALDKDPMGDKNRIEERLNLVEAIKWPLMDJN330 The BLAST Search Algorithm

O AK R IDGE N ATIONAL L ABORATORY U.S. D EPARTMENT OF E NERGY Parallelization of BLAST NBLAST SLRI Bioinformatics Toolkit ParAlign MOBLAST als2000/michalickova.html DNA sequence matching processor PARALIGN™

O AK R IDGE N ATIONAL L ABORATORY U.S. D EPARTMENT OF E NERGY Conclusion  BLAST algorithm has a diverse family of programs.  Several implementations exist for parallelizing the BLAST algorithm.  Future work to include further exploration of the various parallelized BLAST algorithms on clusters.

O AK R IDGE N ATIONAL L ABORATORY U.S. D EPARTMENT OF E NERGY Acknowledgements I would like to extend my thanks to Stephen L. Scott, John Mugler, Thomas Naughton, and Brian Luethke for their invaluable mentoring, Michaelangelo Salcedo for his guidance, Debbie McCoy and Cheryl Hamby for their support in the RAM program.

O AK R IDGE N ATIONAL L ABORATORY U.S. D EPARTMENT OF E NERGY Disclaimer This research was performed under the Research Alliance for Minorities Program administered through the Computer Science and Mathematics Division, Oak Ridge National Laboratory. This Program is sponsored by the Mathematical, Information, and Computational Sciences Division; Office of Advanced Scientific Computing Research; U.S. Department of Energy. Oak Ridge National Laboratory is managed by UT-Battelle, LLC, for the U.S. Department of Energy under contract DE-AC05-00OR This research used resources of the Center for Computational Sciences at Oak Ridge National Laboratory, which is supported by the Office of Science, U.S. Department of Energy. This work has been authored by a contractor of the U.S. Government under contract DE-AC05-00OR Accordingly, the U.S. Government retains a nonexclusive, royalty-free license to publish or reproduce the published form of this contribution, or allow others to do so, for U.S. Government purposes.