A Parallel Structured Ecological Model for High End Shared Memory Computers Dali Wang Department of Computer Science, University of Tennessee, Knoxville.

Slides:



Advertisements
Similar presentations
Pooja ROY, Manmohan MANOHARAN, Weng Fai WONG National University of Singapore ESWEEK (CASES) October 2014 EnVM : Virtual Memory Design for New Memory Architectures.
Advertisements

Slides Prepared from the CI-Tutor Courses at NCSA By S. Masoud Sadjadi School of Computing and Information Sciences Florida.
Program Analysis and Tuning The German High Performance Computing Centre for Climate and Earth System Research Panagiotis Adamidis.
The Charm++ Programming Model and NAMD Abhinav S Bhatele Department of Computer Science University of Illinois at Urbana-Champaign
Thoughts on Shared Caches Jeff Odom University of Maryland.
Dynamic Load Balancing for VORPAL Viktor Przebinda Center for Integrated Plasma Studies.
ATLSS and Uncertainty: relativity and spatially-explicit ecological models as methods to aid management planning in Everglades restoration Louis J. Gross,
Sensitivity Analysis of a Spatially Explicit Fish Population Model Applied to Everglades Restoration Ren é A. Salinas and Louis J. Gross The Institute.
On the Integration and Use of OpenMP Performance Tools in the SPEC OMP2001 Benchmarks Bernd Mohr 1, Allen D. Malony 2, Rudi Eigenmann 3 1 Forschungszentrum.
The new The new MONARC Simulation Framework Iosif Legrand  California Institute of Technology.
1 Software Testing and Quality Assurance Lecture 40 – Software Quality Assurance.
Strategic Directions in Real- Time & Embedded Systems Aatash Patel 18 th September, 2001.
Models for the masses: bringing computational resources for addressing complex ecological problems to stakeholders Louis J. Gross, Eric Carr and Mark.
Leveling the Field for Multicore Open Systems Architectures Markus Levy President, EEMBC President, Multicore Association.
Charm++ Load Balancing Framework Gengbin Zheng Parallel Programming Laboratory Department of Computer Science University of Illinois at.
N Tropy: A Framework for Analyzing Massive Astrophysical Datasets Harnessing the Power of Parallel Grid Resources for Astrophysical Data Analysis Jeffrey.
Efficient Parallel Implementation of Molecular Dynamics with Embedded Atom Method on Multi-core Platforms Reporter: Jilin Zhang Authors:Changjun Hu, Yali.
Performance Evaluation of Hybrid MPI/OpenMP Implementation of a Lattice Boltzmann Application on Multicore Systems Department of Computer Science and Engineering,
GmImgProc Alexandra Olteanu SCPD Alexandru Ştefănescu SCPD.
Multi-core Programming Thread Profiler. 2 Tuning Threaded Code: Intel® Thread Profiler for Explicit Threads Topics Look at Intel® Thread Profiler features.
1 Babak Behzad, Yan Liu 1,2,4, Eric Shook 1,2, Michael P. Finn 5, David M. Mattli 5 and Shaowen Wang 1,2,3,4 Babak Behzad 1,3, Yan Liu 1,2,4, Eric Shook.
:: ::::: ::::: ::::: ::::: ::::: ::::: ::::: ::::: ::::: ::::: ::::: :: Dennis Hoppe (HLRS) ATOM: A near-real time Monitoring.
Scientific Computing Topics for Final Projects Dr. Guy Tel-Zur Version 2,
Computational Design of the CCSM Next Generation Coupler Tom Bettge Tony Craig Brian Kauffman National Center for Atmospheric Research Boulder, Colorado.
Computational issues in Carbon nanotube simulation Ashok Srinivasan Department of Computer Science Florida State University.
Russ Miller Center for Computational Research Computer Science & Engineering SUNY-Buffalo Hauptman-Woodward Medical Inst IDF: Multi-Core Processing for.
The WRF Model The Weather Research and Forecasting (WRF) Model is a mesoscale numerical weather prediction system designed for both atmospheric research.
Exploiting Data Parallelism in SELinux Using a Multicore Processor Bodhisatta Barman Roy National University of Singapore, Singapore Arun Kalyanasundaram,
1 GISolve – TeraGrid GIScience Gateway Shaowen Wang Department of Geography and Grid Research & educatiOn ioWa (GROW) The University of Iowa May.
Scheduling Many-Body Short Range MD Simulations on a Cluster of Workstations and Custom VLSI Hardware Sumanth J.V, David R. Swanson and Hong Jiang University.
Porting Irregular Reductions on Heterogeneous CPU-GPU Configurations Xin Huo, Vignesh T. Ravi, Gagan Agrawal Department of Computer Science and Engineering.
ATLSS Models for Predicting the Impact of Hydrology on Wildlife Populations in the Everglades Mangrove Zone of Florida Bay Jon Cline University of Tennessee.
Efficient Data Accesses for Parallel Sequence Searches Heshan Lin (NCSU) Xiaosong Ma (NCSU & ORNL) Praveen Chandramohan (ORNL) Al Geist (ORNL) Nagiza Samatova.
Hybrid MPI and OpenMP Parallel Programming
HPC User Forum Back End Compiler Panel SiCortex Perspective Kevin Harris Compiler Manager April 2009.
ARGONNE NATIONAL LABORATORY Climate Modeling on the Jazz Linux Cluster at ANL John Taylor Mathematics and Computer Science & Environmental Research Divisions.
NIH Resource for Biomolecular Modeling and Bioinformatics Beckman Institute, UIUC NAMD Development Goals L.V. (Sanjay) Kale Professor.
NIH Resource for Biomolecular Modeling and Bioinformatics Beckman Institute, UIUC NAMD Development Goals L.V. (Sanjay) Kale Professor.
Chapter 6: Integrating Knowledge and Action Scott Kaminski ME / 9 / 2005.
Integrated Ecological Assessment February 28, 2006 Long-Term Plan Annual Update Carl Fitz Recovery Model Development and.
MAPLD 2005/254C. Papachristou 1 Reconfigurable and Evolvable Hardware Fabric Chris Papachristou, Frank Wolff Robert Ewing Electrical Engineering & Computer.
Debugging parallel programs. Breakpoint debugging Probably the most widely familiar method of debugging programs is breakpoint debugging. In this method,
MROrder: Flexible Job Ordering Optimization for Online MapReduce Workloads School of Computer Engineering Nanyang Technological University 30 th Aug 2013.
This project is supported by the NASA Interdisciplinary Science Program The Estuarine Hypoxia Component of the Coastal Ocean Modeling Testbed: Providing.
Parallel Landscape Fish Model for South Florida Ecosystem Simulation Dali Wang 1, Michael W. Berry 2, Eric A. Carr 1, Louis J. Gross 1 The ATLSS Hierarchy.
DOE Network PI Meeting 2005 Runtime Data Management for Data-Intensive Scientific Applications Xiaosong Ma NC State University Joint Faculty: Oak Ridge.
Parallelization Strategies Laxmikant Kale. Overview OpenMP Strategies Need for adaptive strategies –Object migration based dynamic load balancing –Minimal.
Survey of Tools to Support Safe Adaptation with Validation Alain Esteva-Ramirez School of Computing and Information Sciences Florida International University.
A Pattern Language for Parallel Programming Beverly Sanders University of Florida.
1/50 University of Turkish Aeronautical Association Computer Engineering Department Ceng 541 Introduction to Parallel Computing Dr. Tansel Dökeroğlu
MicroGrid Update & A Synthetic Grid Resource Generator Xin Liu, Yang-suk Kee, Andrew Chien Department of Computer Science and Engineering Center for Networked.
Multicore Applications in Physics and Biochemical Research Hristo Iliev Faculty of Physics Sofia University “St. Kliment Ohridski” 3 rd Balkan Conference.
COMP7330/7336 Advanced Parallel and Distributed Computing MapReduce - Introduction Dr. Xiao Qin Auburn University
F1-17: Architecture Studies for New-Gen HPC Systems
Across Trophic Level System Simulation
How can the science of ecological modeling be used as a management tool in assessing the relative impact of alternative hydrology scenarios on populations.
Foundations of Spatially Explicit Species Index Models
FPGA: Real needs and limits
Across Trophic Level System Simulation (ATLSS)
Software Architecture in Practice
Performance Evaluation of Adaptive MPI
Department of Computer Science University of California, Santa Barbara
SDM workshop Strawman report History and Progress and Goal.
Chapter 1 Introduction.
Hybrid Programming with OpenMP and MPI
Dr. Tansel Dökeroğlu University of Turkish Aeronautical Association Computer Engineering Department Ceng 442 Introduction to Parallel.
Department of Computer Science, University of Tennessee, Knoxville
Department of Computer Science University of California, Santa Barbara
Parallel Exact Stochastic Simulation in Biochemical Systems
Computational issues Issues Solutions Large time scale
Presentation transcript:

A Parallel Structured Ecological Model for High End Shared Memory Computers Dali Wang Department of Computer Science, University of Tennessee, Knoxville

Background and Context IWOMP 2005, D. Wang to provide a quantitative modeling package to assist stakeholders in the South and Central Florida restoration effort. to aid in understanding how the biotic communities of South Florida are linked to the hydrologic regime and other abiotic factors, and to provide a tool for both scientific research and ecosystem management.

Previous Experiments IWOMP 2005, D. Wang A successful approach to the parallelization of landscape based (spatially- explicit) fish models is spatial decomposition. For these cases, each processor only simulates the ecological behaviors of fish on a partial landscape. This approach is efficient in standalone fish simulations because the low movement capability of fish does not force large data movement between processors.

Motivations & Objectives IWOMP 2005, D. Wang Design a new partition approach to minimize the data transfer; to efficiently utilize the advanced features of shared- memory computational platforms; However, in an integrated simulation with an individual- based wading bird model, intensive data immigration across all processors is inevitable, since a bird’s flying distance may cover the whole landscape. Typical memory-intensive applications

Model Structure and Fish Dynamics IWOMP 2005, D. Wang Escape, Diffusive Movement, Mortality, Aging, Reproduction, Growth Fish Dynamics Computational domain: approximately 111,000 landscape cells, each has two basic types of area: marsh and pond.

Parallelization Strategy IWOMP 2005, D. Wang Layer-wised Partition: 1) Data transfer between processes can be minimized; 2) Dynamic load balancing can be easily implemented.

Computational Model IWOMP 2005, D. Wang Model Initialization Update low trophic data Escape (OMP_NUM_THREADS -1) threads Aging Reproduction Growth Model Finalization Update hydrological data I/O operations Master thread Computational thread(s) (one thread) Get total fish density (one thread) Get fish consumption Diffusive Movement (OMP_NUM_THREADS -1) threads Mortality (OMP_NUM_THREADS -1) threads (one thread) Big simulation loopNested OpenMPparallel region

Computational Platform IWOMP 2005, D. Wang A SGI Altix system at the Center for Computational Sciences (CCS) of ORNL. 256 Intel Itanium2 processors running at 1.5 GHz, each with 6 MB of L3 cache, 256K of L2 cache, and 32K of L1 cache. 8 GB of memory per processor for a total of 2 Terabytes of total system memory The operating system is a 64-bit version of Linux. The parallel programming model is supported by OpenMP.

Model Result and Performance IWOMP 2005, D. Wang

Future Work (Ecological aspect) IWOMP 2005, D. Wang Field Data Calibration and Verification individual-based wading bird model spatially-explicit spices index model Ecological Model Integration With Ecological Impact Assessment (scenario analysis, …) Simulation-based ecosystem management (spatial optimal control, real-time ecological system analysis)

Future Work (Computational aspect) IWOMP 2005, D. Wang Large-scale simulation: Fine resolution- A hybrid, reconfigurable two- dimensional (spatial/temporal) partitioning using a hybrid MPI/OpenMP model. Fault tolerant computing/simulation Model integration: A component based parallel simulation framework

Related References IWOMP 2005, D. Wang D. Wang, et al. Design and Implementation of a Parallel Fish Model for South Florida. Proceedings of the 37th Hawaii International Conference on System Sciences. D. Wang, et al. A Parallel Landscape Fish Model for Ecosystem Modeling, Simulation: The Transactions of The Society of Modeling and International. D. Wang, et al. A Grid Service Module for Natural Resource Managers, Internet Computing. Parallel Implementation Grid Computing Module Performance Evaluations D. Wang, et al. On Parallelization of a Structured Ecological Model, International Journal on High Performance Computing Applications. Simulation Framework D. Wang, et al. Toward Ecosystem Modeling on Computing Grids, Computing in Science and Engineering. D. Wang, et al. A Parallel Simulation Framework for Regional Ecosystem Modeling, IEEE Transactions on Distributed and Parallel Systems Websites

Acknowledgement IWOMP 2005, D. Wang National Science Foundation U.S. Geological Survey, through Cooperative Agreement with UT Department of Interior’s Critical Ecosystem Studies Initiative Computational Science Initiatives through the Science Alliance at UT/ORNL This research used resources of the Center for Computational Sciences at ORNL, which is supported by the Office of Science of the Department of Energy

Parallel code section IWOMP 2005, D. Wang

Lessons IWOMP 2005, D. Wang MPI/OpenMP Easy Process/Thread Management (dynamic vs. static) Minimum Code Modification vs. Flexible Performance Tuning (user involvement needed) Parallel Profiling Tools (TAU, PAPI, …) High Portability (SMP and Cluster, even New systems (multi-core, embedded )