Opportunities for SLEUTH in High-performance Computing Qingfeng (Gene) Guan, Ph.D. Center for Advanced Land Management Information Technologies School.

Slides:



Advertisements
Similar presentations
Severs AIST Cluster (50 CPU) Titech Cluster (200 CPU) KISTI Cluster (25 CPU) Climate Simulation on ApGrid/TeraGrid at SC2003 Client (AIST) Ninf-G Severs.
Advertisements

UNIVERSITY OF JYVÄSKYLÄ Resource Discovery in P2P Networks Using Evolutionary Neural Networks Presentation for International Conference on Advances in.
Complexity Settlement Simulation using CA model and GIS (proposal) Kampanart Piyathamrongchai University College London Centre for Advanced Spatial Analysis.
Parallel Computing Glib Dmytriiev
Unsolved Problems and Issues Related to the SLEUTH Urban Growth and Land Use Change Model Keith C. Clarke University of California, Santa Barbara SLEUTH.
Accelerators for HPC: Programming Models Accelerators for HPC: StreamIt on GPU High Performance Applications on Heterogeneous Windows Clusters
GPU Programming using BU Shared Computing Cluster
Enabling Speculative Parallelization via Merge Semantics in STMs Kaushik Ravichandran Santosh Pande College.
H EURISTIC S OLVER  Builds and tests alternative fuel treatment schedules (solutions) at each iteration  In each iteration:  Evaluates the effects of.
Monte-Carlo method and Parallel computing  An introduction to GPU programming Mr. Fang-An Kuo, Dr. Matthew R. Smith NCHC Applied Scientific Computing.
A Graph-based Recommender System Zan Huang, Wingyan Chung, Thian-Huat Ong, Hsinchun Chen Artificial Intelligence Lab The University of Arizona 07/15/2002.
EEL6686 Guest Lecture February 25, 2014 A Framework to Analyze Processor Architectures for Next-Generation On-Board Space Computing Tyler M. Lovelly Ph.D.
Master/Slave Architecture Pattern Source: Pattern-Oriented Software Architecture, Vol. 1, Buschmann, et al.
GPU System Architecture Alan Gray EPCC The University of Edinburgh.
GPGPU Introduction Alan Gray EPCC The University of Edinburgh.
HPCC Mid-Morning Break High Performance Computing on a GPU cluster Dirk Colbry, Ph.D. Research Specialist Institute for Cyber Enabled Discovery.
A Scalable Heterogeneous Parallelization Framework for Iterative Local Searches Martin Burtscher 1 and Hassan Rabeti 2 1 Department of Computer Science,
PARALLEL PROCESSING COMPARATIVE STUDY 1. CONTEXT How to finish a work in short time???? Solution To use quicker worker. Inconvenient: The speed of worker.
Cyberinfrastructure for Scalable and High Performance Geospatial Computation Xuan Shi Graduate assistants supported by the CyberGIS grant Fei Ye (2011)
Role of Economics in W&W Project and in Climate Change Projects Explain how land use patterns evolve over time Forecast future land use change Determine.
Rapid Raster Projection Transformation and Web Service Using High-performance Computing Technology 2009 AAG Annual Meeting Las Vegas, NV March 25 th, 2009.
Chapter 4 DECISION SUPPORT AND ARTIFICIAL INTELLIGENCE
A many-core GPU architecture.. Price, performance, and evolution.
GPU Computing with CUDA as a focus Christie Donovan.
2009/04/07 Yun-Yang Ma.  Overview  What is CUDA ◦ Architecture ◦ Programming Model ◦ Memory Model  H.264 Motion Estimation on CUDA ◦ Method ◦ Experimental.
Multi Agent Simulation and its optimization over parallel architecture using CUDA™ Abdur Rahman and Bilal Khan NEDUET(Department Of Computer and Information.
Parallel Programming Henri Bal Rob van Nieuwpoort Vrije Universiteit Amsterdam Faculty of Sciences.
1 ITCS 6/8010 CUDA Programming, UNC-Charlotte, B. Wilkinson, Jan 19, 2011 Emergence of GPU systems and clusters for general purpose High Performance Computing.
Modeling land cover change in the Chesapeake Bay watershed An overview of the SLEUTH model Claire A. Jantz Shippensburg University Geography-Earth Science.
Spatio-temporal differences in model outputs and parameter space as determined by calibration extent 7 th International Conference on Geocomputation Charles.
Modeling Urban Land-use with Cellular Automata Geog 232: Geo-Simulation Sunhui(Sunny) Sim February 7 th, 2005.
GPGPU overview. Graphics Processing Unit (GPU) GPU is the chip in computer video cards, PS3, Xbox, etc – Designed to realize the 3D graphics pipeline.
Synergy.cs.vt.edu Power and Performance Characterization of Computational Kernels on the GPU Yang Jiao, Heshan Lin, Pavan Balaji (ANL), Wu-chun Feng.
Motivation “Every three minutes a woman is diagnosed with Breast cancer” (American Cancer Society, “Detailed Guide: Breast Cancer,” 2006) Explore the use.
GPU Programming with CUDA – Accelerated Architectures Mike Griffiths
An approach for solving the Helmholtz Equation on heterogeneous platforms An approach for solving the Helmholtz Equation on heterogeneous platforms G.
GPU-accelerated Evaluation Platform for High Fidelity Networking Modeling 11 December 2007 Alex Donkers Joost Schutte.
NVDA Preetam Jinka Akhil Kolluri Pavan Naik. Background Graphics processing units (GPUs) Chipsets Workstations Personal computers Mobile devices Servers.
Computer Graphics Graphics Hardware
Parallel Computing for Urban Cellular Automata Qingfeng. Gene. Guan 2004-Nov-18 Geography Department Colloquium Univ. of California, Santa Barbara.
By Arun Bhandari Course: HPC Date: 01/28/12. GPU (Graphics Processing Unit) High performance many core processors Only used to accelerate certain parts.
Gregory Fotiades.  Global illumination techniques are highly desirable for realistic interaction due to their high level of accuracy and photorealism.
CCGrid 2014 Improving I/O Throughput of Scientific Applications using Transparent Parallel Compression Tekin Bicer, Jian Yin and Gagan Agrawal Ohio State.
Tampere University of TechnologySanna IltanenEDGE Laboratory for Architectural and Urban Research Urban Generator introinput datasimulation results.
1 © 2012 The MathWorks, Inc. Parallel computing with MATLAB.
Conclusions and Future Considerations: Parallel processing of raster functions were 3-22 times faster than ArcGIS depending on file size. Also, processing.
Example: Sorting on Distributed Computing Environment Apr 20,
Geosimulation Geosimulation models are developed to represent phenomena that occur in urban systems in highly realistic manner In particular, Cellular.
Parallelization and Characterization of Pattern Matching using GPUs Author: Giorgos Vasiliadis 、 Michalis Polychronakis 、 Sotiris Ioannidis Publisher:
1 CMPE 511 HIGH PERFORMANCE COMPUTING CLUSTERS Dilek Demirel İşçi.
CCGrid 2014 Improving I/O Throughput of Scientific Applications using Transparent Parallel Compression Tekin Bicer, Jian Yin and Gagan Agrawal Ohio State.
Carlo del Mundo Department of Electrical and Computer Engineering Ubiquitous Parallelism Are You Equipped To Code For Multi- and Many- Core Platforms?
1)Leverage raw computational power of GPU  Magnitude performance gains possible.
GPGPU introduction. Why is GPU in the picture Seeking exa-scale computing platform Minimize power per operation. – Power is directly correlated to the.
3/12/2013Computer Engg, IIT(BHU)1 CUDA-3. GPGPU ● General Purpose computation using GPU in applications other than 3D graphics – GPU accelerates critical.
Hybrid Parallel Implementation of The DG Method Advanced Computing Department/ CAAM 03/03/2016 N. Chaabane, B. Riviere, H. Calandra, M. Sekachev, S. Hamlaoui.
Fermi National Accelerator Laboratory & Thomas Jefferson National Accelerator Facility SciDAC LQCD Software The Department of Energy (DOE) Office of Science.
Accelerating K-Means Clustering with Parallel Implementations and GPU Computing Janki Bhimani Miriam Leeser Ningfang Mi
Heterogeneous Processing KYLE ADAMSKI. Overview What is heterogeneous processing? Why it is necessary Issues with heterogeneity CPU’s vs. GPU’s Heterogeneous.
Exploiting Graphics Processors for High-performance IP Lookup in Software Routers Jin Zhao, Xinya Zhang, Xin Wang, Yangdong Deng, Xiaoming Fu IEEE INFOCOM.
Introduction to Machine Learning, its potential usage in network area,
Computer Graphics Graphics Hardware
Generalized and Hybrid Fast-ICA Implementation using GPU
Introduction to Parallel Computing: MPI, OpenMP and Hybrid Programming
Ray-Cast Rendering in VTK-m
Scalable Parallel Interoperable Data Analytics Library
2009 AAG Annual Meeting Las Vegas, NV March 25th, 2009
Hybrid Programming with OpenMP and MPI
Computer Graphics Graphics Hardware
Vrije Universiteit Amsterdam
Presentation transcript:

Opportunities for SLEUTH in High-performance Computing Qingfeng (Gene) Guan, Ph.D. Center for Advanced Land Management Information Technologies School of Natural Resources University of Nebraska - Lincoln 2102 AAG Annual Meeting New York, NY

Content Introduction pRPL – a general-purpose Raster Processing programming Library pSLEUTH – parallelizing SLEUTH using pRPL Parallelizing CA using GPUs Conclusions

Introduction - SLEUTH The urban growth model SLEUTH, uses a modified Cellular Automata to model the spread of urbanization across a landscape (Clarke et al., 1996, 1997). Its name comes from the GIS data layers that are incorporated into the model; Slope, Landuse, Exclusion layer (where growth cannot occur, like the ocean), Urban, Transportation, and Hillshade. (Goldstein. 2004) Prediction of urban development to the year 2050 over southeastern Pennsylvania and part of Delaware using the SLEUTH model mation/movie_small.htm

Coefficients – Dispersion – Breed – Spread – Slope – Road Gravity Rules – Spontaneous Growth Rule (centralized) – New Spreading Centers Rule (non-centralized) – Edge Growth Rule (non- centralized) – Road-Influenced Growth Rule (non-centralized) For more info. about SLEUTH: Introduction - SLEUTH

Bruce Force Calibration – coefficient combinations Monte Carlo Iteration – 10 ~ 100 Iterations for each coefficient combination Massive-volume Data – High spatial resolution – Large study area High Computational Intensity –The model calibration for a medium sized data set and minimal data layers requires about 1200 CPU hours on a typical workstation (Clarke. 2003) Calibration of SLUETH Introduction - SLEUTH

Solution 1: Simplifying Assumptions – Each coefficient affects the model behavior in a linear manner – Iteratively narrow down the coefficient search range 0, 50, , 75, , 62, 75 … Reach the optimal coefficient value – Problem: random processes in CA very likely non-linear relationships assumptions generate unreliable calibration results inaccurate simulation results possibly flawed conclusions and theories, and improper management and planning decisions

Introduction - SLEUTH Solution 2: Smart Agents – Computational Intelligence – Genetic algorithm, Artificial Neural Networks, Ant Intelligence, etc. – Search the coefficient space more efficiently – Issue: which one is better/smarter? Solution 3: High-performance Computing (HPC) – Dumb Dumb with Fast Legs – Exhaust the coefficient space in a acceptable/feasible time period – Remove simplifying assumptions

Introduction – High-performance Computing (HPC) Parallel computing is the use of multiple computing units (computers, processors, or processes) working together on a common task in a concurrent manner in order to achieve higher performance In contrast to sequential computing that usually has only one computing unit Performance is usually measured with computing time A massive parallel computing system ( )

pRPL: parallel Raster Processing Library Raster is born to be parallelized

Writing a parallel raster processing program is as simple as writing a serial program Possible usage – Massive-volume geographic raster processing – Image (including remote sensing imagery) processing – Large-scale Cellular Automata (CA) and Agent-based Modeling Free downloadable and open source – pRPL - Introduction

Object-Oriented programming style – Written in C++ – Built upon the Message Passing Interface (MPI) Class templates support arbitrary data types – e.g. integer, char, double precision floating point number, even user-defined types Transparent Parallelism pRPL - Features

Spatially Flexible – Supports any arbitrary neighborhood configuration – Supports centralized and non-centralized algorithms pRPL - Features (cont.)

Regular and irregular decomposition pRPL - Features (cont.)

Multi-layer processing pRPL - Features (cont.)

The four growth rules in the SLEUTH model, were implemented using pRPL pSLEUTH - Parallelization

Processer Grouping – With pRPL, pSLEUTH is able to organize the processors in groups. Data parallelism within a group, task parallelism among groups. – Static tasking and dynamic tasking pSLEUTH - Parallelization (cont.)

Data – Urban areas of the continental US (1980 and 1990, 4948×3108) Calibration Settings – Only three values (0, 50, 100) will be evaluated for each coefficient – The total number of simulations is 243 (= 3 5 ) – Each simulation includes 11 ( = ) years pSLEUTH - Experiments Settings

Computer cluster at University of California, Santa Barbara – 128 dual CPU 3.06 GHz Xeon nodes – 2-GB RAM each

pSLEUTH – Performance

GPU - Introduction Graphics Processing Unit (GPU) – Massive number (hundreds) of computing cores on one unit – Very inexpensive ($200 - $2000) – General parallel processing with Compute Unified Device Architecture (CUDA) – Personal Supercomputer

GPU – Preliminary Study Classical CA - Game Of Life (GOL) – 10,000 X 10,000 – 100 iterations Desktop PC with 1.6 GHz dual-core CPU – 100 minutes Desktop PC with NVIDIA GeForce GTX 260 GPU – 192 cores – 6 minutes – 16.6 speed-up

GPU – Preliminary Study Keeneland – Hybrid computer cluster with multiple CPUs and GPUs – 120 nodes with 240 CPUs and 360 GPUs – Developed by Georgia Institute of Technology, University of Tennessee at Knoxville, and Oak Ridge National Laboratory, sponsored by NSF Keeneland-GOL – 8 GPUs used, 25 seconds, 240 speed-up – 20 GPUs used, 20 seconds, 300 speed-up

HPC greatly reduces the computing time for model calibration – Enables the removal of simplifying assumptions – Can increase the reliability of calibration and accuracy of simulation New computing accelerator technology enables HPC-SLUETH with very low costs Opportunities for us to test and verify conclusions and theories, as well as to stimulate new theories Conclusion

Comments and Questions?