Spatial Big Data Challenges Intersecting Cloud Computing and Mobility
Shashi Shekhar, McKnight Distinguished University Professor

Presentation transcript:

1 Spatial Big Data Challenges Intersecting Cloud Computing and Mobility Shashi Shekhar McKnight Distinguished University Professor Department of Computer Science and Engineering University of Minnesota

2 Spatial Databases: Representative Projects
– Evacuation Route Planning
– Parallelize Range Queries
– Storing graphs in disk blocks
– Shortest Paths
[Figure legend: only in old plan; only in new plan; in both plans]

3 Why cloud computing for spatial data?
Geospatial Intelligence [Dr. M. Pagels, DARPA, 2006]
– Estimated at 140 terabytes per day, 150 petabytes annually
– Annual volume is 150x the historical content of the entire internet
– Analyze daily data as well as historical data

4 Eco-Routing
U.P.S. Embraces High-Tech Delivery Methods (July 12, 2007)
“The research at U.P.S. is paying off. ... saving roughly three million gallons of fuel in good part by mapping routes that minimize left turns.”
Minimize fuel consumption and greenhouse gas (GHG) emissions
– rather than proxies, e.g., distance, travel time
– avoid congestion, idling at red lights, turns, elevation changes, etc.

5 Real-time and Historic Travel-time, Fuel Consumption, GPS Tracks

6 Eco-Routing Research Challenges
Frames of reference
– Absolute to moving-object based (Lagrangian)
Data model of Lagrangian graphs
– Conceptual: generalize the time-expanded graph
– Logical: Lagrangian abstract data types
– Physical: clustering, index, Lagrangian routing algorithms
Flexible architecture
– Allow inclusion of new algorithms, e.g., GPS-track mining
– Merge solutions from different algorithms
Geo-sensing of events
– e.g., volunteered geographic information (e.g., OpenStreetMap)
– social unrest (Ushahidi), flash mobs, ...
Geo-prediction
– e.g., predict the track of a hurricane or a vehicle
– Challenges: auto-correlation, non-stationarity
Geo-privacy
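To make the "moving object" frame of reference concrete, here is a minimal sketch of routing in which each edge's travel time is evaluated at the moment the traveler actually reaches that edge. It is not the Lagrangian graph model from the talk; it is a plain time-dependent Dijkstra search, and it assumes FIFO edges (leaving later never gets you there earlier). The graph layout and the names `earliest_arrival` and `rush_hour` are hypothetical.

```python
import heapq

def earliest_arrival(graph, source, target, depart):
    """Earliest arrival time at target when leaving source at time depart.

    graph maps a node to a list of (neighbor, travel_time) pairs, where
    travel_time is a function of the time at which the edge is entered.
    Assumes FIFO edges, so a Dijkstra-style search is valid.
    """
    best = {source: depart}
    heap = [(depart, source)]
    while heap:
        t, u = heapq.heappop(heap)
        if u == target:
            return t
        if t > best.get(u, float("inf")):
            continue  # stale heap entry
        for v, travel_time in graph.get(u, []):
            arrive = t + travel_time(t)  # cost seen by the moving traveler
            if arrive < best.get(v, float("inf")):
                best[v] = arrive
                heapq.heappush(heap, (arrive, v))
    return None  # target unreachable

def rush_hour(t):
    """Hypothetical congestion profile: the direct road slows down from t = 8."""
    return 4 if t < 8 else 12

roads = {
    "A": [("C", rush_hour), ("B", lambda t: 3)],
    "B": [("C", lambda t: 3)],
}
```

With this toy network, leaving A at time 0 uses the direct road, while leaving at time 9 detours through B because the direct road is congested by then.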

7 Cloud Computing and Spatial Big Data
Motivation
Case Study 1: Simpler to Parallelize
Case Study 2: Harder
Case Study 3: Hardest
Wrap up

8 Simpler: Land-cover Classification
Multiscale, multigranular image classification into land-cover categories
[Figure panels: Inputs; Output at 2 scales]

9 Parallelization Choice
1. Initialize parameters and memory
2. for each Spatial Scale
3.   for each Quad
4.     for each Class
5.       Calculate Quality Measure
6.     end for Class
7.   end for Quad
8. end for Spatial Scale
9. Post-processing
Input: 64 x 64 image (Plymouth County, MA)
Classes: 4 (All, Woodland, Vegetated, Suburban)
Language: UPC
Platform: Cray X1, 1-8 processors
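One possible choice is to parallelize the "for each Quad" loop, since quads at a given scale can be scored independently. The sketch below illustrates that decomposition in Python rather than UPC. The quality measure (fraction of pixels matching a class) is a placeholder and not the measure used in the study, and all function names are hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor

def split_into_quads(image, quad_size):
    """Cut a square 2-D list into non-overlapping quad_size x quad_size blocks."""
    n = len(image)
    quads = []
    for r in range(0, n, quad_size):
        for c in range(0, n, quad_size):
            block = [row[c:c + quad_size] for row in image[r:r + quad_size]]
            quads.append(((r, c), block))
    return quads

def quality(block, cls):
    """Placeholder quality measure: fraction of pixels whose value equals cls."""
    pixels = [p for row in block for p in row]
    return sum(1 for p in pixels if p == cls) / len(pixels)

def best_class(item, classes):
    """Inner 'for each Class' loop: keep the class with the highest quality."""
    origin, block = item
    return origin, max(classes, key=lambda cls: quality(block, cls))

def classify(image, quad_sizes, classes, workers=4):
    """Outer loops: one pass per spatial scale, quads handed to a worker pool."""
    result = {}
    with ThreadPoolExecutor(max_workers=workers) as pool:
        for size in quad_sizes:
            quads = split_into_quads(image, size)
            labelled = pool.map(lambda q: best_class(q, classes), quads)
            result[size] = dict(labelled)
    return result

image = [
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [2, 2, 3, 3],
    [2, 2, 3, 0],
]
labels = classify(image, quad_sizes=[2, 4], classes=[0, 1, 2, 3])
```

Python threads share one interpreter lock, so this shows only how the work is divided. A real speedup would need processes or a language such as UPC, as on the slide.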

10 Harder: Parallelizing Vector GIS
– (1/30) second response-time constraint on each range query
– Parallel processing is necessary since the best sequential computer cannot meet the requirement
– Blue rectangle = a range query; polygon colors show processor assignment
[Figure labels: Set of Polygons; Display Graphics Engine; Local Terrain Database; Remote Terrain Databases; 30 Hz View Graphics; 2 Hz; 8 km x 8 km Bounding Box; High Performance GIS Component; 25 km x 25 km Bounding Box]

11 Data-Partitioning Approach
– Initial static partitioning
– Run-time dynamic load balancing (DLB)
– Platforms: Cray T3D (distributed memory), SGI Challenge (shared memory)

12 DLB Pool-Size Choice is Challenging!
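The interplay between static partitioning and a shared DLB pool can be sketched as follows. This is an illustration under simplifying assumptions, not the system described in the talk: polygons are stood in for by axis-aligned bounding boxes, workers are Python threads, and `pool_fraction` and `chunk` are hypothetical tuning knobs for how much work is held back for run-time balancing.

```python
import queue
import threading

def intersects(box, query):
    """Axis-aligned boxes given as (xmin, ymin, xmax, ymax)."""
    return not (box[2] < query[0] or box[0] > query[2] or
                box[3] < query[1] or box[1] > query[3])

def range_query(boxes, query, workers=4, pool_fraction=0.5, chunk=2):
    """Static partitioning plus a shared pool for dynamic load balancing."""
    ids = list(range(len(boxes)))
    n_pool = int(len(ids) * pool_fraction)
    static_ids = ids[:len(ids) - n_pool]
    pool_ids = ids[len(ids) - n_pool:]

    # Static phase: round-robin assignment fixed before the query runs.
    static_parts = [static_ids[w::workers] for w in range(workers)]

    # Dynamic phase: the held-back work sits in a shared queue of chunks.
    pool = queue.Queue()
    for i in range(0, len(pool_ids), chunk):
        pool.put(pool_ids[i:i + chunk])

    hits = [[] for _ in range(workers)]

    def work(w):
        for i in static_parts[w]:
            if intersects(boxes[i], query):
                hits[w].append(i)
        while True:  # a worker that finishes early pulls more chunks
            try:
                part = pool.get_nowait()
            except queue.Empty:
                return
            for i in part:
                if intersects(boxes[i], query):
                    hits[w].append(i)

    threads = [threading.Thread(target=work, args=(w,)) for w in range(workers)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return sorted(i for part in hits for i in part)

boxes = [(i, i, i + 1, i + 1) for i in range(10)]
query = (2.5, 2.5, 5.5, 5.5)
```

A larger pool gives idle workers more to pick up but adds contention on the shared queue; a smaller pool leaves more of the imbalance from the static phase in place. The answer itself does not depend on the split.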

13 Hardest: Location Prediction
[Figure panels: Nest locations; Distance to open water; Vegetation durability; Water depth]

14 Ex. 3: Hardest to Parallelize
Maximum Likelihood Estimation
– Need cloud computing to scale up to large spatial datasets
– However, computing the determinant of a large matrix is an open problem!
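To see where the determinant comes from, assume the likelihood is that of a spatial autoregressive (SAR) model y = rho*W*y + X*beta + eps, whose log-likelihood contains the term ln|I - rho*W| for an n x n neighborhood matrix W. The slide does not name the model, so this is an assumption. The sketch below computes that term by dense Gaussian elimination, which costs O(n^3) time and O(n^2) memory. The ring-shaped W and all function names are hypothetical.

```python
import math

def ring_weights(n):
    """Row-normalized W for n >= 3 sites on a ring, two neighbors per site."""
    w = [[0.0] * n for _ in range(n)]
    for i in range(n):
        w[i][(i - 1) % n] = 0.5
        w[i][(i + 1) % n] = 0.5
    return w

def log_det(a):
    """ln|det(a)| by Gaussian elimination with partial pivoting."""
    a = [row[:] for row in a]  # work on a copy
    n = len(a)
    total = 0.0
    for k in range(n):
        p = max(range(k, n), key=lambda r: abs(a[r][k]))
        if a[p][k] == 0.0:
            return float("-inf")  # singular matrix
        a[k], a[p] = a[p], a[k]
        total += math.log(abs(a[k][k]))
        for r in range(k + 1, n):
            f = a[r][k] / a[k][k]
            for c in range(k, n):
                a[r][c] -= f * a[k][c]
    return total

def sar_log_det(rho, w):
    """The ln|I - rho*W| term of the assumed SAR log-likelihood."""
    n = len(w)
    m = [[(1.0 if i == j else 0.0) - rho * w[i][j] for j in range(n)]
         for i in range(n)]
    return log_det(m)

def ring_log_det_closed_form(rho, n):
    """Check value: the ring W is circulant with eigenvalues cos(2*pi*k/n)."""
    return sum(math.log(1.0 - rho * math.cos(2.0 * math.pi * k / n))
               for k in range(n))
```

The closed form exists only because the ring W is circulant. For an irregular neighborhood matrix over millions of sites, no such shortcut is available, and the cubic cost of the dense computation is the bottleneck the slide points to.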

15 Cloud Computing and Spatial Big Data
Motivation: Spatial Big Data in National Security & Eco-routing
Case Study 1: Simpler to Parallelize
– Map-reduce is okay
– Should it provide spatial declustering services?
– Can a query compiler generate map-reduce parallel code?
Case Study 2: Harder
– Need dynamic load balancing beyond map-reduce
Case Study 3: Hardest
– Need new computer science, e.g.,
  Eco-routing algorithms
  Determinant of a large matrix
  Parallel formulation of evacuation route planning

16 Acknowledgments
HPC Resources, Research Grants
– Army High Performance Computing Research Center (AHPCRC)
– Minnesota Supercomputing Institute (MSI)
Spatial Database Group Members
– Mete Celik, Sanjay Chawla, Vijay Gandhi, Betsy George, James Kang, Baris M. Kazar, QingSong Lu, Sangho Kim, Sivakumar Ravada
USDOD
– Douglas Chubb, Greg Turner, Dale Shires, Jim Shine, Jim Rodgers
– Richard Welsh (NCS, AHPCRC), Greg Smith
Academic Colleagues
– Vipin Kumar
– Kelley Pace, James LeSage
– Junchang Ju, Eric D. Kolaczyk, Sucharita Gopal