1 - 1 Concurrent Web Map Cache Server A Vision for IndianaMap Zao Liu, Marlon Pierce, Geoffrey Fox Community Grids Laboratory Indiana University Neil Devadasan.

Slides:



Advertisements
Similar presentations
Concurrent Web Map Cache Server Zao Liu, Marlon Pierce, Geoffrey Fox Community Grids Laboratory Indiana University.
Advertisements

1 - 1 Concurrent Web Map Cache Server A Vision for IndianaMap Zao Liu, Marlon Pierce, Geoffrey Fox Community Grids Laboratory Indiana University Neil Devadasan.
Spatial Vision Innovative Geospatial Solutions BioSIRT - A national system using Open GIS components Ian Miller.
Framework Data Development and Web Map Services WV GIS Technical Center 27 Nov 2012 Kurt Donaldson WV GIS Technical Center 27 Nov 2012 Kurt Donaldson.
Technical Architectures
Web Caching Schemes1 A Survey of Web Caching Schemes for the Internet Jia Wang.
IURC Digital Mapping Update Electric Utilities Task Force April 6, 2004.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
GIS Evolution at the Madison County Council of Governments for the Madison County Government Consortium.
North Carolina Geospatial Data Archiving Project (NCGDAP) Project Overview Partnership –University library (NCSU) and state agency (NCCGIA) –$520,000 funding,
Internet GIS. A vast network connecting computers throughout the world Computers on the Internet are physically connected Computers on the Internet use.
Introduction to Databases Transparencies 1. ©Pearson Education 2009 Objectives Common uses of database systems. Meaning of the term database. Meaning.
QoS-enabled middleware by Saltanat Mashirova. Distributed applications Distributed applications have distinctly different characteristics than conventional.
PHASE 3: SYSTEMS DESIGN Chapter 8 System Architecture.
Help!!! Some Future Semantic Grid Activities CrisisGrid and ServoGrid PTLIU Laboratory for Community Grids Geoffrey Fox Computer Science, Informatics,
Osama Shahid ( ) Vishal ( ) BSCS-5B
By Mihir Joshi Nikhil Dixit Limaye Pallavi Bhide Payal Godse.
GIS technologies and Web Mapping Services
Copyright 2010, The World Bank Group. All Rights Reserved. Integrating Agriculture into National Statistical Systems Section A 1.
Interoperability ERRA System.
©Ian Sommerville 2006Software Engineering, 8th edition. Chapter 12 Slide 1 Distributed Systems Architectures.
Projects – 216,978 Architects – 20,653 Images – 84,293 Institutions - 31 Hits – 42,000 a day Hits since 2000 – over.
GIS 1110 Designing Geodatabases. Representation Q. How will we model our real world data? A. Typically: Features Continuous Surfaces and Imagery Map Graphics.
A Cloud is a type of parallel and distributed system consisting of a collection of inter- connected and virtualized computers that are dynamically provisioned.
Presenter: Dipesh Gautam.  Introduction  Why Data Grid?  High Level View  Design Considerations  Data Grid Services  Topology  Grids and Cloud.
Jeremy D. Bartley Kansas Geological Survey An Introduction to an Index of Geospatial Web Services.
U.S. Environmental Protection Agency WATERS Status Update
Larger GIS Community Can answer: –Local questions at small extents Spatial and temporal extents limited –Global questions at low resolution (while ignoring.
An Introduction To Building An Open Standard Web Map Application Joe Daigneau Pennsylvania State University.
National Earth Science Infrastructure Program AuScope Limited Headquarters School of Earth Sciences University of Melbourne Victoria 3010 Tel
GIS FOR COMMUNITY DEVELOPMENT. WHAT DOES GIS STAND FOR? Hardware and Software Data Mapping Standards GIS Savvy Users GIS G eographic I nformation S ystems.
Concurrent Web Map Cache Server: A “Web 2.0 Meets SOA” Case Study
Data Scrounging 101 Steve Signell, Instructor Robert Poirier, TA School of Science Rensselaer Polytechnic Institute Monday,
The 2000 Decennial Census School District Project: Using Census Data for the School District Mapping System **** Development and Implementation Tai A.
QuakeSim Work: Web Services, Portlets, Real Time Data Services Marlon Pierce Contributions: Ahmet Sayar,
GIS On The Web: An Overview of ArcIMS. *The easy flow of geographic data can offer real-life solutions in many societal sectors, including municipal government,
1 Introduction to Middleware. 2 Outline What is middleware? Purpose and origin Why use it? What Middleware does? Technical details Middleware services.
Geocaching and the North Dakota GIS Hub Bob Nutsch GIS Coordinator State of North Dakota GIS Day 2004.
Preserving State and Local Government Digital Geospatial Data Steve Morris Head of Digital Library Initiatives North Carolina State University Libraries.
Advanced Computer Networks Topic 2: Characterization of Distributed Systems.
Concurrent Web Map Cache Server A “Web 2.0 Meets SOA” Case Study Zao Liu, Marlon Pierce, Sunghoon Ko, Geoffrey Fox Community Grids Laboratory Indiana University.
CA-OES CAL(IT)2 Feb. 20, 2002 Internet GIServices
GEON2 and OpenEarth Framework (OEF) Bradley Wallet School of Geology and Geophysics, University of Oklahoma
SAVI Community Information System SAVI: Social Assets and Vulnerabilities Indicators The Polis Center at IUPUI.
Using the AccuGlobe Software with the IndianaMap Using the AccuGlobe Software.
Integrating Geographical Information Systems and Grid Applications Marlon Pierce Contributions: Ahmet Sayar,
March 2005 IGIC Statewide GIS Conference The IndianaMap Roll-out Project Presented By: Neil M. Devadasan, The POLIS Center Philip S. Worrall, Innovative.
INTRODUCTION TO GIS  Used to describe computer facilities which are used to handle data referenced to the spatial domain.  Has the ability to inter-
II Course on GBIF Node Management Arusha, Tanzania 31 st October and 1 st November 2008 Tim ROBERTSON Systems Architect GBIF Secretariat The GBIF Data.
About the Indiana Geographic Information Council 140 North Senate Avenue Indianapolis, IN
REST By: Vishwanath Vineet.
3/12/2013Computer Engg, IIT(BHU)1 CLOUD COMPUTING-1.
Web Technologies Lecture 13 Introduction to cloud computing.
ATLAS Database Access Library Local Area LCG3D Meeting Fermilab, Batavia, USA October 21, 2004 Alexandre Vaniachine (ANL)
SPDF Science Advisory Group - September 29-30, 2005 Page 12/24/2016 9:09:48 PM Services of the Space Physics Data Facility (SPDF) / Sun-Earth Connection.
Maine GeoLibrary Board Forums Provided as part of the Board’s Strategic Planning & Integrated Land Records Project.
INDIANA GEOLOGICAL SURVEY Indiana University Denver Harper Chris Walls Chris Dintaman Nathan Eaton.
Function BIRN The ability to find a subject who may have participated in multiple experiments and had multiple assessments done is a critical component.
Indiana GIS Local Resolution NHD Applications for Surveyors ISPLS State Conference January 18, 2013 (12:45 pm – 3:30 pm)
“It’s Not a Sprint – It’s a Marathon” GIS 101 For Local Government ICIT Midyear Conference June 17, 2010 Jeff Miller, Dubuque County Matt Boeck, Story.
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
US Army Corps of Engineers BUILDING STRONG ® Huntington District Floodplain Management Services Dan Bailey, CFM Huntington District August 2012.
DISTRIBUTED FILE SYSTEM- ENHANCEMENT AND FURTHER DEVELOPMENT BY:- PALLAWI(10BIT0033)
GIS Basic Training June 7, 2007 – ICIT Midyear Conference
INTRODUCTION TO GEOGRAPHICAL INFORMATION SYSTEM
The Improvement of PaaS Platform ZENG Shu-Qing, Xu Jie-Bin 2010 First International Conference on Networking and Distributed Computing SQUARE.
Introduction to Databases Transparencies
Indiana Geographic Information Officer INDIANA OFFICE OF TECHNOLOGY
Data Warehouse.
The IndianaMap Vision: To create and maintain a comprehensive statewide geospatial data resource that: Leverages geospatial data assets to maximize benefit.
Presentation transcript:

1 - 1 Concurrent Web Map Cache Server A Vision for IndianaMap Zao Liu, Marlon Pierce, Geoffrey Fox Community Grids Laboratory Indiana University Neil Devadasan The Polis Center IUPUI October 27, 2006

1 - 2 Where are we today?  The current IndianaMap uses data collected by the Indiana Geological Survey (IGS)  IGS periodically collects the best available State and Federal data and authors the data on a central web server  The web service includes the 2005 Statewide Orthophotography, INDOT and TIGER roads, USGS 10 foot contours, and Census boundaries

1 - 3 Where are we today?  Most current detailed Geographic Information is located with local government systems.  Key data includes parcels, addresses, roads, and infrastructure data  This data is not readily available at a regional or statewide level for decision making because of technical limitations

1 - 4 Comparison of state and county data  10 foot contours (1990)1 foot contours (2006)  Missing local roadsLocal roads (2006)  No parcelsParcels (2006)  No point addressesPoint addresses (2006)  Jurisdictional boundaries (2001)Jurisdictional boundaries (2006)

1 - 5 The Polis Center Middleware Research Neil Devadasan The Polis Center IUPUI

1 - 6 Many individual counties have web sites  When connecting to the service you receive all data not the subset of data that you need (Indianapolis 100+ layers)  You have no control over the data that you retrieve and query

1 - 7 Combining data from multiple web sites  Depending on the characteristics of the web sites, combining data may cause problems. Leaking tanks (Indiana Geological Survey Atlas of Indiana) overlaid on Marion County Parcels (Indianapolis GIS Web site) Public Land Survey Sections (Indiana Geological Survey Atlas of Indiana) overlaid on Marion County Parcels (Indianapolis GIS Web site) Note: Parcels are obliterated

The Polis Center’s Distributed Web GIS Middleware Research Strategy To take advantage of this highly accurate local data for use statewide, a variety of technical issues must be overcome such as: Projecting the information to a single coordinate system Standardizing symbology Retrieving individual Layers

1 - 9 Upper White River Watershed Alliance

IGIC IndianaMap Grant

Performance is an issue and thus scalability may be limited  Performance is constrained by the performance of the Individual servers

Federating, Tiling, and Caching Web Map Servers Zao Liu, Neil Devadasan, and Marlon Pierce

Basic Problem: Data Federation  Integrated GIS systems have obvious benefits but inevitably systems are developed by various state and local government agencies. Bottom up rather than top down  This tends to give excellent local information but it breaks down at the county boundary.

Considerations  We assume heterogeneity in GIS map and feature servers. Must find a way to federate existing services  We must reconcile ESRI, OGC, Google Map, and other technical approaches. Make a clean distinction between clients and services Must try to take advantage of Google, ESRI, etc rather than compete.  We must have good performance and interactivity. Servers must respond quickly--launching queries to 20 different map servers is very inefficient. Clients should have simplicity and interactivity of Google Maps and similar AJAX style applications.

Two Phase Approach: Caching and Tiling  Federation through caching: WMS and WFS resources are queried and results are stored on the cache servers. WMS images are stored as tiles.  These can be assembled into new images on demand (c. f. Google Maps).  Projections and styling can be reconciled.  We can store multiple layers this way. We build adapters that can work with ESRI and OGC products; tailor to specific counties.  Tiling: Client programs obtain images directly from our tile server.  That is, don’t go back to the original WMS for every request. Similar approaches can be used to mediate WFS requests. The tile server can re-cache and tile on demand if tile sections are missing.

Some Technical Details

Storage of caching entire state  Takes about TB to store the entire state to zoom level 13 this way. There are tiles for zoom levels 0-13, tiles for 0-14 levels (nearly 12 TB). There are ~10 layers for each scale  Aerial photo layer tiles take 25~30 KB  Other layers (parcels, roads) are much smaller: 30~36 KB for all remaining 9 layers per tile  So we need almost 60KB * tiles to store all map data  Layers from Google (Hybrids, Street, Google Satellite) don’t need to be cached. This is large but possible.  We can easily spread our caching server over multiple hosts to store even higher magnification scales.  Efficient tiling storage can save disk space.

Current Progress  Supports ESRI and OGC servers Now 17 counties is being cached. (Marion, Monroe are fully cached for 13 zoom levels)  7 layers has been proved that they can be easily cached. Aerial photo layer, street, interstate layer, parcel, parcel ID, county boundary, school). 3 more layers can be easily shown in client without caching. (Google Map, Google Satellite, Hybrids).  Querying parcel information across boundary. ( MARION-HANCOCK boundary)  Support Geocode querying.  Higher resolution than Google Satellite.  Google Map-like interaction.  Performance and Reliability. Cache Server still work even the county server doesn’t work. Much faster response to the client.

Tradeoffs of Caching  Cached images must be store somewhere.  More zoom levels, much more disk space is needed. For 12 levels, GB. For 13 levels, TB. For 14 levels, about 12 TB. (It may be not necessary to cache this zoom level for all counties. We can cache this level for the requirement of some place.  Difficulty of map re-projection.  Latency of keeping update with county servers.  Inconsistencies in available layers.

Next Steps  Caching more counties If county uses ESRI or OGC map server, current agent plugins can be used. We believe we can do the entire state We just don’t have the data.  Find a way to keep current with county servers, especially when the county server change layer id. Recent Monroe county example  Establish a standard for layers. (Different county server use different name for the same layer)  The tiling services should support multiple server styles URLs for REST/AJAX style clients WSDL and SOAP for formal Web Services Support OGC and ESRI clients.  Collaborative clients, dynamic layers (i.e. weather is an obvious addition).

Concurrent Web Map Cache Server Zao Liu, Marlon Pierce, Geoffrey Fox Community Grids Laboratory Indiana University

Introduction  Geographical Information Systems combine online dynamic maps and databases.  Many GIS software packages exist  GIS servers around state of Indiana ESRI ArcIMS and ArcMap Server (Marion, Vanderburgh, Hancock, Kosciusco, Huntington, Tippecanoe) Autodesk MapGuide (Hamilton, Hendricks, Monroe, Wayne) WTH Mapserver ™ Web Mapping Application (Fulton, Cass, Daviess, City of Huntingburg) based on several Open Source projects.  These are not compatible

Map Server Federation  Integrating GIS map servers is not trivial  Our solution: create a virtual map server to act as an agent server Translates map requests from generic format to the format expected by the specific map server. Provides a common language and programming interface for constructing clients  The agent server by itself will work but performance is not good Must wait for slowest server to respond Failure prone: a county server may not respond at all Adds additional overhead for combining images  Combining the agent server with a caching server solves these problems. Caches images for greater performance

Agent Server Architecture County Server Agent server

Caching Server  The agent server runs offline to harvest map images from county map servers. Images are stored as tiles. Tiles at county boundaries may be combined for greater storage and performance efficiency.  Clients connect to the cache server instead of the agent server.  The cache server constructs the requested image from pre-fetched tiles. Inspired by Google Maps approach Will enable more interactive clients (so-called AJAX programming)  Image construction may be parallelized/multi-threaded for greater performance. Potentially takes advantage of new multi-core server architectures from Sun, Intel, and AMD.

Tiling Example Agent server requests entire county maps for a particular zoom level and then breaks up into tiles.

Tiling and Caching at County Boundaries Marion County Hancock County Bounding box requests across boundaries have many empty tiles. Removing these empty tiles decreases storage requirements and increases cache server performance

The combined map

Caching and Tiling Layers  Map servers typically contain base maps and optional layers Parcel boundaries, roads, and township boundaries are layers.  We cache each layer separately.  Layers and base maps are combined dynamically using Java Advanced Image libraries. Common techniques

Tradeoffs of Caching  Cached images must be stored somewhere.  Currently, three counties (Hancock, Marion, and Cass) are cached at 11 different zoom levels. Photo images, layers Takes 100 ’ s-1000 ’ s GB of storage

Caching the Entire State  Takes about GB to store the entire state to zoom level 10 this way. There are tiles for zoom levels 0-10, tiles for 0-11 levels There are ~10 layers for each scale  Aerial photo layer tiles take 60 KB  Other layers (parcels, roads) are much smaller: 36 KB for all remaining 9 layers per tile  So we need 96KB * tiles to store all map data This is large but possible. Current commercial servers hosts like Sun T2000 can have 1 TB external (RAID) storage.  We can easily spread our caching server over multiple hosts to store even higher magnification scales.  Efficient tiling storage can save disk space.

Summary of Contributions  Development of agent server to pre-fetch map images from county map servers. Stores images as tiles. Removes redundant/empty tiles. Supports ESRI and OGC servers  Development of caching server Provides a uniform mechanism for clients to interact with different map servers. Increases performance and reliability  Don’t have to go to source map servers for every request. Will enable more interactive clients  Google Map-like interaction

Demonstration

Next Steps  University-private sector partnership MOU’s with local government to implement system for emergency response University and private sector funding to implement ESRI or OGC map server functionality  Develop Full Implementation System Finalize requirements Formalize programming interface using Web Service standards (WSDL and SOAP) Develop functionality Investigate scalability and performance issues