NSF Industry/University Cooperative Research Center High Performance Database Research Center Naphtali D. Rishe, Director, HPDRC.FIU.edu TerraFly.com
HPDRC Expertise: Data visualization Spatial databases Internet-distributed heterogeneous databases Database design methodologies Information analysis GIS Location Data Health Informatics Director: Dr. Naphtali David Rishe The Inaugural Outstanding University Professor of FIU Awards: $40M, Patents: 4, Books: 4, Papers: 250 HPDRC.FIU.edu TerraFly.com
GIS Solutions Based On Geo-Spatial Research Technology Next generation GIS: Internet geo-visualization & spatial cloud- computing platform and service Advanced geo-spatial computing engine Open architecture, API provided 40 TB database of aerial imagery and spatial data Rich datasets in a user-friendly environment Professionally customizable to domain requirements NASA, NSF, IBM and USGS funded technology HPDRC.FIU.edu TerraFly.com
Web-BasedWeb-Based Open Architecture API Geo-Spatial Mapping Solution HPDRC.FIU.edu TerraFly.com
Address Locator Vicinity Information Demographic Data Nationwide Layers: 2000 US Census, Businesses, Schools, Travel, Features (airports, heliports, public buildings, churches, hospitals, libraries, post offices, towers, tunnels, water, etc.), Property Lines and much more HPDRC.FIU.edu TerraFly.com
Your System Your System Your Data Your Data TerraFly system TerraFly Integration Solution HPDRC.FIU.edu TerraFly.com
Query Date Range Time Series Animation Player Census Block Groups are Selectable Layer Control HPDRC.FIU.edu TerraFly.com
MapReduce use cases ● ● Spatial data indexing [SSDBM’09] ● ● Geospatial query support [SSDBM’10] ● ● Parallel spatial data processing [GrC’10] ● ● Parallel set-similarity spatial joins ● ● Real Estate data analytics HPDRC.FIU.edu TerraFly.com
● ● Objective : Provide computational analytics for estimating how an event influences property values ● ● Dataset: Miami-Dade county records (~ 20M) ● ● Community Boundaries: U.S. Census Blocks and Tracts (polygons) ● ● Property transaction geo-database : Join of property public records ( Deeds, T ax Roll ) HPDRC.FIU.edu TerraFly.com ParcelDateValueAreaTypeLocation K840Condo1, K2000Single-Family2, K1085Single-Family2,2 ………………
● Method Virtual Community (VC): a set of homes with similar characteristics within a geographical area Consider unit price per VC, e.g. median of $/sq-foot Compute VC’s unit price rate change to compare communities Hadoop MapReduce : Temporal, in parallel, self-join of the dataset to compute property value rate change ParcelDateValue…Loc K1, K2, K3, K4, K5, K6, K7, K8,2 ………… K9, K9, K2, K1, K2, K2, K9,2 Reduce Map (Partitions records by: Community, Type, Date) (Computes community values and change rate) … Reduce … CommunityRateValue… % % % %60 ……… Input: dataset, t start, t end Output: Community value change rate
Hurricane Andrew, August 1992, Q2’92 vs. Q4’92 Drop Raise No change HPDRC.FIU.edu TerraFly.com