The BOP (Billion Object Platform) and WorldMap / Dataverse Integration Harvard Center for Geographic Analysis Tuesday, July 12, 2016 Ben Lewis, Mercè Crosas,

Slides:



Advertisements
Similar presentations
Broadband Session Michael Byrne. Broadband Map Technical Details Data Integration Map Presentation Since Launch.
Advertisements

Sheldon Brown, UCSD, Site Director Milton Halem, UMBC Director Yelena Yesha, UMBC Site Director Tom Conte, Georgia Tech Site Director Fundamental Research.
Opinion Mapping Travelblogs Efthymios Drymonas Alexandros Efentakis Dieter Pfoser Research Center Athena Institute for the Management of Information Systems.
So What is GIS??? “A collection of computer hardware, software and procedures that are used to organize, manage, analyze and display.
@ 2007 Austin Troy. Geoprocessing Introduction to GIS Geoprocessing is the processing of geographic information. Perform spatial analysis and modeling.
Introduction to GIS. Watershed Discretization (model elements) + Land Cover Soil Rain Results Intersect model elements with Digital Elevation Model (DEM)
Rebecca Boger Earth and Environmental Sciences Brooklyn College.
Managing Data Interoperability with FME Tony Kent Applications Engineer IMGS.
Hadoop Team: Role of Hadoop in the IDEAL Project ●Jose Cadena ●Chengyuan Wen ●Mengsu Chen CS5604 Spring 2015 Instructor: Dr. Edward Fox.
Mapping and GIS for the Internet Ruilan Shi Department of Geography McGill University Presented on June 1, 2001 on Carto2001.
SOFTWARE SYSTEMS DEVELOPMENT MAP-REDUCE, Hadoop, HBase.
1 1 ISyE 6203 Radical Tools Intro To GIS: MapPoint John H. Vande Vate Spring 2012.
ICT Technologies Session 2 4 June 2007 Mark Viney.
HBase A column-centered database 1. Overview An Apache project Influenced by Google’s BigTable Built on Hadoop ▫A distributed file system ▫Supports Map-Reduce.
___________________________________________GIST: A New Tool for Visualizing Geographic Data Environmental Modeling Center__________________________________________________.
Enabling Technology for Participatory Spatial Decision Making Hans Voss Gennady Andrienko Natalia Andrienko Spatial Decision Support Team
Development of Dynamic SLD and Understanding WCS Using Geo-server Supervisor Prof N.L Sarda Dept. of Computer Science & Engg. IIT-Bombay Bharti M.Tech.
By N.Gopinath AP/CSE Cognos Impromptu. What is Impromptu? Impromptu is an interactive database reporting tool. It allows Power Users to query data without.
May 2003National Coastal Data Development Center Brief Introduction Two components Data Exchange Infrastructure (DEI) Spatial Data Model (SDM) Together,
ALPHA a framework to support collaborative research Matt Bertrrand
TerraPop Mission Enabling research, learning, and policy analysis by providing integrated spatiotemporal data describing people and their environment.
Distributed Data Analysis & Dissemination System (D-DADS ) Special Interest Group on Data Integration June 2000.
MODIS Data at NSIDC MODIS Science Team Meeting - Nov. 2, 2006.
Introducing the new SWITRS GIS Map application in TIMS Safe Transportation Research and Education Center University of California, Berkeley
Introduction to Metadata March 2016 What is Metadata?
Presented by: Shahab Spring Introduction Data Analytics Plugins Learning Resources.
SIMULATION COMPONENT AND MODFLOW DATA MODEL. Simulation Component.
Conference Call: Access Code: Geocoding For Legislative Advocacy.
Mary Ganesan and Lora Strother Campus Tours Using a Mobile Device.
Dan O’Brien School of Public Policy and Urban Affairs, Northeastern University Research Director, Boston Area Research Initiative, Harvard University The.
Data Visualization with Tableau
Pilot Southeast Conservation Planning Atlas (CPA)
Key Terms Attribute join Target table Join table Spatial join.
GeoNetwork OpenSource: Geographic data sharing for everyone
Software Systems Development
policymap Teach your students mapping
Designing a Spatial/GIS Project
GIS Basic Training June 7, 2007 – ICIT Midyear Conference
The Geographic Support System Initiative (GSSI)
Managing Big Data and Little Data with WorldMap + Dataverse
The Center for Geographic Analysis Harvard University Our mission, services, process, staff, and sample projects Jeff Blossom, Center for Geographic.
Harry Williams, Cartography
Flanders Marine Institute (VLIZ)
Lecture 22: Using ArcToolbox Tools in Python
Touring Data with Power Map
Accessing Spatial Information from MaineDOT
Geographic Information Systems
CyberGIS: Reston, VA, September 22, 2018
Spatial Data Processing
Preliminaries: -- vector, raster, shapefiles, feature classes.
Data Queries Raster & Vector Data Models
Dynamic Data Access and Dynamically Generated WMS Layers
Power Apps Canvas and Model-Driven
Overview of big data tools
Building an online tool for spatial joins using open source software
OGC GeoPackage Format A Container to support the integration of Statistical and Geospatial Data Marcus Blake Assistant Director, Geospatial Solutions Australian.
QGIS, the data model, use and storage
Zoie Barrett and Brian Lam
Vector Geoprocessing.
Web AppBuilder for ArcGIS
Geoprocessing Sample Tools for Lidar Data Management
The Big And Far Math Challenge Demonetization
Andrew Hendrickson & Brian Embley
Introduction to Portal for ArcGIS
Server & Tools Business
GEO 481 Lab Geographical Information Systems Spring 2019
Adding Value to Registries through Geospatial Big Data Fusion Geospatial Health Context Big Table Facilitating Geospatial Analysis in Health Research.
Dynamic Data Access and Dynamically Generated WMS Layers
Working with Temporal Data
Big Data and Analytics: Getting Started with ArcGIS
Presentation transcript:

The BOP (Billion Object Platform) and WorldMap / Dataverse Integration Harvard Center for Geographic Analysis Tuesday, July 12, 2016 Ben Lewis, Mercè Crosas, Raman Prasad

Billion Object Platform - funded by Sloan General purpose, open source, streaming, big spatio-temporal data exploration and extraction Performs basic sentiment analysis Runs on commodity hardware and software Built on Spatial Lucene and Solr. Exposes all functions through an API

Other geospatial visualization work (funded by the Boston Area Research Initiative) 1.Spatial stamping in Billion Object Platform 2.Table visualization –Tables with well defined area columns (Census codes) –Tables with lat/longs 3.Geospatial data visualization –Shapefiles

The “Billion Streaming Geo-tweets” dataset A new dataset type in Dataverse which supports real-time streaming and visual, interactive exploration The content is geo-tweets (tweets containing GPS coordinate from originating device). Currently 1-2% of tweets are geo-tweets, about 8 million per day. The CGA has been harvesting geo-tweets since Main components: –1) Geo-tweet harvesting and archiving system –2) software and hardware platform to support interactive exploration of a billion spatio-temporal objects. –3) API to provide query access to the archive from Dataverse. –4) client-side tools for querying/visualizing the contents of the archive, extracting subsets, pushing them to Dataverse.

The “Billion Streaming Geo-tweets” dataset What does a landing page look like when… –Data source is external to Dataverse –The data source is continuously being updated –The data does not consist of “files” in the traditional Dataverse sense

The BOP: streaming big data… A closer look at the Billion Streaming Geotweets

API to streaming geo-tweets Built on Solr

A dataset landing page which enables data exploration and extraction A client which enables interactive exploration in multiple dimensions

Demo of Big Data exploration using predecessors of BOP : Japan Data Archive and HHypermap Japan Data Archive rt=relevant& rt=relevant& HHypermap Distributed Archive

2) Table Geocoding Work funded by NSF. Goal is to enable Dataverse tables with well-known geographic encodings to be easily visualized as maps

Pick the “Geospatial Data Type”

Choose (a) WorldMap “Join Layer” & (b) File column to join

Table visualized

Apply cartographic classification

Map symbolized

Map saved back to Dataverse

Thank You Ben Lewis

Phase II? Use Polygons to Symbolize Big Data Perform big data query. Find 10 million tweets mentioning Brexit. 18

( Geographic region and sentiment stamping ) Geographic stamping: As tweets stream in they will be stamped with census block, census tract, and Admin 2 codes. –To support aggregations by census or admin as well as by heatmap grid. Sentiment stamping: As tweets stream in a basic attempt will be made to determine sentiment. –To support heatmaps representing average sentiment values as well as count values.

Geo-tweet Dataverse