1 Kalev Leetaru, Eric Shook, and Shaowen Wang CyberInfrastructure and Geospatial Information Laboratory (CIGI) Department of Geography and Geographic Information.

Slides:



Advertisements
Similar presentations
Introduction to Smoothing and Spatial Regression
Advertisements

State of CyberGIS State of CyberGIS Shaowen Wang CyberInfrastructure and Geospatial Information Laboratory (CIGI) Department of Geography and Geographic.
11 Pre-conference Training MCH Epidemiology – CityMatCH Joint 2012 Annual Meeting Intermediate/Advanced Spatial Analysis Techniques for the Analysis of.
SAN DIEGO SUPERCOMPUTER CENTER The Integration of 2 Science Gateways: CyberGIS + OpenTopography Choonhan Youn, Nancy Wilkins-Diehr, SDSC Christopher Crosby,
A CyberGIS Environment for Near-Real-Time Spatial Analysis of Social Media Data Shaowen Wang CyberInfrastructure and Geospatial Information Laboratory.
University of Wisconsin-Milwaukee Geographic Information Science Geography 625 Intermediate Geographic Information Science Instructor: Changshan Wu Department.
GIS and Spatial Statistics: Methods and Applications in Public Health
19 th Advanced Summer School in Regional Science Overview of advanced techniques in ArcGIS data manipulation.
SimDL: A Model Ontology Driven Digital Library for Simulation Systems Jonathan Leidig - Edward A. Fox Kevin Hall Madhav Marathe Henning Mortveit.
Geographic Information Systems
1 从信息化基础设施角度展望 下一代地理信息系统 王少文 What is Cyberinfrastructure? It was six men of Indostan To learning much inclined, Who went to see the elephant.
Geographic Information Systems. What is a Geographic Information System (GIS)? A GIS is a particular form of Information System applied to geographical.
Panelist: Shashi Shekhar McKnight Distinguished Uninversity Professor University of Minnesota Cyber-Infrastructure (CI) Panel,
Why Geography is important.
An Introduction to Social Simulation Andy Turner Presentation as part of Social Simulation Tutorial at the.
Marine GIS Applications using ArcGIS Global Classroom training course Marine GIS Applications using ArcGIS Global Classroom training course By T.Hemasundar.
Introduction to the Use of Geographic Information Systems in Public Health Elio Spinello, MPH California State University, Northridge.
Chapter 1 Essentials of Geography
Small-Scale Raster Map Projection using the Compute Unified Device Architecture (CUDA) U.S. Department of the Interior U.S. Geological Survey Michael P.
University of Wisconsin-Milwaukee Geographic Information Science Geography 625 Intermediate Geographic Information Science Instructor: Changshan Wu Department.
Title: Spatial Data Mining in Geo-Business. Overview  Twisting the Perspective of Map Surfaces — describes the character of spatial distributions through.
The Science of Geography Geography is –methods, not just a body of knowledge –holistic –eclectic.
Chapter 1 Essentials of Geography
CyberGIS Toolkit: A Software Toolbox Built for Scalable cyberGIS Spatial Analysis and Modeling Yan Liu 1,2, Michael Finn 4, Hao Hu 1, Jay Laura 3, David.
1 Babak Behzad, Yan Liu 1,2,4, Eric Shook 1,2, Michael P. Finn 5, David M. Mattli 5 and Shaowen Wang 1,2,3,4 Babak Behzad 1,3, Yan Liu 1,2,4, Eric Shook.
Geometric Correction It is vital for many applications using remotely sensed images to know the ground locations for points in the image. There are two.
A High-Throughput Computational Approach to Environmental Health Study Based on CyberGIS Xun Shi 1, Anand Padmanabhan 2, and Shaowen Wang 2 1 Department.
Chapter Menu Introduction Section 1: Geography Skills Handbook
Name: Sujing Wang Advisor: Dr. Christoph F. Eick
CyberGIS in Action CyberGIS in Action Shaowen Wang CyberInfrastructure and Geospatial Information Laboratory (CIGI) Department of Geography and Geographic.
1 GISolve – TeraGrid GIScience Gateway Shaowen Wang Department of Geography and Grid Research & educatiOn ioWa (GROW) The University of Iowa May.
Small-Scale Raster Map Projection Transformation Using a Virtual System to Interactively Share Computing Resources and Data U.S. Department of the Interior.
Guofeng Cao CyberInfrastructure and Geospatial Information Laboratory Department of Geography National Center for Supercomputing Applications (NCSA) University.
Realizing CyberGIS Vision through Software Integration Anand Padmanabhan, Yan Liu, Shaowen Wang CyberGIS Center for Advanced Digital and Spatial Studies.
Data Types Entities and fields can be transformed to the other type Vectors compared to rasters.
Hassan A. Karimi Geoinformatics Laboratory School of Information Sciences University of Pittsburgh 3/27/20121.
Chapter 8 – Geographic Information Analysis O’Sullivan and Unwin “ Describing and Analyzing Fields” By: Scott Clobes.
Guofeng Cao CyberInfrastructure and Geospatial Information Laboratory Department of Geography National Center for Supercomputing Applications (NCSA) University.
UNIT 1: GIS DEFINITIONS AND APPLICATIONS
Chapter 1 Foundations of Geography Elemental Geosystems 4e Robert W. Christopherson Charlie Thomsen.
Applying Spatial Analysis Techniques to Make Better Decisions
What’s the Point? Working with 0-D Spatial Data in ArcGIS
Chapter 11 Spatial Analysis Credit to Prof Michael Goodchild.
Statistical Surfaces Any geographic entity that can be thought of as containing a Z value for each X,Y location –topographic elevation being the most obvious.
Integrating Geographic Information Systems (GIS) into your Curriculum Teaching American History Meg Merrick & Heather Kaplinger Year 2 GIS Inservices.
Guofeng Cao CyberInfrastructure and Geospatial Information Laboratory Department of Geography National Center for Supercomputing Applications (NCSA) University.
Special Topics in Geo-Business Data Analysis Week 3 Covering Topic 6 Spatial Interpolation.
GEOSPATIAL CYBERINFRASTRUCTURE. WHAT IS CYBERINFRASTRUCTURE(CI)?  A combination of data resources, network protocols, computing platforms, and computational.
Puulajeittainen estimointi ja ei-parametriset menetelmät Multi-scale Geospatial Analysis of Forest Ecosystems Tahko Petteri Packalén Faculty.
Guofeng Cao CyberInfrastructure and Geospatial Information Laboratory Department of Geography National Center for Supercomputing Applications (NCSA) University.
Using Cyberinfrastructure to Study the Earth’s Climate and Air Quality Don Wuebbles Department of Atmospheric Sciences University of Illinois, Urbana-Champaign.
CyberGIS Prof. Wenwen Li School of Geographical Sciences and Urban Planning 5644 Coor Hall
Black and White Introduction to Cyberinfrastructure Eric Shook Department of Geography Kent State University.
Shaowen Wang 1, 2, Yan Liu 1, 2, Nancy Wilkins-Diehr 3, Stuart Martin 4,5 1. CyberInfrastructure and Geospatial Information Laboratory (CIGI) Department.
Principles of GIS Fundamental database concepts – II Shaowen Wang
Point-pattern analysis of Nashville, TN robberies: It’s all about that kernel Ingrid Luffman and Andrew Joyner, Department of Geosciences, East Tennessee.
A Black-Box Approach to Query Cardinality Estimation
Shaowen Wang1, 2, Yan Liu1, 2, Nancy Wilkins-Diehr3, Stuart Martin4,5
Summary of Prev. Lecture
Principles of GIS Fundamental spatial concepts – Part II Shaowen Wang
UNIT 1: GIS DEFINITIONS AND APPLICATIONS
Principles of GIS Fundamental database concepts Shaowen Wang
Principles of GIS Geocomputation – Part II Shaowen Wang
Problems with Vector Overlay Analysis (esp. Polygon)
GEOCODING Creates map features from addresses or place-names.
What is Human Geography?
Spatial interpolation
The Science of Geography
Spatial Interpolation (Discrete Point Data)
9. Spatial Interpolation
Presentation transcript:

1 Kalev Leetaru, Eric Shook, and Shaowen Wang CyberInfrastructure and Geospatial Information Laboratory (CIGI) Department of Geography and Geographic Information Science School of Earth, Society, and Environment National Center for Supercomputing Applications (NCSA) University of Illinois at Urbana-Champaign CyberGIS ‘ 12, Urbana IL, August 8, 2012 A CyberGIS Approach to Digital Humanities and Social Sciences: The World of Textual Geography and a Case Study of Wikipedia’s History of the World

10

11

14

15

16

17

18

19

Workflow CyberGIS Sentiment Mining Fulltext Geocoding

Inside the CyberGIS “black box” Security Domain Decomposition XSEDE GISolve Middleware CI Data & Viz Resource Selection Task Scheduling Clouds Workflow Management Services Open Service API OSG Emotional Heatmap

Data Input for a Topic A set of locations with 3 attributes Latitude, longitude point location 1. Number of articles mentioning this location 2. Number of articles mentioning both this location and topic 3. Average tone of articles mentioning both this location and topic Latitude, longitude point location 1. Number of articles mentioning this location 2. Number of articles mentioning both this location and topic 3. Average tone of articles mentioning both this location and topic

Data Input for a Topic A set of locations with 3 attributes Latitude, longitude point location 1. Number of articles mentioning this location 2. Number of articles mentioning both this location and topic 3. Average tone of articles mentioning both this location and topic Latitude, longitude point location 1. Number of articles mentioning this location 2. Number of articles mentioning both this location and topic 3. Average tone of articles mentioning both this location and topic ?

Spatializing Emotion 3 important elements 1. Importance of location 2. Prevalence of topic 3. Emotion toward topic Goal: Capture 3 elements on a single map

1) Importance of Location Every mention of a location increases its importance Every mention of a location increases its importance Generate a density map of the number of times a location is mentioned in text using Kernel Density Estimation (KDE) based on k nearest neighbor search Generate a density map of the number of times a location is mentioned in text using Kernel Density Estimation (KDE) based on k nearest neighbor search

1) Importance of Location

2) Prevalence of Topic We term topic intensity to capture the prevalence of a topic relative to other topics, and adopt a method commonly used in epidemiological studies to estimate it We term topic intensity to capture the prevalence of a topic relative to other topics, and adopt a method commonly used in epidemiological studies to estimate it Relative risk is a ratio of the KDE of disease infection locations and case control locations Relative risk is a ratio of the KDE of disease infection locations and case control locations

Topic Intensity KDE(articles that mention a topic)___ KDE(articles that do not mention the topic) KDE(articles that mention a topic)___ KDE(articles that do not mention the topic) Relative Risk KDE(points with disease)__ KDE(points without disease) KDE(points with disease)__ KDE(points without disease)

Topic Intensity

3) Emotion Toward a Topic Challenging question: Is the emotional measure tone, discrete or continuous? Challenging question: Is the emotional measure tone, discrete or continuous? –Is tone "countable" like trees or does it exist as a continuum like air temperature? Tone is a continuum: Tone is a continuum: –Cannot have "number of tones"

3) Emotion Toward a Topic A different method is used, because tone is continuous and not discrete A different method is used, because tone is continuous and not discrete Inverse distance weighted (IDW) interpolation is used to estimate tone across space creating a tone map Inverse distance weighted (IDW) interpolation is used to estimate tone across space creating a tone map Tone map captures positive and negative tone toward a particular topic across space Tone map captures positive and negative tone toward a particular topic across space

3) Emotion Toward a Topic

Overview – 3 layers 1) Article density - Proxy: Importance of location 2) Topic intensity - Proxy: Prevalence of topic relative to other topics 3) Tone - Proxy: Emotion toward a topic

Overview – 3 layers 1) Article density - Proxy: Importance of location 2) Topic intensity - Proxy: Prevalence of topic relative to other topics 3) Tone - Proxy: Emotion toward a topic First two layers represent scaling factors for tone Value range: Value range: Value range:

Emotional Heatmap Article Density Topic Intensity Emotional Heatmap Tone * = *

Emotional Heatmap of Armed Conflict in 2003 (Wikipedia)

Summary First steps, but started the dialogue First steps, but started the dialogue Balance Balance –Managing the complexity of cyberinfrastructure access –Simplifying the workflow of chaining of spatial analytics –Making sense of what’s involved Scientific rigor Scientific rigor

Ongoing Work Translate spatial knowledge to domain knowledge by answering a basic question: why is this here and not there? Translate spatial knowledge to domain knowledge by answering a basic question: why is this here and not there? Tackle spatial aggregation issues Tackle spatial aggregation issues –Represent locations as areas not points –Areal interpolation

39 Acknowledgments Guofeng Cao, Anand Padmanabhan Guofeng Cao, Anand Padmanabhan National Science Foundation National Science Foundation –BCS –OCI- –OCI –Open Science Grid –XSEDE SES070004N

40 Thanks!