Geographical Data Mining Stan Openshaw Centre for Computational Geography University of Leeds.

BUT Ian Turton, CCG, Leeds University For the latest on Stan

Why would we want to do this?
Geographical data explosion
Public imperative
Lack of geographically aware tools

Mountains of Data

Swamps of Data

We know what you spend...

…where you spend it...

…who you talk to...

…where you live... LS2 9JT What your neighbours are like

...Crime data and... crime type crime location insurance data

...Health data environmental data socio-economic data admissions data

Geographical Hyperspace
Geography
  – x,y co-ordinates, postcodes
Time
  – days, hours, months
Attributes
  – place: pollution sources, soil type, distance to motorway
  – cases: type of disease, age, sex
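As a rough illustration of the hyperspace idea, one case can be held as a single record with a geographic, a temporal and an attribute part. This is only a sketch: the class name `CaseRecord`, its field names and all the values below are invented for illustration and are not taken from the study data.

```python
from dataclasses import dataclass, field
from datetime import datetime


@dataclass
class CaseRecord:
    """One case positioned in geography, time and attribute space (hypothetical layout)."""
    x: float                      # geography: easting (invented value below)
    y: float                      # geography: northing (invented value below)
    postcode: str                 # geography: alternative locator
    when: datetime                # time: day, hour, month are all derivable from this
    place: dict = field(default_factory=dict)  # attributes of the place
    case: dict = field(default_factory=dict)   # attributes of the case


def dimensions(record: CaseRecord) -> int:
    """Count the axes of the hyperspace this record occupies (x, y, time, attributes)."""
    return 2 + 1 + len(record.place) + len(record.case)


record = CaseRecord(
    x=429500.0,
    y=434500.0,
    postcode="LS2 9JT",
    when=datetime(1998, 3, 14, 9, 30),
    place={"soil_type": "clay", "km_to_motorway": 1.2},
    case={"disease": "asthma", "age": 34, "sex": "F"},
)

print(dimensions(record))
```

Every extra place or case attribute adds another axis, which is why the search space grows so quickly.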

Data Mining

Turning data into knowledge How do these data sets fit together? Is there anything important hidden in here? Does geography make a difference?

Data type   Nature of data interaction
_________________________________________
1           spatial data
2           time data
3           multiple attribute data
4           geography and time data
5           time and multiple attribute data
6           geography and multiple attribute data
7           geography, time, and multiple attribute data
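The seven data types are simply every non-empty combination of the three components (geography, time, attributes). A short sketch that enumerates them, with names chosen here for illustration; the order it produces differs slightly from the table:

```python
from itertools import combinations

# The three components of the geographical hyperspace.
COMPONENTS = ("geography", "time", "multiple attribute")


def interaction_types(components=COMPONENTS):
    """Return every non-empty combination of components, smallest first."""
    found = []
    for size in range(1, len(components) + 1):
        found.extend(combinations(components, size))
    return found


for number, combo in enumerate(interaction_types(), start=1):
    print(number, ", ".join(combo) + " data")
```

With three components there are 2^3 - 1 = 7 such combinations, matching the seven rows of the table.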

HISTORICALLY these effects have been hidden by research design BUT

The result is often data strangulation The patterns are being destroyed or damaged by the research design

What is needed is a geographic data mining technology that works

How can we do this?
Developing new smarter methods
Testing them
  – HPC is vital to this process
Disseminating them
  – Internet
  – Java

Being SMART is not just a matter of methodology but also involves access, usability, relevancy, and result communication factors

The complete novice should be able to perform some sophisticated geographical analysis and get some useful and understandable results on the same day the work started

User Friendly Spatial Analysis
provides analysis that users need
simple to perform
highly automated, making it fast and efficient
readily understood
results are self-evident and can be communicated to non-experts
safe and trustworthy

What we did in this study
Comparison of techniques on the same data
Multiple techniques
  – GAM/K
  – GAM/K-T
  – MAPEX
  – GDM1/2
  – FLOCK
  – Proprietary Data Mining Tools

Study Area

Stan’s Cases

Chris’ cases

How to search the geographic space
Exhaustively
  – GAM, GEM
Smartly
  – Genetic algorithm: MAPEX, GDM
  – Flocking boids
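To make the exhaustive strategy concrete, here is a heavily simplified sketch in the spirit of GAM: place circles of several radii on every grid point, count cases and population at risk inside each, and keep the circles with a surprising excess of cases. The function names, the Poisson test, the `alpha` threshold and the toy data are all assumptions made for illustration; this is not the actual GAM/K code.

```python
import math


def poisson_tail(observed, expected):
    """P(X >= observed) for X ~ Poisson(expected)."""
    if observed <= 0:
        return 1.0
    term = math.exp(-expected)          # P(X = 0)
    cdf = term
    for k in range(1, observed):        # accumulate P(X <= observed - 1)
        term *= expected / k
        cdf += term
    return max(0.0, 1.0 - cdf)


def exhaustive_circle_search(cases, population, extent, step, radii, alpha=0.01):
    """Test every circle on a regular grid; return (x, y, radius, n_cases) for flagged ones.

    cases:      list of (x, y) case locations
    population: list of (x, y, persons) population-at-risk points
    extent:     (xmin, ymin, xmax, ymax) of the study area
    """
    xmin, ymin, xmax, ymax = extent
    rate = len(cases) / sum(p for _, _, p in population)  # overall incidence
    hits = []
    for r in radii:
        y = ymin
        while y <= ymax:
            x = xmin
            while x <= xmax:
                n_cases = sum(1 for cx, cy in cases
                              if math.hypot(cx - x, cy - y) <= r)
                n_pop = sum(p for px, py, p in population
                            if math.hypot(px - x, py - y) <= r)
                if n_cases > 0 and n_pop > 0:
                    if poisson_tail(n_cases, rate * n_pop) < alpha:
                        hits.append((x, y, r, n_cases))
                x += step
            y += step
    return hits


# Toy data: 100 people at every integer grid point, 8 cases piled up at (2, 2).
population = [(px, py, 100) for px in range(10) for py in range(10)]
cases = [(2, 2)] * 8 + [(7, 7), (5, 1)]
hits = exhaustive_circle_search(cases, population, (0, 0, 9, 9), step=1, radii=[1, 2])
print(len(hits), "circles flagged")
```

The cost is the point: every grid location is tested at every radius, which is why the smart searches (genetic algorithms, flocking) try to visit only promising parts of the space.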

GAM & GEM

Mapex & GDM

FLOCK

And the Attributes...
Exhaustively
  – GAM, GEM
Smartly
  – Genetic algorithm (MAPEX, GDM), boids

GAM & GEM with time

Geology Map (legend: Rock A, Rock B, Rock C, Rock D)

Railway with 2 km buffer polygon

Combined Geology and Railway Buffer Map (legend: Rock A, Rock B, Rock C, Rock D; 2 km railway buffer)
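The overlay in these three maps can be sketched as follows: test whether a location falls within 2 km of the railway, then pair that answer with its rock type to get one combined class. The railway coordinates and function names are invented for illustration, and the rock type is assumed to be already known for each point rather than looked up from geology polygons.

```python
import math


def distance_to_segment(p, a, b):
    """Shortest distance from point p to the line segment a-b (all (x, y) in km)."""
    px, py = p
    ax, ay = a
    bx, by = b
    dx, dy = bx - ax, by - ay
    length_sq = dx * dx + dy * dy
    if length_sq == 0.0:                      # degenerate segment: a == b
        return math.hypot(px - ax, py - ay)
    # Project p onto the segment and clamp to its end points.
    t = ((px - ax) * dx + (py - ay) * dy) / length_sq
    t = max(0.0, min(1.0, t))
    return math.hypot(px - (ax + t * dx), py - (ay + t * dy))


def in_buffer(p, line, width_km=2.0):
    """True if p lies within width_km of any segment of the polyline."""
    return any(distance_to_segment(p, a, b) <= width_km
               for a, b in zip(line, line[1:]))


def combined_class(p, rock_type, railway, width_km=2.0):
    """Combine the geology class with the railway-buffer class for one location."""
    return (rock_type, in_buffer(p, railway, width_km))


railway = [(0.0, 0.0), (10.0, 0.0), (10.0, 5.0)]   # hypothetical polyline
print(combined_class((5.0, 1.5), "Rock A", railway))
print(combined_class((5.0, 3.0), "Rock A", railway))
```

Four rock types crossed with inside or outside the buffer already give eight combined classes, so each extra map layer multiplies the number of classes to search.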

Combinations of Attributes
If we have 8 attributes with 10 classes each, there are 80 classes in total
There are 3,160 combinations of 2 classes from 80, compared with 24,040,016 if any 5 are used
Smart searches are essential
  – use GA to generate possible combinations of interest
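The two counts on this slide are binomial coefficients (order does not matter), and they can be checked directly. The variable names are chosen here for illustration:

```python
import math

n_attributes = 8
classes_per_attribute = 10
total_classes = n_attributes * classes_per_attribute   # 80 classes in all

pairs = math.comb(total_classes, 2)   # ways to choose any 2 of the 80 classes
fives = math.comb(total_classes, 5)   # ways to choose any 5 of the 80 classes

print(total_classes, pairs, fives)
```

Going from 2 to 5 classes multiplies the number of candidate combinations by several thousand, which is the argument for a guided search rather than exhaustive enumeration.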

Proprietary Data Miners

Results How to visualise them?

Results
GAM/K
  – did very well
  – was not put off by time or attributes
GAM/K-T
  – worked well
  – time clusters found
MAPEX / GDM/1
  – worked well

Results continued
FLOCK
  – worked very well
Data mining
  – didn’t work at all well out of the box
  – could have built a GAM inside them

What next?
Build a harder data set for more tests
Re-run the analysis
Put it all on the web

Thanks to European Research Office of the US Army ESRC grant R for paying Ian’s salary. ESRC/JISC for the Census data purchase. OS for the bits of the maps they own.

To find out more
Web based Multi-engine spatial analysis tools, James Macgill, Openshaw and Turton
  – Session 1A Sunday
Smart Crime Pattern Analysis using GAM, Ian Turton, Openshaw and Macgill
  – Session 7A Tuesday

Contacts check out smart pattern analysis on the web Latest news on Stan