Exploring Metropolitan Dynamics with an Agent- Based Model Calibrated using Social Network Data Nick Malleson & Mark Birkin School of Geography, University.

Slides:



Advertisements
Similar presentations
1 Using GIS to Understand Behavior Patterns of Twitter Users Yue Li M.S. Civil/Geomatics Engineering Purdue University Committee: Dr.Jie Shan (Chair),
Advertisements

USING AGENT-BASED MODELLING TO DEPICT BASIN CLOSURE IN THE NAIVASHA BASIN, KENYA: A FRAMEWORK OF ANALYSIS P. R. VAN OEL, A. VAN DER VEEN, R. BECHT.
Understanding and preventing crime: A new generation of simulation models Nick Malleson and Andy Evans Alison Heppenstall Linda See Mark Birkin Centre.
Integrating Land Use in a Hedonic Price Model Using GIS URISA 2001 Yan Kestens Marius Thériault François Des Rosiers Centre de Recherche en Aménagement.
School of something FACULTY OF OTHER School of Geography FACULTY OF ENVIRONMENT Extending spatial interaction models with agents for understanding relationships.
Crime Mapping & Analysis William Jarvis & Ibrahim Sabek CSCI 5715 Prof. Shashi Shekhar Wilson, Ronald and Filbert, Katie. “Crime Mapping and Analysis.”
Understanding and preventing crime: A new generation of simulation models Nick Malleson and Andy Evans.
VISIT: Virtual Intelligent System for Informing Tourists Kevin Meehan Intelligent Systems Research Centre Supervisors: Dr. Kevin Curran, Dr. Tom Lunney,
Understanding Population Trends and Processes: Links between internal migration, commuting and within household relationships Oliver Duke-Williams School.
GENESIS Web 2.0 Agent City Simulation: Establishing a user community and enabling collaborators to manipulate simulations and develop models Andy Turner.
Agent-based Modeling: Methods and Techniques for Simulating Human Systems Eric Bonabaun (2002) Proc. National Academy of Sciences, 99 Presenter: Jie Meng.
School of something FACULTY OF OTHER School of Geography FACULTY OF ENVIRONMENT Modelling Individual Consumer Behaviour
The Future of GeoComputation Ian Turton Centre for Computational Geography University of Leeds.
CCG 1 MoSeS Introduction and Progress Report Andy Turner
Geographic Information System - ArcView University at Buffalo Summer Institute 2003 May 12, 2003.
Synthetic data, microsimulation and socio-demographic forecasting Mark Birkin School of Geography Professor of Spatial Analysis and Policy.
The Research Process. Purposes of Research  Exploration gaining some familiarity with a topic, discovering some of its main dimensions, and possibly.
Spatial Simulation for Education Policy Analysis in Ireland An Initial Exploration Gillian Golden University College Dublin
LÊ QU Ố C HUY ID: QLU OUTLINE  What is data mining ?  Major issues in data mining 2.
School of something FACULTY OF OTHER School of Geography FACULTY OF ENVIRONMENT GeoCrimeData Understanding Crime Context with Novel Geo-Spatial Data Nick.
Geographic Profiling in Australia – An examination of the predictive potential of serial armed robberies in the Australian Environment By Peter Branca.
International Health Policy Program -Thailand BY SOPIT NASUEB SPATIAL ANAYSIS (TECHNIQUES)
Capturing Criminological Spaces with Agent-based Models Andy Evans Nick Malleson Alison Heppenstall Linda See Mark Birkin Centre for Applied Spatial Analysis.
Attention and Event Detection Identifying, attributing and describing spatial bursts Early online identification of attention items in social media Louis.
6 am 11 am 5 pm Fig. 5: Population density estimates using the aggregated Markov chains. Colour scale represents people per km. Population Activity Estimation.
Data Mining Chun-Hung Chou
Last Words COSC Big Data (frameworks and environments to analyze big datasets) has become a hot topic; it is a mixture of data analysis, data mining,
Estimating Individual Behaviour from Massive Social Data for An Urban Agent-Based Model Nick Malleson & Mark Birkin School of Geography, University ESSA.
Zhiyong Wang In cooperation with Sisi Zlatanova
Indiana GIS Conference, March 7-8, URBAN GROWTH MODELING USING MULTI-TEMPORAL IMAGES AND CELLULAR AUTOMATA – A CASE STUDY OF INDIANAPOLIS SHARAF.
Newsjunkie: Providing Personalized Newsfeeds via Analysis of Information Novelty Gabrilovich et.al WWW2004.
Outline What Neural Networks are and why they are desirable Historical background Applications Strengths neural networks and advantages Status N.N and.
Health Datasets in Spatial Analyses: The General Overview Lukáš MAREK Department of Geoinformatics, Faculty.
Geodemographic modelling collaboration Alex Voss, Andy Turner Presentation to Academia Sinica Centre for Survey Research
DESIGN AND DEVELOPMENT OF A CUSTOMIZED GIS FOR AIR QUALITY MANAGEMENT In the name of Allah Dr. S. Motesaddi Director General for air pollution research.
Time Series Data Analysis - I Yaji Sripada. Dept. of Computing Science, University of Aberdeen2 In this lecture you learn What are Time Series? How to.
Generic Approaches to Model Validation Presented at Growth Model User’s Group August 10, 2005 David K. Walters.
Chengjie Sun,Lei Lin, Yuan Chen, Bingquan Liu Harbin Institute of Technology School of Computer Science and Technology 1 19/11/ :09 PM.
1 Statewide Land-Use Allocation Model for Florida Stephen Lawe, John Lobb & Kevin Hathaway Resource Systems Group.
Knowledge Discovery from Mobile Phone Communication Activity Data Streams Fergal Walsh Data Stream Research presented in this poster was funded by a Strategic.
Wei Feng , Jiawei Han, Jianyong Wang , Charu Aggarwal , Jianbin Huang
Introduction to Spatial Microsimulation Dr Kirk Harland.
Siyuan Liu *#, Yunhuai Liu *, Lionel M. Ni *# +, Jianping Fan #, Minglu Li + * Hong Kong University of Science and Technology # Shenzhen Institutes of.
SICSA student induction day, 2009Slide 1 Social Simulation Tutorial International Symposium on Grid Computing Taipei, Taiwan, 7 th March 2010.
Introduction. Spatial sampling. Spatial interpolation. Spatial autocorrelation Measure.
GeoSpatial and GeoTemporal Informatics for dynamic and complex systems May Yuan.
A. Bazzani, B. Giorgini, S. Rambaldi Physics of the City Lab Dept. of Physics - CIG – INFN Models and tools for the analysis.
1 | Program Name or Ancillary Texteere.energy.gov Water Power Peer Review Assessment of Energy Production Potential from Tidal Streams in the United States.
Location Choice Modeling for Shopping and Leisure Activities with MATSim: Utility Function Extension and Validation Results A. Horni IVT ETH Zurich.
Web-Mining …searching for the knowledge on the Internet… Marko Grobelnik Institut Jožef Stefan.
HIMMELI - Heuristic three-level Instrument combining urban Morphology, Mobility, service Environments and Locational Information Morphologically structured.
Lily R. Jenkins and Diane E. Gan CSAFE Centre University of Greenwich 1.
Re-thinking Modelling: a Call for the Use of Data Mining in Data-driven Social Simulation Samer Hassan Javier Arroyo Celia Guti é rrez Universidad Complutense.
Using geolocated Twitter traces to infer residence and mobility Nigel Swier, Bence Kormaniczky, and Ben Clapperton.
Short term forecast of travel times on the Danish highway network based on TRIM data Klaus Kaae Andersen Thomas Kaare Christensen Bo Friis Nielsen Informatics.
Exploring Microsimulation Methodologies for the Estimation of Household Attributes Dimitris Ballas, Graham Clarke, and Ian Turton School of Geography University.
Complexity Settlement Simulation using CA model and GIS (proposal) Kampanart Piyathamrongchai University College London Centre for Advanced Spatial Analysis.
Injecting Data into Simulation: Can Agent-Based Modelling Learn from Microsimulation? Samer Hassan Juan Pav ó n Nigel Gilbert Universidad Complutense de.
A Nonparametric Method for Early Detection of Trending Topics Zhang Advisor: Prof. Aravind Srinivasan.
Urban Planning Group Implementation of a Model of Dynamic Activity- Travel Rescheduling Decisions: An Agent-Based Micro-Simulation Framework Theo Arentze,
Implementing Dynamic Data Assimilation in the Social Sciences Andy Evans Centre for Spatial Analysis and Policy With: Jon Ward, Mathematics; Nick Malleson,
Presenting spatial data Analyst training course – 23 rd June 2014.
Description of national ongoing/intended data processing
Web Mining Ref:
Summary Presented by : Aishwarya Deep Shukla
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
Location Recommendation — for Out-of-Town Users in Location-Based Social Network Yina Meng.
School of Geography FACULTY OF ENVIRONMENT
Twitter as a novel source of mobility indicators
Presentation transcript:

Exploring Metropolitan Dynamics with an Agent- Based Model Calibrated using Social Network Data Nick Malleson & Mark Birkin School of Geography, University

Outline Research aim: develop a model of urban-dynamics, calibrated using novel crowd-sourced data. Background: Data for evaluating agent-based models Crowd-sourced data Data and study area: Twitter in Leeds Establishing behaviour from tweets Integrating with a model of urban dynamics

Agent-Based Modelling Autonomous, interacting agents Represent individuals or groups Usually spatial Model social phenomena from the ground-up A natural way to describe systems Ideal for social systems

Advantages of ABM More “natural” for social systems than statistical approaches Dynamic history of system Can include physical space / social processes in models of social systems Designed at abstract level: easy to change scale Bridge between verbal theories and mathematical models

Disadvantages of ABM Single model run reveals a theorem, but no information about robustness Computationally expensive Sensitivity analysis and many runs required Small errors can be replicated in many agents “Methodological individualism” Modelling “soft” human factors Lack of individual-level data for evaluation

Data in Agent-Based Models Data required at every stage: Understanding the system Calibrating the model Validating the model But high-quality data are hard to come by Many sources are too sparse, low spatial/temporal resolution Censuses focus on attributes rather than behaviour and occur infrequently

Crowd-Sourced Data for Social Science “Crisis” in “empirical sociology” (Savage and Burrows, 2007) Traditional surveys are small and occur infrequently Often focus on population attributes rather than behaviour Often spatially / demographically aggregated These are being superseded “knowing capitalism” Amazon.com purchasing suggestions / supermarket reward cards “crowd-sourced” data / “volunteered geographical information” E.g. OpenStreetMap, Flikr, Twitter, FourSquare, Facebook Potentially very useful for agent-based models Calibration / validation Evaluating models in situ

Data and Study Area Twitter Social networking / microblogging service Users create public ‘tweets’ of up to 140 characters For the most part, tweets are publicly available Include information about user, time/date, location, text etc. ‘Streaming API’ provides real-time access to tweets Collected Data 1.2M+ geo-located tweets around Leeds (June 2011 – March 2012). 403,922 Tweets within district 2,683 individual users Highly Skewed (10% of all tweets from 8 most prolific users) Filtered non-people

Temporal Trends Hourly peak in activity at 10pm Daily peak on Tuesday - Thursday General increase in activity over time

Spatial Overview Point density appears to cluster around urban centres. Also able to distinguish roads in non-urban areas General pattern somewhat distorted by locations of prolific users

Analysis of Individual Behaviour – Anchor Points Spatial analysis to identify the home locations of individual users Some clear spatio- temporal behaviour (e.g. communting, socialising etc.). Estimate ‘home’ and then calculate distance from home at different times Journey to work?

More important than aggregate patterns, we can identify the behaviour of individual users Estimate ‘home’ and then calculate distance at different times Could estimate journey times, means of travel etc. Very useful for calibration of an ABM Spatio-Temporal Behaviour

Activity Matrices (I) Once the ‘home’ location has been estimated, it is possible to build a profile of each user’s daily activity The most common behaviour at a given time period takes precedence ‘Raw’ behavioural profiles Interpolating to remove no- data At Home Away from Home No Data User a b c d e f g h i User a b c d e f g h i

Activity Matrices (II) Overall, activity matrices appear reasonably realistic Peak in away from home at ~2pm Peak in at home activity at ~10pm. Next stages: Develop a more intelligent interpolation algorithm (borrow from GIS?) Spatio-temporal text mining routines to use textual content to improve behaviour classification

Towards A Model of Urban Dynamics – Design Use microsimulation to synthesise an initial population (all residents in a city) Estimate where people go to work Estimate when people go to work and how long they spend there (initial model parameters) Calibrate these parameters to data from Twitter (e.g. ‘activity matrices’) using a genetic algorithm

Prototype Model

Conclusions & Future Work New “crowd-sourced” data can help to improve social models Improved identification of behviour “Spatio-temporal text mining” Methods to classify text based on spatio-temporal location as well as textual content In situ model calibration

Thank you Nick Malleson, School of Geography, University of Leeds