2013.10.24Y. Yuan and M. Raubal1 Investigating the distribution of human activity space from mobile phone usage Yihong Yuan 1,2 and Martin Raubal 1 1 Institute.

Slides:



Advertisements
Similar presentations
Yinyin Yuan and Chang-Tsun Li Computer Science Department
Advertisements

+ Failure rates. + outline Failure rates as continuous distributions Weibull distribution – an example of an exponential distribution with the failure.
Evidence synthesis of competing interventions when there is inconsistency in how effectiveness outcomes are measured across studies Nicola Cooper Centre.
The Normal Distribution
By Venkata Sai Pulluri ( ) Narendra Muppavarapu ( )
Mech4740 Numerical Methods, Hong Kong University of Science and Technology. 1 Part Five Curve Fitting.
Descriptive statistics. Statistics Many studies generate large numbers of data points, and to make sense of all that data, researchers use statistics.
Funding Networks Abdullah Sevincer University of Nevada, Reno Department of Computer Science & Engineering.
The role of cartographic visualizations to improve spatial cognition in geography fieldwork Xiaoling Wang, Efiom Ekpenyong, Corné P.J.M. van Elzakker,
Integration of Software Cost Estimates Across COCOMO, SEER- SEM, and PRICE-S models Tom Harwick, Engineering Specialist Northrop Grumman Corporation Integrated.
Continuous Random Variables Chap. 12. COMP 5340/6340 Continuous Random Variables2 Preamble Continuous probability distribution are not related to specific.
Ch. 6 The Normal Distribution
Analysis of Simulation Input.. Simulation Machine n Simulation can be considered as an Engine with input and output as follows: Simulation Engine Input.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 6-1 Chapter 6 The Normal Distribution and Other Continuous Distributions.
Modeling and Prediction of Abdominal Tumor Motion Haobing Wang Department of Computer Science May 9 th, 2003.
Business Statistics: A First Course, 5e © 2009 Prentice-Hall, Inc. Chap 6-1 Chapter 6 The Normal Distribution Business Statistics: A First Course 5 th.
Chapter 13, Part 1 STA 200 Summer I At this point… we have a couple of methods for graphing a data set (histogram, stem-and-leaf plot) we have a.
Geographic Profiling in Australia – An examination of the predictive potential of serial armed robberies in the Australian Environment By Peter Branca.
6 am 11 am 5 pm Fig. 5: Population density estimates using the aggregated Markov chains. Colour scale represents people per km. Population Activity Estimation.
1 Statistical Analysis - Graphical Techniques Dr. Jerrell T. Stracener, SAE Fellow Leadership in Engineering EMIS 7370/5370 STAT 5340 : PROBABILITY AND.
Chap 6-1 Copyright ©2013 Pearson Education, Inc. publishing as Prentice Hall Chapter 6 The Normal Distribution Business Statistics: A First Course 6 th.
1 GIScience and the Big Data Age Yihong Yuan Department of Geography Texas State University.
Graphical Summary of Data Distribution Statistical View Point Histograms Skewness Kurtosis Other Descriptive Summary Measures Source:
SPC for Real-World Processes William A. Levinson, P.E. Intersil Corporation Mountaintop, PA.
Cluster Detection Comparison in Syndromic Surveillance MGIS Capstone Project Proposal Tuesday, July 8 th, 2008.
Applied Quantitative Analysis and Practices LECTURE#11 By Dr. Osman Sadiq Paracha.
Density Curves and the Normal Distribution.
2.1 Density Curves and the Normal Distribution.  Differentiate between a density curve and a histogram  Understand where mean and median lie on curves.
Tile-based parallel coordinates and its application in financial visualization Jamal Alsakran, Ye Zhao Kent State University, Department of Computer Science,
Recognizing personalized flexible activity patterns Sergio A. Ordonez M. July 2015.
C2: Exponential Functions Learning Objective: to be able to recognise a function in the form of f(x) = a x.
Multiple Regression Petter Mostad Review: Simple linear regression We define a model where are independent (normally distributed) with equal.
NACLIM annual meeting - 15/10/ NACLIM Annual Meeting 2014 (Berlin) WP4.2 - Extraction of city morphology indicators for urban climate modeling: a.
CEN st Lecture CEN 4021 Software Engineering II Instructor: Masoud Sadjadi Monitoring (POMA)
Modeling and Forecasting Household and Person Level Control Input Data for Advance Travel Demand Modeling Presentation at 14 th TRB Planning Applications.
Selecting Input Probability Distribution. Simulation Machine Simulation can be considered as an Engine with input and output as follows: Simulation Engine.
September 18-19, 2006 – Denver, Colorado Sponsored by the U.S. Department of Housing and Urban Development Conducting and interpreting multivariate analyses.
Intermittency Analysis and Spatial Dependence of Magnetic Field Disturbances in the Fast Solar Wind Sunny W. Y. Tam 1 and Ya-Hui Yang 2 1 Institute of.
Chapter 20 Statistical Considerations Lecture Slides The McGraw-Hill Companies © 2012.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson Prentice-Hall, Inc.Chap 6-1 Statistics for Managers Using Microsoft® Excel 5th Edition.
Scatter Plots. Scatter plots are used when data from an experiment or test have a wide range of values. You do not connect the points in a scatter plot,
Empirical comparison of historical data and age- structured assessment models for Prince William Sound and Sitka Sound Pacific herring Peter-John F. Hulson,
1 Statistical Analysis - Graphical Techniques Dr. Jerrell T. Stracener, SAE Fellow Leadership in Engineering EMIS 7370/5370 STAT 5340 : PROBABILITY AND.
Lecture 3 – Sep 3. Normal quantile plots are complex to do by hand, but they are standard features in most statistical software. Good fit to a straight.
Introduction Exploring Categorical Variables Exploring Numerical Variables Exploring Categorical/Numerical Variables Selecting Interesting Subsets of Data.
AP Statistics Review Day 1 Chapters 1-4. AP Exam Exploring Data accounts for 20%-30% of the material covered on the AP Exam. “Exploratory analysis of.
Relationship of vegetation to socioeconomic status in Austin, Texas Kimberly Nichter, Department of Geography and the Environment This study observes the.
Fast Data Analysis with Integrated Statistical Metadata in Scientific Datasets By Yong Chen (with Jialin Liu) Data-Intensive Scalable Computing Laboratory.
CHS Community Health Survey
Using satellite data and data fusion techniques
Chapter 4 Continuous Random Variables and Probability Distributions
Understanding and Comparing Distributions
Understanding and Comparing Distributions
Instrumental Surface Temperature Record
Understanding Human Mobility from Twitter
Describing Location in a Distribution
Understanding and Comparing Distributions
CHAPTER 1: Picturing Distributions with Graphs
Instrumental Surface Temperature Record
Understanding and Comparing Distributions
Grant Number: IIS Institution of PI: WPI PIs: Matthew O
Descriptive and Inferential
Siyao Xu Earth, Energy and Environmental Sciences (EEES)
Statistics for Managers Using Microsoft® Excel 5th Edition
10-5 The normal distribution
Understanding and Comparing Distributions
Understanding and Comparing Distributions
Chapter 7 The Normal Distribution and Its Applications
The Normal Distribution
Presentation transcript:

Y. Yuan and M. Raubal1 Investigating the distribution of human activity space from mobile phone usage Yihong Yuan 1,2 and Martin Raubal 1 1 Institute of Cartography and Geoinformation, ETH Zurich 2 Department of Geography, University of California, Santa Barbara

Y. Yuan and M. Raubal2 Background Information and communication technologies (ICTs) –Greater mobility flexibility –A wide range of Spatio-temporal data sources Modeling the activity pattern of phone users –Call Detailed Records (CDRs) –Low resolution / precision –Uncertainty?

Y. Yuan and M. Raubal3 Background Activity space of human beings –“the local areas within which people travel during their daily activities” –How to measure an activity space Size Shape Regularly visited places (e.g., home, work) ……

Y. Yuan and M. Raubal4 Objective & Motivation Explore the distribution of activity space of phone users based on (CDRs) Why is this important? –Large scale, urban oriented research –Model fitting Abstract and generalized indicators Provide reference for various fields A neat fitted distribution can facilitate comparison between cities/countries/cultures, etc…

Y. Yuan and M. Raubal5 Methodology Three measures of Activity space –Radius -> Scale –Eccentricity -> Shape Shape Index: 1- Eccentricity –Entropy->Regularity Transformation of Trajectories (Gonzalez et al. 2008).

Y. Yuan and M. Raubal6 Example dataset Mobile Phone Connections in 10 cities in Northeast China –3 large, 5 median, 2 small –Time, duration, and locations of mobile phone connections over 9 days

Y. Yuan and M. Raubal7

Y. Yuan and M. Raubal8 Descriptive plots (Probability density functions) Radius (b)

Y. Yuan and M. Raubal9 Probability distribution functions Shape Index

Y. Yuan and M. Raubal10 Probability distribution functions Entropy

Y. Yuan and M. Raubal11 Model Fitting Various models –Why model fitting is needed sometimes –Exponential… –Skewed normal… Cross-variable comparison –One flexible model for all three variables –Weibull distribution For exploratory analysis Both cross-city and cross-variable comparison

Y. Yuan and M. Raubal12 Weibull distribution An effective model in various applications such as industrial engineering –where k > 0 is the shape parameter and λ > 0 is the scale parameter of the distribution. –With k=1, the Weibull distribution turns into an exponential distribution The combination of simplicity and flexibility in the shape of the Weibull distribution ( )

Y. Yuan and M. Raubal13 Weibull Distribution (Cont.) Scale parameter lamda –How much the curve stretches out –When k stays the same, The larger lamda is,, the distribution gets stretched out to the right and its height decreases, while maintaining its shape and location.

Y. Yuan and M. Raubal14 Weibull distribution fitting

Y. Yuan and M. Raubal15 Discussion The distribution of radius is close to an exponential distribution (k 2 ≈1) The decay of Shape Index is faster than exponential (k 3 <1). It also shows that the distribution of Entropy is similar to a skewed normal distribution (k 1 ≈2).

Y. Yuan and M. Raubal16 Application We have also constructed a regression model showing that –k 2 is significantly correlated with the size of the cities, indicating that bigger cities have a faster decay trend for individual activity radii. –city size is not significantly correlated with either entropy or SI.

Y. Yuan and M. Raubal17 Summary Weibull distribution –Fit into three variables The distribution of radius is close to an exponential distribution

Y. Yuan and M. Raubal18 Future Work Further examine other potential factors Test the robust of the method –Apply to other data Applications

Y. Yuan and M. Raubal19 Acknowledgement This research is funded by: –Swiss National Science Foundation, Grant No

Y. Yuan and M. Raubal20