SAS Mapping functionality to measure and present the Veracity of Location Data.

Presentation transcript:

SAS Mapping functionality to measure and present the Veracity of Location Data

2 University of Derby, UK
• Richard J Self, Senior Lecturer in Analytics and Governance
• Vishal Patel, Final Year Student, University of Derby
• Daniel Corah, Final Year Student, University of Derby
• Viktor Horecny, Final Year Student, University of Derby

3 Objectives
• SAS – Exploring Mapping Functionality to Visualise Veracity of Location Data
• Lessons Learned about Location Data Veracity and SAS Visualisations

4 Context
• Smart device location services are seen as reliable
• This may not be true, and the consequences are many
• Retail LBS-based marketing
• Social network apps
• Photo locations in social media and Google Maps
• Forensics
• Criminal justice system
• Research question: To what extent is A-GPS reliable, and in what circumstances?

5 Triggers to Research Project

6 Final Year Student Project
• 12 students researching
• 3 are co-authors, contributing valuable analyses
• 7 students contributed data to this presentation (2460 data points)
• Daniel Corah
• Vishal Patel
• Amna Almutawa
• Ishwa Khadka
• Viktor Horecny
• Shehzaad Kashmiri
• Farondeep Bains

7 Critical Questions
• Levels of accuracy in different conditions
• Indoors / outdoors
• Rural / residential / urban
• Weather conditions
• Stability of indicated location
• Differences between devices (make / model / operating system)

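8 V Patel – Key Insight – Models Vary
t-test output (numeric values not preserved in this transcript):
  Columns: phone, N, Mean, Std Dev, Std Err
  Rows: Nexus, iPhone, Diff (1-2)
  Columns: Method, Variances, DF, t Value, Pr > |t|
  Rows: Pooled (Equal), Satterthwaite (Unequal)
Proc Univariate – Histogram issues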
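The column headings on this slide (Pooled / Satterthwaite, Diff (1-2)) match the layout of a two-sample t-test report. A minimal sketch of a call that produces a report of that shape is given here, assuming a data set `work.errors` with a grouping variable `phone` and a numeric location error `error` in metres. All three names are hypothetical; the slide does not show the code that was actually run.

```sas
/* Sketch only: work.errors, phone and error are assumed names */
proc ttest data=work.errors;
  class phone;   /* grouping variable with exactly two levels, e.g. Nexus and iPhone */
  var error;     /* location error to compare between the two phones */
run;
```

The Satterthwaite row is the one to read when the two phones show clearly different variances.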

9 D Corah – Key Insight – Stone Built Houses Proc SGPLOT

10 V Horecny – Key Insight – Chipsets
• HTC-M8 (blue): modern chipset
• HTC-Desire S (pink): early version chipset
• Uses XL/JMP®

11 Other Insights
• Cloud conditions affect accuracy
• Accuracy variable with time

12 Overall Accuracy of LBS
• 85% <= 25 metres
• 2364 out of 2420 <= 500 m

13 Accuracy Variable with Time
• Start-up of LS: max error 360 m
• Uses Annotate coding and macros

14 Accuracy Variable with Time

15 Consolidated Data – 2420 points Red = > 300m

16 Annotate for Time Based Accuracy
• Challenges
• Auto-scaling and boundaries
• Data System
• ANNOMAC coding for labels

17 Raw Data
Columns: Lat_True_Deg, Long_True_Deg, Lat_Xif_Deg, Long_Xif_Deg, Loc_Ind, Image_Path, Date_Time_Stamp, Phone_type
(coordinate values not preserved in this transcript)

Loc_Ind    Image_Path     Date_Time_Stamp  Phone_type
Open       IMG_0464.JPG   06/09/ :24:24    iPhone 5C
Open       IMG_0466.JPG   17/08/ :56:51    iPhone 5C
Open       IMG_0465.JPG   17/08/ :52:47    iPhone 5C
circuit 1  IMG_01102.jpg  23/03/ :25:46    iPhone 5C
circuit 1  IMG_01103.jpg  23/03/ :25:47    iPhone 5C
circuit 1  IMG_01104.jpg  23/03/ :25:48    iPhone 5C
circuit 1  IMG_01105.jpg  23/03/ :25:49    iPhone 5C
circuit 1  IMG_01106.jpg  23/03/ :25:50    iPhone 5C
circuit 1  IMG_01107.jpg  23/03/ :25:51    iPhone 5C
circuit 1  IMG_01108.jpg  23/03/ :25:52    iPhone 5C
circuit 1  IMG_01109.jpg  23/03/ :25:53    iPhone 5C

Lat_True_Deg and Long_True_Deg found through Google Maps
Lat_Xif_Deg, Long_Xif_Deg and Date_Time_Stamp read from images using IrfanView
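The slides do not show how the location error was derived from these columns. One plausible approach, sketched here as an assumption rather than the authors' method, is the SAS `GEODIST` function, which returns the distance in kilometres by default for coordinates given in degrees. The data set name `work.errors` and the coordinate values in the data lines are made up for illustration and are not taken from the study.

```sas
/* Sketch only: illustrative coordinates, not data from the presentation */
data work.errors;
  input Lat_True_Deg Long_True_Deg Lat_Xif_Deg Long_Xif_Deg;
  /* GEODIST defaults: inputs in degrees, result in kilometres */
  error = geodist(Lat_True_Deg, Long_True_Deg,
                  Lat_Xif_Deg,  Long_Xif_Deg) * 1000;   /* convert to metres */
  datalines;
52.9380 -1.4960 52.9381 -1.4962
52.9380 -1.4960 52.9400 -1.4990
;
run;

proc print data=work.errors;
run;
```

An `error` variable in metres of this kind is what the later dot-colouring thresholds (1, 10, 100, 200) would be compared against.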

18 Boundaries
    proc means data=work_derby min max noprint;
      output out=means_derby;
      var x y;
    run;

    /* deduce corner coordinates (in Lat (Y) / Long (X) degrees)
       and output them as macro variables using symput */
    data _null_;
      set means_derby;
      if _stat_ = 'MIN' then do;
        call symput('min_x', x);
        call symput('min_y', y);
      end;
      if _stat_ = 'MAX' then do;
        call symput('max_x', x);
        call symput('max_y', y);
      end;
    run;
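A possible variation on the data step on this slide, offered as a sketch and not as the authors' code: `CALL SYMPUTX` strips leading and trailing blanks and converts a numeric argument using a field of up to 32 characters, where `CALL SYMPUT` uses up to 12. That can matter for coordinates with many decimal places. The data set `means_derby` is the one created by the PROC MEANS step on the slide.

```sas
/* Sketch only: same logic as the slide, with SYMPUTX swapped in for SYMPUT */
data _null_;
  set means_derby;
  if _stat_ = 'MIN' then do;
    call symputx('min_x', x);   /* no automatic-conversion note, no padding blanks */
    call symputx('min_y', y);
  end;
  if _stat_ = 'MAX' then do;
    call symputx('max_x', x);
    call symputx('max_y', y);
  end;
run;

/* Write the four corner values to the log to check them */
%put &=min_x &=max_x &=min_y &=max_y;
```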

19 Auto-Scaling
    xsys = '1'; /* using Frame area */
    ysys = '1';
    hsys = '3';
    dotsize = 0.5; /* basic size of plotted error dot, % of frame */
    /* plot data in centered 90% of Frame Area */
    /* min_x etc set from previous section */
    x = (90 - (x - symget('min_x'))*90 / (symget('max_x') - symget('min_x'))) + 5;
    y = (y - symget('min_y'))*90 / (symget('max_y') - symget('min_y')) + 5;
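To see what the two scaling formulas do, the following sketch applies them to three made-up points. The macro variable values and coordinates are illustrative assumptions, and `x_pct` / `y_pct` are hypothetical names used so the original coordinates stay visible. The `INPUT(SYMGET(...))` calls make the character-to-numeric conversion explicit, which the slide leaves to SAS.

```sas
/* Sketch only: illustrative bounds, not values from the study */
%let min_x = -1.50;
%let max_x = -1.40;
%let min_y = 52.90;
%let max_y = 53.00;

data _null_;
  input x y;
  /* read the macro variables back as numbers */
  min_x = input(symget('min_x'), best32.);
  max_x = input(symget('max_x'), best32.);
  min_y = input(symget('min_y'), best32.);
  max_y = input(symget('max_y'), best32.);
  /* same formulas as on the slide: 90% of the frame with a 5% margin */
  x_pct = (90 - (x - min_x)*90 / (max_x - min_x)) + 5;
  y_pct = (y - min_y)*90 / (max_y - min_y) + 5;
  put x= y= x_pct= y_pct=;
  datalines;
-1.50 52.90
-1.45 52.95
-1.40 53.00
;
run;
```

From the formulas, the minimum x maps to 95 and the maximum x to 5, so the horizontal axis is mirrored, while y runs from 5 at its minimum to 95 at its maximum. If all points share the same x or y, the denominator is zero, so a real program would need a guard for that case.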

20 Dot Generation – using annomac macros
    if error < 1 then do;
      dotsize = dotsize*1; /* small dot for high accuracy */
      %slice(x,y,0,360,dotsize,darkgreen,solid,3); /* different colors for different errors */
    end;
    else if error >= 1 and error < 10 then do;
      dotsize = dotsize*1.5;
      %slice(x,y,0,360,dotsize,mediumgreen,solid,3);
    end;
    else if error >= 10 and error < 100 then do;
      dotsize = dotsize*2;
      %slice(x,y,0,360,dotsize,mediumyellow,solid,3);
    end;
    else if error >= 100 and error < 200 then do;
      dotsize = dotsize*2.5; /* large dot for big error */
      %slice(x,y,0,360,dotsize,darkyellow,solid,3);
    end;

21 Adding Sequence Labels
    length color $64
           number $4
           posn 8.
           ;
    retain posn;
    if _n_ = 1 then posn = 0;
    posn = posn + 1;
    if posn = 10 then posn = 1; /* similar to using the MOD( ) function base 9 */
    if posn = 1 then do;
      %label(x,y,number,white,0,0,3,times new roman,1); /* Position cannot be added from a variable */
    end;                                                /* in the label macro (last macro parameter) */
    else if posn = 2 then do;
      %label(x,y,number,white,0,0,3,times new roman,2);
    end; /* etc. */
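The comment on this slide notes that the retained counter behaves like the `MOD()` function with base 9. A small sketch comparing the two is shown here; the loop variable `n` stands in for `_n_`, and `posn_mod` is a hypothetical name for the MOD-based version.

```sas
/* Sketch only: compare the retained counter with a MOD-based equivalent */
data _null_;
  posn = 0;
  do n = 1 to 12;
    /* counter logic as on the slide: cycles 1, 2, ..., 9, 1, 2, ... */
    posn = posn + 1;
    if posn = 10 then posn = 1;
    /* equivalent expression using MOD */
    posn_mod = mod(n - 1, 9) + 1;
    put n= posn= posn_mod=;
  end;
run;
```

Both columns cycle through 1 to 9 and then restart at 1, so `mod(_n_ - 1, 9) + 1` could replace the `retain` logic. The chain of `if posn = ...` branches is still needed, because the slide notes that the position parameter of `%label` cannot be supplied from a variable.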

22 Final Output – Using Proc GANNO
    goptions reset=all border cback=black ctitle=white;
    proc ganno annotate=workanno; /* from previous Data Step */
    run;
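Putting the pieces together, a minimal end-to-end sketch might look like the following. It is an illustration under stated assumptions, not the authors' program: the data set name `work.anno_demo`, the three data points, the two-colour threshold and the dot sizes are all made up. It also uses coordinate system '3' (percentage of the graphics output area) for x and y, not the '1' shown on the slides, so that the dots can be positioned directly in percent.

```sas
/* Sketch only: illustrative points and thresholds */
%annomac;                        /* make the Annotate macros available */

data work.anno_demo;             /* hypothetical Annotate data set */
  %dclanno;                      /* declare the Annotate variables */
  %system(3, 3, 3);              /* xsys, ysys, hsys: % of graphics output area */
  input x y error;
  if error < 10 then do;
    %slice(x, y, 0, 360, 0.5, green, solid, 3);   /* full circle, small dot */
  end;
  else do;
    %slice(x, y, 0, 360, 1.0, red, solid, 3);     /* full circle, larger dot */
  end;
  datalines;
20 30 2.5
50 50 45
80 70 310
;
run;

goptions reset=all border cback=black;
proc ganno annotate=work.anno_demo;
run;
quit;
```

A start angle of 0 with a sweep of 360 makes `%slice` draw a complete filled circle, which is how the slides turn a pie-slice macro into a dot plotter.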

23 Conclusions
• Mapping relies on using Annotate
• Can be displayed in Proc GMAP or GANNO
• GANNO allows simple scaling

24 Session ID #3202