SPF workshop February 2014, UBCO1 CH1. What is what CH2. A simple SPF CH3. EDA CH4. Curve fitting CH5. A first SPF CH6: Which fit is fitter CH7: Choosing.

Slides:



Advertisements
Similar presentations
Pivot Tables. What are Pivot Tables? A pivot table gives you a way to group, summarize and compare data from a spreadsheet You can do some of the same.
Advertisements

How to work with Pivot Tables Step by step instruction.
Example 2.2 Estimating the Relationship between Price and Demand.
EXCEL 101 Level 1 on a MAC CORE (Centre for Organizational Resilience), For Youth Initiative.
Objectives 1.Identify the functions of a spreadsheet 2.Identify how spreadsheets can be used. 3.Explain the difference in columns and rows. 4.Locate specific.
Using Excel to Understand Your Data Clayton County Public Schools Department of Research, Evaluation and Assessment Assistant Principal In-Service.
Introduction to Excel 2007 Part 2: Bar Graphs and Histograms February 5, 2008.
Regression Analysis Using Excel. Econometrics Econometrics is simply the statistical analysis of economic phenomena Here, we just summarize some of the.
Chapter 11 Contingency Table Analysis. Nonparametric Systems Another method of examining the relationship between independent (X) and dependant (Y) variables.
Chapter 6: Pivot Tables Spreadsheet-Based Decision Support Systems Prof. Name Position (123) University Name.
Managing Grades with Excel Viewing Help To view Help 1.Open Excel on your computer. 2.In the top right hand corner of the Excel Screen type in the.
Spreadsheets and Non- Spatial Databases Unit 4: Module 15, Lecture 2- Advanced Microsoft Excel.
Pivot Tables Need HW and exam. Why? A pivot table gives you a way to group, summarize and compare data in a spreadsheet. You can do the same tasks with.
Excel: Pivot Tables Computer Information Technology Section 6-18.
Modeling Data Use for Teacher and School Leader Candidates By: Dr. Jessica Zarian Assistant Professor, Metropolitan College of New York.
1 The Basics of Regression. 2 Remember back in your prior school daze some algebra? You might recall the equation for a line as being y = mx + b. Or maybe.
LSP 120: Quantitative Reasoning and Technological Literacy Section 118 Özlem Elgün.
Using Excel for Data Analysis in CHM 161 Monique Wilhelm.
Experimental Evaluation
LSP 120: Quantitative Reasoning and Technological Literacy Section 903 Özlem Elgün.
CS1100: Computer Science and Its Applications Creating Graphs and Charts in Excel.
Spreadsheet Problem Solving
Problem 1: Relationship between Two Variables-1 (1)
Social Statistics S519: Evaluation of Information Systems.
Spreadsheets. What are the parts Rows are numbered vertically Columns are lettered horizontally Where rows and columns intersect is called a cell A sheet.
Advanced Tables Lesson 9. Objectives Creating a Custom Table When a table template doesn’t suit your needs, you can create a custom table in Design view.
DISCLAIMER This guide is meant to walk you through the physical process of graphing and regression in Excel…. not to describe when and why you might want.
Excel 2007 Part (2) Dr. Susan Al Naqshbandi
Computer Literacy BASICS
Estimation and Hypothesis Testing. The Investment Decision What would you like to know? What will be the return on my investment? Not possible PDF for.
Hypothesis Testing in Linear Regression Analysis
Assignments  1. Grade graphs and conclusions.  2. Introduction to Reaction Time.  3. Begin Pre-Lab of Reaction Time.
Spreadsheets in Finance and Forecasting Presentation 8: Problem Solving.
Introduction to Excel, Word and Powerpoint Developing Valuable Technology Skills! Shawn Koppenhoefer Training in Research in Reproductive Health/Sexual.
 Introduction to MS-Excel Introduction to MS-Excel  Entering data in EXCEL Entering data in EXCEL  Formulas & Functions in EXCEL Formulas & Functions.
CH1. What is what CH2. A simple SPF CH3. EDA CH4. Curve fitting CH5. A first SPF CH6: Which fit is fitter CH7: Choosing the objective function CH8: Theoretical.
Social Statistics: Introduction.  Statistics describes a set of tools and techniques for describing, organizing and interpreting information or data.
1 Performing Spreadsheet What-If Analysis Applications of Spreadsheets.
1 CH1. What is what CH2. A simple SPF CH3. EDA CH4. Curve fitting CH5. A first SPF CH6: Which fit is fitter CH7: Choosing the objective function CH8: Theoretical.
Evaluation of Alternative Methods for Identifying High Collision Concentration Locations Raghavan Srinivasan 1 Craig Lyon 2 Bhagwant Persaud 2 Carol Martell.
Consolidate Consolidate Multiple Worksheets to a Single Sheet in Excel.
Colleague, Excel & Word Best of Friends Presented by: Joan Kaun & Yvonne Nelson College of the Rockies.
Database Systems Microsoft Access Practical #3 Queries Nos 215.
Standard Deviation!. MENWOMEN MEN: _____ WOMEN:______ Let's say we randomly select.
June 21, Objectives  Enable the Data Analysis Add-In  Quickly calculate descriptive statistics using the Data Analysis Add-In  Create a histogram.
1 PivotTables and Pivot Charts Cookie Setton for lesson downloads.
1 7. What to Optimize? In this session: 1.Can one do better by optimizing something else? 2.Likelihood, not LS? 3.Using a handful of likelihood functions.
Technical Science Scientific Tools and Methods Tables and Graphs.
SPF workshop February 2014, UBCO1 CH1. What is what CH2. A simple SPF CH3. EDA CH4. Curve fitting CH5. A first SPF CH6: Which fit is fitter CH7: Choosing.
Chapter 22: Building Multiple Regression Models Generalization of univariate linear regression models. One unit of data with a value of dependent variable.
1 CH1. What is what CH2. A simple SPF CH3. EDA CH4. Curve fitting CH5. A first parametric SPF CH6: Which fit is fitter CH7: Choosing the objective function.
Multiple Regression I 1 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 4 Multiple Regression Analysis (Part 1) Terry Dielman.
PERFORMING CALCULATIONS Microsoft Excel. Excel Formulas A formula is a set of mathematical instructions that can be used in Excel to perform calculations.
Spreadsheets What is Excel?. Objectives 1. Identify the parts of the Excel Screen 2. Identify the functions of a spreadsheet 3. Identify how spreadsheets.
Intermacs Form Download Excel Tutorial Pivot Tables, Graphic Tools, Macros By: Devin Koehl.
SPF workshop UBCO February CH1. What is what CH2. A simple SPF CH3. EDA CH4. Curve fitting CH5. A first SPF CH6: Which fit is fitter CH7: Choosing.
This tutorial will talk you through a very basic workbench queueing simulation. The queueing system modelled is of customers entering an infinite capacity.
Unit 3: Text, Fields & Tables DT2510: Advanced CAD Methods.
Introduction to Excel Lecture 3. Excel basics O Excel is a software program that can make number manipulation easy O It is also referred as a spreadsheet.
Pivot Table Working with Excel (2010). What can we do with a pivot table ?  Creating a pivot table  Connection between variables  Calculate data (sum,
Microsoft Office Tips Pivot tables. Agenda Learn how to create and use PivotTables Q&A Excel 2010 is very similar to 2007, I have tried to demonstrate.
CTS130 Spreadsheet Lesson 6 Working with Math & Trig, Statistical, and Date & Time Functions.
Statistics Descriptive Statistics. Statistics Introduction Descriptive Statistics Collections, organizations, summary and presentation of data Inferential.
Problem Solving Using Excel
Welcome to Week 03 College Statistics
Descriptive Statistics
Probability and Statistics
Analysis and Empirical Results
Spreadsheets.
Pivot tables and charts
Presentation transcript:

SPF workshop February 2014, UBCO1 CH1. What is what CH2. A simple SPF CH3. EDA CH4. Curve fitting CH5. A first SPF CH6: Which fit is fitter CH7: Choosing the objective function CH8: Theoretical stuff Ch9: Adding variables CH10. Choosing a model equation 3. Exploratory Data Analysis Using Colorado data we built a simple SPF and showed how E{μ} and σ{μ} are estimated. In this session: The modeling process. Our data. What an EDA is used for. How to use a Pivot Table. Some obvious observations. How crashes depend on segment length and AADT.

SPF workshop February 2014, UBCO2 The Data The SPF How to make an SFP out of data The Modeller

SPF workshop February 2014, UBCO3 The Data The Modeller What does the data say? Initial EDA Decisions, decisions: Which traits to use? Equation or not? If not equation how to smooth? If smooth, what form? How to estimate parameters? Does it fit? Add a variable?...

SPF workshop February 2014, UBCO4 What questions can an EDA answer? 1.Is there an orderly relationship between a variable and  ? 2. If yes, what function can represent it? The same questions will be asked whenever a new variable is to be added. IEDA and VIEDA EDA is not a collection of tools, it is a quest to understand the data in order to make good modeling decisions

5 This data will be used throughout. How many segments? How many miles? How many I&F crashes? Average AADT? Go to ‘Spreadsheets to accompany Power Points.’ Open #2. Initial EDA on ‘1. Original Data’ workbook 5323 segments, 6029 miles, 21,718 I&F, 52,317 total 2,151 Avg AADT, max 20,000

Zero or no data? What information? 6

SPF workshop February 2014, UBCO7 Holes were plugged, errors corrected but outliers may exist. To get an idea how crashes vary with ‘Segment Length & ‘AADT’ I computed five year average AADT ( ) and sum of I&F crashes for See ‘2. Condensed Data’ workbook

SPF workshop February 2014, UBCO8 The ‘Pivot Table’ spreadsheet tool makes tabulations easy. Move to ‘3.Data & Pivot’ workbook To answer: Create a table with ‘AADT’ bins on the side, ‘Segment Length’ across the top, and various stats in cells. Is there an orderly relationship linking E{μ} to Segment Length and AADT? If yes, what does it look like? 1 2

9 Must include headings row Select: Existing Worksheet, Choose location, Click OK

10 This is what you now see: SPF workshop February 2014, UBCO

11 Drag this To here Now this column opens

SPF workshop February 2014, UBCO12 Right click on any number in the ‘Row Labels’ column to open the ‘menu’. Click on ‘Group’. This will open Change to 0 Change to 20,000 Click OK

SPF workshop February 2014, UBCO13 Now the Row Labels turn to: (If the field list disappeared, click on Row Labels) Now drag ‘Miles’ into the ‘Column Labels’ area

SPF workshop February 2014, UBCO14 Now the columns have to be ‘grouped’ As before, right-click on any column label and select ‘Group’ in menu. Click OK Choose: 0.5 and 20

15 Now that the rows and columns are ready Drag this to here

SPF workshop February 2014, UBCO16 Number of crashes in each bin Where we have a fair number of crashes

17 To get a different summary, right-click anywhere within the table to open: 2. Choose: ‘Count’ 1. Click SPF workshop February 2014, UBCO

18 This gets us the count of segments in each bin. No information Good information

19 To get the estimate of  for a bin divide the number of crashes in previous table by number of segments from this table. The Pivot makes it easy: Right-click again within the table and choose ‘Average’.

20 (After changing the number format): Estimates of 

21 If all we know about a certain two- lane rural Colorado road segment is that it is 3.0 miles long, what is our estimate of its μ? Answer: 4.74 accidents in five years Why? Because this is the estimate of the E{μ} of the population of units with the same known traits. Pause EDA Reflections and morals SPF workshop February 2014, UBCO

22 If we also know about that segment that its AADT=2500, what is now our estimate of its μ? Answer: 6.80 F&I accidents in five years

SPF workshop February 2014, UBCO23 Noticing the obvious: O.O. #1. Populations defined by different traits have different E{μ}‘s. Traits Length=3 miles4.74 Length=3 miles & AADT= O.O. #2. For the of a population to be an unbiased estimate of the μ of a specific unit, the traits of that unit must be the same as the traits that define the population

SPF workshop February 2014, UBCO24 Not so obvious conclusions: SPFs serve various uses: Screening, comparing E{μ}s, estimating μ’s etc. If, e.g., ‘Pavement Friction’ is not in the data for screening but is known for estimation of μ then we need two SPFs, one SPF without ‘Pavement Friction’ and one with. No SPF fits all uses New footing. How does one usually decide about whether to use a trait? How must one decide? How does one usually report results? How must one report?

SPF workshop February 2014, UBCO25 O.O. #3. The more traits define a population the fewer are the segments from which E{μ} is estimated and the larger is its standard error Traits S.L.=3 miles1224/258=4.74√1224/258=±0.14 S.L.=3 miles & AADT= /35=6.80√238/35=±0.44 Another not-so-obvious conclusion: Adding a trait to the SPF will diminish bias but reduce the accuracy of. The right course of action?

SPF workshop February 2014, UBCO 26 Return to EDA Recall that SPFs provide estimates of E{μ} and σ{μ} We use these To estimate these

SPF workshop February 2014, UBCO27 One way to estimate σ{μ} is: So, this is what we need now This is an estimate of this.

SPF workshop February 2014, UBCO28 Sample Variances of crash counts: To get crash count variances, right-click in table, go to ‘Summarize data by’ and then ‘more options’. From the options choose ‘VARp’. Use again ‘3. Data and Pivot’ worksheet

29 What is the effect of Terrain? (Flat, rolling, mountainous)

How to capture ‘Terrain’? LengthAADTMountainousRollingM/R <0.5 miles miles Increasing with Segment Length & AADT? Implication for modeling? 30

SPF workshop February 2014, UBCO31 We asked two questions of the (initial) EDA? 1.Is there an orderly relationship? (If not, do not add trait to SPF) 2. If yes, what function can represent it? Visualization. 3D vs. 2D

32 Orderly? Yes. E{μ} increases with AADT. What function? Not clear. Visualization for AADT (holding Segment Length constant)

Why so much fluctuation? 1.Randomness of crash counts; 2.In many cells have few segments; 3.Differences in unaccounted-for traits. Moral: What we are looking at may not be what we are looking for. Mountainous, curves, steep grades Flat, mild curves, no grade 33

34 Orderly? Yes. Increasing? Yes. What function? Not clear Visualization for Segment Length (holding AADT constant)

SPF workshop February 2014, UBCO35 Summary for section 3. Ingredients for SFP: Data, Experience, Computation, Judgment Unlike in baking, SPF development is not predefined sequence of steps; It is a gradual progress towards a satisfactory result consisting of steps and missteps. EDA provides guidance. It is not something you do once, before computing begins; you use it all the time. More about this later.

SPF workshop February 2014, UBCO36 1.Data come with holes and error; fix these early; 2.The Pivot Table is a useful tool of EDA (as is graphing). 3.Two obvious but important observations: a. When a trait is added E{μ} changes; b. This has implications for model building & reporting c. Adding a trait diminishes the accuracy with which E{μ} is estimated. 4.Segment Length, AADT and Terrain are ‘safety-related’, what functions is not clear. EDA helps to answer two core questions: A.Is the trait ‘safety-related’; B.If yes, what function can represent that relationship.