Exploratory Analysis of Crash Data

Slides:



Advertisements
Similar presentations
Estimation of Means and Proportions
Advertisements

Distributions of sampling statistics Chapter 6 Sample mean & sample variance.
Sampling: Final and Initial Sample Size Determination
Confidence Intervals This chapter presents the beginning of inferential statistics. We introduce methods for estimating values of these important population.
Statistics for Business and Economics
Chapter 11- Confidence Intervals for Univariate Data Math 22 Introductory Statistics.
Spring Sampling Frame Sampling frame: the sampling frame is the list of the population (this is a general term) from which the sample is drawn.
Estimating the Population Mean Assumptions 1.The sample is a simple random sample 2.The value of the population standard deviation (σ) is known 3.Either.
IEEM 3201 Two-Sample Estimation: Paired Observation, Difference.
Chapter Topics Confidence Interval Estimation for the Mean (s Known)
Fall 2006 – Fundamentals of Business Statistics 1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter 7 Estimating Population Values.
Sampling and Sampling Distributions
BCOR 1020 Business Statistics
Standard error of estimate & Confidence interval.
The Chi-square Statistic. Goodness of fit 0 This test is used to decide whether there is any difference between the observed (experimental) value and.
Inference for regression - Simple linear regression
Statistical Intervals for a Single Sample
CONFIDENCE INTERVALS of Means AP STATISTICS, CHAPTER 19 Mrs. Watkins.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 8-1 Confidence Interval Estimation.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
Estimates and Sample Sizes Lecture – 7.4
Estimating Population Parameters Mean Variance (and standard deviation) –Degrees of Freedom Sample size –1 –Sample standard deviation –Degrees of confidence.
Statistical estimation, confidence intervals
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Large sample CI for μ Small sample CI for μ Large sample CI for p
Estimation: Confidence Intervals Based in part on Chapter 6 General Business 704.
7 - 1 © 1998 Prentice-Hall, Inc. Chapter 7 Inferences Based on a Single Sample: Estimation with Confidence Intervals.
Confidence Intervals for Variance and Standard Deviation.
CONFIDENCE INTERVALS.
Point Estimates point estimate A point estimate is a single number determined from a sample that is used to estimate the corresponding population parameter.
Confidence Intervals for a Population Mean, Standard Deviation Unknown.
EGR 252 Ch. 9 Lecture1 JMB th edition Slide 1 Chapter 9: One- and Two- Sample Estimation  Statistical Inference  Estimation  Tests of hypotheses.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Lesoon Statistics for Management Confidence Interval Estimation.
1 Chapter 8 Interval Estimation. 2 Chapter Outline  Population Mean: Known  Population Mean: Unknown  Population Proportion.
Welcome to MM207 - Statistics! Unit 6 Seminar: Inferential Statistics and Confidence Intervals.
ESTIMATION OF THE MEAN. 2 INTRO :: ESTIMATION Definition The assignment of plausible value(s) to a population parameter based on a value of a sample statistic.
Chapter 8 Estimation ©. Estimator and Estimate estimator estimate An estimator of a population parameter is a random variable that depends on the sample.
Chapter 8 Confidence Interval Estimation Statistics For Managers 5 th Edition.
Statistics for Business and Economics 7 th Edition Chapter 7 Estimation: Single Population Copyright © 2010 Pearson Education, Inc. Publishing as Prentice.
Confidence Intervals and Sample Size
Inference for the Mean of a Population
Chapter 6 Inferences Based on a Single Sample: Estimation with Confidence Intervals Slides for Optional Sections Section 7.5 Finite Population Correction.
ESTIMATION.
Point and interval estimations of parameters of the normally up-diffused sign. Concept of statistical evaluation.
Chapter 4: Sampling and Statistical Inference
Chapter 4. Inference about Process Quality
Chapter 9 One and Two Sample Estimation
Normal Distributions and Sampling Distributions
Active Learning Lecture Slides
Statistics in Applied Science and Technology
Georgi Iskrov, MBA, MPH, PhD Department of Social Medicine
Data Analysis for Two-Way Tables
2) Using the data in the table above, compute the sample mean.
Quantitative Methods PSY302 Quiz 6 Confidence Intervals
CONCEPTS OF ESTIMATION
9 Tests of Hypotheses for a Single Sample CHAPTER OUTLINE
Review of Hypothesis Testing
Chapter 7 Estimation: Single Population
Confidence Interval Estimation
Confidence Intervals for a Standard Deviation
IE 355: Quality and Applied Statistics I Confidence Intervals
Inference on the Mean of a Population -Variance Known
DESIGN OF EXPERIMENT (DOE)
Chapter 8 Estimation: Single Population
Chapter 8 Estimation.
Chapter 9 Estimation: Additional Topics
Chapter 7 Estimation: Single Population
Business Statistics For Contemporary Decision Making 9th Edition
Presentation transcript:

Exploratory Analysis of Crash Data Fall 2017

Sampling Frame Sampling frame: the sampling frame is the list of the population (this is a general term) from which the sample is drawn. It is important to understand how the sampling frame defines the population represented. Example: If the study seeks to identify the safety effects of traffic signals, the sample frame should include a sample of signalized intersections in a given geographical area. If a control group is included, the sampling frame will include sites categorized under this group. Sig Int #2 Sig Int #1 Unsig Int #2 Unsig Int #1 Unsig Int #7 Sig Int #9 Signalized Unsignalized

Sampling Frame Map crashes for Year 1 Map crashes for Year 2

Sampling Frame Number of Crashes for Year 1 3 10 5 2 7 1 1 4 2 11 2 6 3 1 8 10 5 1 2 4 6 1 3 Number of Crashes for Year 2 6 3 7

Signalized Intersections Database Sampling Frame Signalized Intersections Database Intersection Number Crashes/Year Traffic Flow – Major Other Site Characteristics* Year 1 11,500 2 3 12,000 10 10,000 … 9 6 6,300 12,200 6,100 * ex: Nb of lanes, actuated signals, exclusive left-turn lane, etc.

Signalized Intersections Database Sampling Frame Signalized Intersections Database Crash Count 1 Intersection 1 Year 1 2 Crash Count 6 3 Intersection 9 Year 1 2

Unsignalized Intersections Database Sampling Frame Unsignalized Intersections Database Intersection Number Crashes/Year Traffic Flow – Major Other Site Characteristics* Year 1 2 8,400 9,000 3 8,500 … 7 7,900 5 8,600 9,400 9 7,800 * ex: Nb of lanes, actuated signals, exclusive left-turn lane, etc.

Histograms

Ogives Source: Washington et al. (2003)

Box Plots

Scatter Diagrams

Scatter Diagrams

Scatter Diagrams

Scatter Diagrams

Bar and Line Charts Source: Washington et al. (2003)

3D Bar Charts

Two by Two Tables Crash Severity / Flow Range < 5,000 5,000-9,999 ≥ 10,000 Fatal 10 12 15 Non-Fatal Injury 100 120 135 PDO 550 700 900

Maps

Maps – GIS Information http://www.saferoadmaps.org/home/

Confidence Intervals Statistics are usually calculated from samples, such as the sample average X, variance s2, the standard deviation s, are used to estimate the population parameters. For instance: X is used as an estimate of the population μx s2 is used as an estimate of the population variance σ2 Interval estimates, defined as Confidence Intervals, allow inferences to be drawn about the population by providing an interval, a lower and upper value, within which the unknown parameter will lie with a prescribed level of confidence. In other words, the true value of the population is assumed to be located within the estimated interval.

Confidence Interval for μ and known σ2 Confidence Intervals Confidence Interval for μ and known σ2 95% CI Any CI 90% CI

Confidence Intervals Compute the 95% confidence interval for the mean vehicular speed. Assume the data is normally distributed. The sample size is 1,296 and the sample mean X is 58.86. Suppose the population standard deviation (σ) has previously been computed to be 5.5.

Confidence Intervals Answer Compute the 95% confidence interval for the mean vehicular speed. Assume the data is normally distributed. The sample size is 1,296 and the sample mean X is 58.86. Suppose the population standard deviation (σ) has previously been computed to be 5.5. Answer

Confidence Interval for μ and unknown σ2 Confidence Intervals Confidence Interval for μ and unknown σ2 95% CI Any CI 90% CI Only valid if n > 30

Confidence Intervals Answer Same example: Compute the 95% confidence interval for the mean vehicular speed. Assume the data is normally distributed. The sample size is 1,296 and the sample mean X is 58.86. Now, suppose a sample standard deviation (s) has previously been computed to be 4.41. Answer

Confidence Interval for a Population Proportion Confidence Intervals Confidence Interval for a Population Proportion The relative frequency in a population may sometimes be of interest. The confidence interval can be computed using the following equation: Where, p is an estimator of the proportion in a population; and, q = 1 – p. Normal approximation is only good when np > 5 and nq > 5. ^ ^ ^

Confidence Intervals A transportation agency located in a small city is interested to know the percentage of people who were involved in a collision during the last calendar year. A random sample is conducted using 1000 drivers. From the sample, it was found that 110 drivers were involved in at least one collision. Compute the 90% CI.

Confidence Intervals Answer A transportation agency located in a small city is interested to know the percentage of people who were involved in a collision during the last calendar year. A random sample is conducted using 1,000 drivers. From the sample, it was estimated that 110 drivers were involved in at least one collision. Compute the 90% CI. Answer

Population Proportion

Confidence Interval Population Variance Confidence Intervals Confidence Interval Population Variance When the population variance is of interest, the confidence interval can be computed using the following equation: Where, X 2 is Chi-Square with n-1 degrees of freedom Assumption: the population is normally distributed.

Confidence Intervals Taking the same example before on the vehicular speed, compute the confidence interval (95%) for variance for the speed distribution. A sample of 100 vehicles has shown a variance equal to 19.51 mph.

Confidence Intervals Taken from Chi-Square Table Answer Taking the same example before on the vehicular speed, compute the confidence interval (95%) for variance for the speed distribution. A sample of 100 vehicles has shown a variance equal to 19.51 mph. Taken from Chi-Square Table Answer

The Chi-Square Goodness-of -fit Non-parametric test useful for observations that are assumed to be normally distributed. Need to have more than 5 observations per cell. The test statistic is If the value on the right-hand side is less than the Chi-Square with n-1 degrees of freedom, the observed and estimated values are the same. If not, the observed and estimated values are not the same. You can also perform this test for two-way contingency tables.