ISE 390 Chapter 1 Introduction Spring 2005 Probability and Statistics for Modern Engineering, Second Edition, Lawrence L. Lapin.

Slides:



Advertisements
Similar presentations
Agricultural and Biological Statistics
Advertisements

Measures of Dispersion
Descriptive Statistics
Chapter 3 Describing Data Using Numerical Measures
B a c kn e x t h o m e Parameters and Statistics statistic A statistic is a descriptive measure computed from a sample of data. parameter A parameter is.
Descriptive Statistics
Introduction to Educational Statistics
B a c kn e x t h o m e Classification of Variables Discrete Numerical Variable A variable that produces a response that comes from a counting process.
Chapter Two Descriptive Statistics McGraw-Hill/Irwin Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
Introductory Mathematics & Statistics
Data observation and Descriptive Statistics
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 3 Describing Data Using Numerical Measures.
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Describing Data: Numerical
Summarizing Scores With Measures of Central Tendency
Data Handling Collecting Data Learning Outcomes  Understand terms: sample, population, discrete, continuous and variable  Understand the need for different.
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Describing Data from One Variable
Census A survey to collect data on the entire population.   Data The facts and figures collected, analyzed, and summarized for presentation and.
Numerical Descriptive Techniques
6.1 What is Statistics? Definition: Statistics – science of collecting, analyzing, and interpreting data in such a way that the conclusions can be objectively.
2011 Summer ERIE/REU Program Descriptive Statistics Igor Jankovic Department of Civil, Structural, and Environmental Engineering University at Buffalo,
© Copyright McGraw-Hill CHAPTER 3 Data Description.
Modified by ARQ, from © 2002 Prentice-Hall.Chap 3-1 Numerical Descriptive Measures Chapter %20ppts/c3.ppt.
1 1 Slide Descriptive Statistics: Numerical Measures Location and Variability Chapter 3 BA 201.
What is Business Statistics? What Is Statistics? Collection of DataCollection of Data –Survey –Interviews Summarization and Presentation of DataSummarization.
Descriptive Statistics: Numerical Methods
Chapter 2 Describing Data.
Biostatistics Class 1 1/25/2000 Introduction Descriptive Statistics.
Skewness & Kurtosis: Reference
An Introduction to Statistics. Two Branches of Statistical Methods Descriptive statistics Techniques for describing data in abbreviated, symbolic fashion.
Numerical Measures. Measures of Central Tendency (Location) Measures of Non Central Location Measure of Variability (Dispersion, Spread) Measures of Shape.
CHAPTER 1 Basic Statistics Statistics in Engineering
Lecture 4 Dustin Lueker.  The population distribution for a continuous variable is usually represented by a smooth curve ◦ Like a histogram that gets.
Unit 3: Averages and Variations Week 6 Ms. Sanchez.
Data Summary Using Descriptive Measures Sections 3.1 – 3.6, 3.8
Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons. 3-1 Business Statistics, 4e by Ken Black Chapter 3 Descriptive Statistics.
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-1 Chapter 2: Displaying and Summarizing Data Part 2: Descriptive Statistics.
LIS 570 Summarising and presenting data - Univariate analysis.
Chapter 2 Describing and Presenting a Distribution of Scores.
Honors Statistics Chapter 3 Measures of Variation.
Statistics Josée L. Jarry, Ph.D., C.Psych. Introduction to Psychology Department of Psychology University of Toronto June 9, 2003.
Chapter 3 EXPLORATION DATA ANALYSIS 3.1 GRAPHICAL DISPLAY OF DATA 3.2 MEASURES OF CENTRAL TENDENCY 3.3 MEASURES OF DISPERSION.
Summarizing Data with Numerical Values Introduction: to summarize a set of numerical data we used three types of groups can be used to give an idea about.
Describing Data: Summary Measures. Identifying the Scale of Measurement Before you analyze the data, identify the measurement scale for each variable.
Statistics -Descriptive statistics 2013/09/30. Descriptive statistics Numerical measures of location, dispersion, shape, and association are also used.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved Chapter 2 Describing and Presenting a Distribution of Scores.
MGT-491 QUANTITATIVE ANALYSIS AND RESEARCH FOR MANAGEMENT OSMAN BIN SAIF Session 18.
Descriptive Statistics
COMPLETE BUSINESS STATISTICS
Exploratory Data Analysis
Figure 2-7 (p. 47) A bar graph showing the distribution of personality types in a sample of college students. Because personality type is a discrete variable.
Statistical Methods Michael J. Watts
Analysis and Empirical Results
Statistical Methods Michael J. Watts
Chapter 5 STATISTICS (PART 1).
CHAPTER 3 Data Description 9/17/2018 Kasturiarachi.
NUMERICAL DESCRIPTIVE MEASURES
Descriptive Statistics
Description of Data (Summary and Variability measures)
Numerical Descriptive Measures
Descriptive Statistics
Basic Statistical Terms
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
Statistics: The Interpretation of Data
LESSON 3: CENTRAL TENDENCY
MBA 510 Lecture 2 Spring 2013 Dr. Tonya Balan 4/20/2019.
Presentation transcript:

ISE 390 Chapter 1 Introduction Spring 2005 Probability and Statistics for Modern Engineering, Second Edition, Lawrence L. Lapin

Class Objectives Develop an understanding of the terms used in the study of Statistics. Understand the difference in populations and samples. Demonstrate capability to collect, organize, present, analyze, and interpret numerical data. Understand Statistical Measures used to describe populations and samples including: Location Central Tendency Mean Median Mode Position Percentiles, Fractiles, and Quartiles Variability Range Variance Standard Deviation Coefficient of Variation

Definitions Statistic (Singular) – A data point (Boeing closed at $45.00.) Statistics (Plural) – The Dow was down 25 points today. (Aggregate of 30 stock prices) Statistics (Discipline) - The collection, organization, presenting, analyzing, and interpreting of numerical data for the purpose of providing information with which to reach a decision or communicate information in the face of uncertainty. Statistical Population – The collection of ALL possible members of a specific group of interest. Sample – A subset of the statistical population used to represent the population from which information is inferred regarding the population. Quantitative Population – Observations are numerical values. Qualitative Population – Observations are attributes described in terms of color, shape, sex, etc.

Types of Quantitative Data Ratio Data – Operations of addition, subtraction, multiplication, and division are applicable, Examples include time or physical data such as minutes, height, weight, etc. Interval Data – Only addition or subtraction is applicable. Temperature scales are examples. Ordinal Data – No arithmetic operations apply. Some relative value is associated with the scale. This includes scales such as hurricane or tornado ratings or the Beaufort wind scale. Nominal Data – Numbers that represent arbitrary code. No relative meaning. Codes for offices or colors are examples.

Analysis of Data Sample Times to Inspect Test Devices for Calibration Number of Inspections Histogram Frequency Polygon Frequency Curve Frequency Distribution

Stem and Leaf Plot Advantage Over Other Displays of Data: No data is lost.

Music preferences in 200 young adults 14 to 19 The pie chart quickly tells you that the majority of students like rap best (50%), and the remaining students prefer alternative (25%), rock and roll (13%), country (10%) and classical (2%). Tip! When drawing a pie chart, ensure that the segments are ordered by size (largest to smallest) and in a clockwise direction. Source (or "Adapted from", if appropriate): Statistics Canada's Internet Site, full URL of source pages, and date of extraction. Statistics Canada information is used with the permission of Statistics Canada. Users are forbidden to copy the data and redisseminate them, in an original or modified form, for commercial purposes, without the expressed permission of Statistics Canada. Information on the availability of the wide range of data from Statistics Canada can be obtained from Statistics Canada's Regional Offices, its World Wide Web site at and its toll-free access number Rap 100 Alternative 50 Rock and Roll 26 Country 20 Classical 4 Relative and Cumulative Frequency Distributions

Ogive for Cumulative Frequency Distribution

Probability Density Function Cumulative Probability Density Function ba Normal Probability Distributions x

SPSS Scatter Diagram for Concrete Strength vs. Pulse Velocity

Scatter Diagram

Scatter Diagrams

Statistical Measures Population descriptors are called Parameters Sample descriptors are called Sample Statistics Location Central Tendency Mean Median Mode Position Percentiles, Fractiles, and Quartiles Variability Range Variance Standard Deviation Coefficient of Variation

Central Tendency – Mean Population Mean  lower case Greek letter Mu) N = number of members in population Sample Mean - n = number of members in sample Sample Mean Approximated from Grouped Data

Central Tendency – Median and Mode Median – That value above and below which an equal number of observations lie. Denoted by m In the case where there is an even number of samples or observations, the Median is the average of the middle two samples The Median is ( )/2 = Mode – The most frequently occurring value is the Mode Where the data is taken from a continuum, the Mode is taken to be the midpoint of the class interval with the highest frequency.

Frequency Distribution Forms and Summary Measures Positional Comparisons of the Three Measures of Central Tendency for Symmetrical and Skewed distributions Bimodal Frequency Distributions

Median – The median will be between the 50 th and 51 st observation. Adding = 47, we know that the Median will be on the stem for 14. Counting in on the 14 stem, we find that the 50 th and 51 st observations are both The Median is ( )/2 = Mode – Look for the fraction that occurs most frequently on one stem. In this data, that is the 2 in the 14 stem. The Mode is Using Stem and Leaf Plot to Find Median and Mode

Finding Percentiles from Ungrouped Data 1.Sort the data in ascending order. 2.Establish the decimal allowable range..1 < d <.9 (d = decimal equivalent of desired percentile; n = sample size) 3.Find the relative position of the desired percentile. ( n + 1) d (10 + 1).1 Let k be the largest integer such that: k < ( n + 1) d 1 < (1.11) k = 1 The desired percentile will lie between X k and X k + 1 or X 1 and X 2 4.Compute the percentile value. Q d = X k + [( n + 1) d - k](X k + 1 – X k ) = [(10 + 1).1 – 1] (5.4 – 5.3) Q.1 = [(11).1 – 1](.1) = (.1)(.1) = 5.31 k < (11).25 = 2.75 k = 2 Q.25 = [(11).25 – 2]( ) = (0.75)(.3) = = 5.63 k < (11).5 = 5.5 k = 5 Q..50 = [(11).5 – 5](6.1 – 6.1) = (0.5)(0) = = 6.10 k < (11).75 =8. 25 k = 8 Q.75 = [(11).75 – 8]( ) = (0. 25)(.1) = =

Percentiles, Fractiles, and Quartiles Percentile – The point below which the stated percentage lies. Example: 17 of 20 observations fall below 23. What percentile is 23? 17/20 = is the 85 th percentile. Fractile - The point below which the stated fraction lies. Example: 17 of 20 observations fall below 23. What fractile is 23? 17/20 = is the.85-fractile. Quartile – Divides data into four groups of equal frequency First Quartile – Same as the 25 th percentile (.25-fractile) Second Quartile - Same as the 50 th percentile (.50-fractile) and also the Median. Third Quartile - Same as the 75 th percentile (.75-fractile)

First Quartile Second Quartile Third Quartile Fourth Quartile ( ).25 = Q.25 = (25.25 – 25.0)(13.1 – 13.0) = ( ).5 = 50.5 Q.5 = (50.5 – 50.0)(14.2 – 14.2) = 14.2 ( ).75 = Q.75 = (75.75 – 75.0)(15.2 – 15.0) = Finding Percentiles from Ungrouped Data

Finding Percentiles Graphically 14.2 Q.50

Finding Percentiles from Grouped Data (Raw data unavailable, but grouped into frequency distribution) Q d = X k + [( n + 1) d - k] ( ) (73 + 1)(.25) = 18.5 The greatest cumulative frequency not exceeding 18.5 is 7 for the first interval, so k = 7. Q.25 will fall in the second interval, and its cumulative frequency is 30. X k + 1 – X k h - k Q.25 = [( ] ( ) = –

Variability* *The term “Dispersion” is also used.

Variability – Interquartile Range (Difference in the third quartile, Q.75 and the first quartile, Q.25 ) From Slide 16; Q.25 = 5.63, Q.50 = 6.10, Q.75 = 6.43 Interquartile Range = 6.43 – 5.63 = 0.80 Interquartile Range Range Boxplot

Variance and Standard Deviation (Measures deviation of individual observations around the mean.) Population Variance Standard Deviation Sample Calculation by Hand Approximated from Grouped Data  – Lower case Greek letter  Sigma The sum of the deviations of the samples about the sample mean = 0.

Composite Summary Measures Coefficient of Variation Population – Sample - Coefficient of Skewness – Provides direction and degree of skewness of frequency distribution m = Median SK > 0, Positively Skewed SK < 0, Negatively Skewed SK = 0, No Skew Kurtosis – Measures the “peakedness”of a distribution A.Which has the greatest Coefficient of Variation? B.Mesokurtic – Meso means intermediate. C.Leptokurtic – Lepto means slender. D.Platykurtic – Platy means flat.

Proportion Population Proportion Sample Proportion