Presentation is loading. Please wait.

Presentation is loading. Please wait.

MAT 135 Introductory Statistics and Data Analysis Adjunct Instructor

Similar presentations


Presentation on theme: "MAT 135 Introductory Statistics and Data Analysis Adjunct Instructor"— Presentation transcript:

1 MAT 135 Introductory Statistics and Data Analysis Adjunct Instructor
Kenneth R. Martin Lecture 3 September 14, 2016 Confidential - Kenneth R. Martin

2 Confidential - Kenneth R. Martin
Agenda Housekeeping Readings Collect HW #2 Review HW #2 Chapter 1, 14, 10, & 2 Quiz #1 Confidential - Kenneth R. Martin

3 Confidential - Kenneth R. Martin
Housekeeping Read, Chapter 1.1 – 1.4 Read, Chapter 14.1 – 14.2 Read, Chapter 10.1 Read, Chapter 2 Confidential - Kenneth R. Martin

4 Confidential - Kenneth R. Martin
Housekeeping Collect HW #2 Confidential - Kenneth R. Martin

5 Confidential - Kenneth R. Martin
Housekeeping Review HW #2 Confidential - Kenneth R. Martin

6 Confidential - Kenneth R. Martin
Housekeeping Quiz #1 Confidential - Kenneth R. Martin

7 Confidential - Kenneth R. Martin
Review What have we learned so far ? Confidential - Kenneth R. Martin

8 Statistics – Application to Research
Confidential - Kenneth R. Martin

9 Confidential - Kenneth R. Martin
Statistics Why collect samples ? Often impractical to collect all the data from the entire population (i.e. U.S. census). Some test methods are destructive – we wouldn’t have any products or services left to ship to a customer! Too expensive to sample the entire population. Don’t have to collect 100% of the population ! We can use inferential statistics to make sound conclusions about the population. Population and Sample Sampling Scheme POPULATION SAMPLE Measure Data! Use data from the SAMPLE to make conclusions about the POPULATION Confidential - Kenneth R. Martin

10 Confidential - Kenneth R. Martin
Summarizing Data Data The experiment was performed and the data was collected, now what ? We have the raw data … is it useable ? Confidential - Kenneth R. Martin

11 Confidential - Kenneth R. Martin
Summarizing Data We will “describe” the data either: Numerically (analytically as single values); Graphically (tables, graphs, charts, etc.); Both. Confidential - Kenneth R. Martin

12 Confidential - Kenneth R. Martin
Summarizing Data Data Raw Data: represents a listing of all the observed values Frequency Distribution: represents a lumping together of the observed values into “intervals” (classes), where the frequency of each score can fall into any given interval. These “Intervals” can be either quantitative or qualitative in nature. Confidential - Kenneth R. Martin

13 Confidential - Kenneth R. Martin
Summarizing Data Frequency A frequency is the number of times or how often a category, score, or range of scores occurs Confidential - Kenneth R. Martin

14 Confidential - Kenneth R. Martin
Summarizing Data Frequency Distribution (ungrouped) When the data set or its range is small A summarization display of how the data points (observations) occur in a data set. Confidential - Kenneth R. Martin

15 Confidential - Kenneth R. Martin
Summarizing Data Frequency Distribution / Histogram Confidential - Kenneth R. Martin

16 Confidential - Kenneth R. Martin
Summarizing Data Frequency Distribution (ungrouped) Column Chart from Excel Confidential - Kenneth R. Martin

17 Confidential - Kenneth R. Martin
Summarizing Data Frequency Distribution (ungrouped) Column Chart from Minitab Confidential - Kenneth R. Martin

18 Confidential - Kenneth R. Martin
Summarizing Data Frequency Distribution (ungrouped) Dot Plot from Minitab Confidential - Kenneth R. Martin

19 Confidential - Kenneth R. Martin
Summarizing Data Frequency Distribution (categorical) A summarization display for data that can be placed in specified categories. Data can be from categorical / nominal or ordinal level data. Confidential - Kenneth R. Martin

20 Confidential - Kenneth R. Martin
Summarizing Data Frequency Distribution (categorical) Make a Table for the data. Tally up the frequencies of categories. Find the percentage of each class / category: % = (f / n)*100. Confidential - Kenneth R. Martin

21 Confidential - Kenneth R. Martin
Summarizing Data Frequency Distribution (categorical) Column Chart from Excel Confidential - Kenneth R. Martin

22 Confidential - Kenneth R. Martin
Summarizing Data Frequency Distribution (categorical) Column Chart from Minitab Confidential - Kenneth R. Martin

23 Confidential - Kenneth R. Martin
Summarizing Data Frequency Distribution (categorical) Dot Plot from Minitab Confidential - Kenneth R. Martin

24 Confidential - Kenneth R. Martin
Summarizing Data Frequency Distribution (grouped) A summarization display of a larger range of data and how the data points (observations) occur within a “interval / class” of observed values or groups of observed values. Interval (Bin) / Class Width: range of possible values of each interval. Open Interval: an interval with no defined upper or lower boundary Confidential - Kenneth R. Martin

25 Confidential - Kenneth R. Martin
Summarizing Data Frequency Distribution (grouped) Lower Boundary / class limit: the smallest possible value in each interval / class of a frequency distribution Upper Boundary / class limit : the largest possible value in each interval / class of a frequency distribution Confidential - Kenneth R. Martin

26 Confidential - Kenneth R. Martin
Summarizing Data Frequency Distribution (grouped) Rules / Guidelines: General rule is 5-20 “bins” / classes. Bins must be mutually exclusive. Bins must be continuous. Bins / classes must be exhaustive of data. Bins must be equal width, except for open-ended bins. Confidential - Kenneth R. Martin

27 Confidential - Kenneth R. Martin
Summarizing Data Frequency Distribution (grouped) Confidential - Kenneth R. Martin

28 Confidential - Kenneth R. Martin
Summarizing Data Frequency Distribution (grouped) Confidential - Kenneth R. Martin

29 Confidential - Kenneth R. Martin
Summarizing Data Frequency Distribution (grouped) Confidential - Kenneth R. Martin

30 Confidential - Kenneth R. Martin
Summarizing Data Frequency Distribution (grouped) Confidential - Kenneth R. Martin

31 Confidential - Kenneth R. Martin
Summarizing Data Relative Frequency Distribution: summary display that distributes the proportion of values in each interval Cumulative Frequency Distribution: summary display that distributes the sum of frequencies across a series of intervals Relative Cumulative Frequency Distribution: summary display that distributes the sum of relative frequencies across the intervals Confidential - Kenneth R. Martin

32 Confidential - Kenneth R. Martin
Summarizing Data Confidential - Kenneth R. Martin

33 Confidential - Kenneth R. Martin
Summarizing Data Histogram Displays the distribution of measurement (continuous) or discrete data into a bar graph form, for interpretation and presentation. Maps data, categorizes and counts the number of data points into “bins” (bars) Bins (on x-axis) are adjacent and non-overlapping and are displayed as the same width Confidential - Kenneth R. Martin

34 Confidential - Kenneth R. Martin
Summarizing Data Histogram (from Six Sigma) Confidential - Kenneth R. Martin

35 Confidential - Kenneth R. Martin
Summarizing Data Histogram Confidential - Kenneth R. Martin

36 Confidential - Kenneth R. Martin
Summarizing Data Histogram Confidential - Kenneth R. Martin

37 Confidential - Kenneth R. Martin
Summarizing Data Frequency Distribution & Histogram Used to analyze data, their associated distributions and to solve problems Reveals Central Tendency, Variation of data spread & Capability Reveals if the data is skewed from center (shape) Reveals the # of modes of data (humps), and any gaps of data Confidential - Kenneth R. Martin

38 Confidential - Kenneth R. Martin
Summarizing Data Histogram, steps of construction Collect and present all data. Count the total number of data points, n. Determine the range, R, of the entire data set. Establish the number of bins, K, from n. Charts available to refer, typically 5-20 bins Determine the bin width, W. W = R / K Make sure W has 1 more decimal point than in data set Confidential - Kenneth R. Martin

39 Confidential - Kenneth R. Martin
Summarizing Data Histogram, steps of construction Determine the bin boundaries (end points of bins) Easiest to start with lowest # in data set and add bin width Construct a frequency table with bin size, bin midpoint, and frequency count of data in each bin. “Draw” the histogram. The histogram is nothing more than a visual frequency table. Confidential - Kenneth R. Martin

40 Confidential - Kenneth R. Martin
Summarizing Data Process aim = 9.0 minutes Spec = + / minutes n = 125 R = 1.7 K = 10 W = 1.7 / 10 = 0.17 Round up to 0.20 Assure 1 + decimal point First bin Min value = 9.0 (+0.20 width) (all data <9.2) Histogram Confidential - Kenneth R. Martin

41 Confidential - Kenneth R. Martin
Summarizing Data Frequency Distribution & Histogram Construct a frequency table and a histogram. First bin Min = 9.0 (+0.20) (all data <9.2) Second bin data Etc. Confidential - Kenneth R. Martin

42 Confidential - Kenneth R. Martin
Summarizing Data Confidential - Kenneth R. Martin

43 Confidential - Kenneth R. Martin
Summarizing Data Confidential - Kenneth R. Martin

44 Confidential - Kenneth R. Martin
Summarizing Data Histogram Confidential - Kenneth R. Martin


Download ppt "MAT 135 Introductory Statistics and Data Analysis Adjunct Instructor"

Similar presentations


Ads by Google